Categories: FAANG

Regularized Training of Nearest Neighbor Language Models

Including memory banks in a natural language processing architecture increases model capacity by equipping it with additional data at inference time. In this paper, we build upon kNN-LM, which combines a pre-trained language model with an exhaustive kNN search through the training data (the memory bank) to achieve state-of-the-art results. We investigate whether kNN-LM performance can be improved by instead training the LM with the knowledge that a kNN search will be applied post hoc. Our method achieves significant improvements on language modeling tasks on WIKI-2 and WIKI-103. The main…
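To make the kNN-LM setup mentioned above concrete, here is a minimal sketch of the inference-time step as it is usually described: the base LM's next-token distribution is interpolated with a distribution built from the nearest entries in the memory bank. It does not show the regularized training method of this paper, which the excerpt does not detail. The function name `knn_lm_probs`, the parameters `k`, `lam`, and `temperature`, and the toy data are all assumptions made for illustration.

```python
import numpy as np


def knn_lm_probs(query, keys, values, p_lm, k=4, lam=0.25, temperature=1.0):
    """Interpolate a base LM distribution with a kNN distribution (illustrative sketch)."""
    # Exhaustive search: squared L2 distance from the query to every stored key.
    dists = np.sum((keys - query) ** 2, axis=1)
    # Indices of the k closest memory entries.
    nn = np.argsort(dists)[:k]
    # Softmax over negative distances; shifting by the minimum keeps exp() stable.
    weights = np.exp(-(dists[nn] - dists[nn].min()) / temperature)
    weights /= weights.sum()
    # Sum neighbor weights onto the token id each neighbor stores.
    p_knn = np.zeros_like(p_lm)
    np.add.at(p_knn, values[nn], weights)
    # Linear interpolation between the kNN and base LM distributions.
    return lam * p_knn + (1.0 - lam) * p_lm


# Toy memory bank (hypothetical data): 50 context vectors, each with a next-token id.
rng = np.random.default_rng(0)
vocab_size, dim, n_entries = 5, 8, 50
keys = rng.normal(size=(n_entries, dim))
values = rng.integers(0, vocab_size, size=n_entries)
p_lm = np.full(vocab_size, 1.0 / vocab_size)  # stand-in for the base LM output

# A query close to stored entry 3.
query = keys[3] + 0.01 * rng.normal(size=dim)
probs = knn_lm_probs(query, keys, values, p_lm)

# With k=1 and lam=1 the result collapses onto the nearest entry's stored token.
one_hot = knn_lm_probs(keys[3], keys, values, p_lm, k=1, lam=1.0)
```

In this sketch `lam` controls how much the memory bank is trusted relative to the base model; in practice it would be tuned on held-out data, and the exhaustive distance computation would be replaced by an approximate nearest-neighbor index for large memory banks.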

AI Generated Robotic Content

Published by

AI Generated Robotic Content

Tags: ai/ml, faang

4 years ago
