Categories: FAANG

When can transformers reason with abstract symbols?

We investigate the capabilities of transformer models on relational reasoning tasks. In these tasks, models are trained on a set of strings encoding abstract relations, and are then tested out-of-distribution on data that contains symbols that did not appear in the training dataset. We prove that for any relational reasoning task in a large family of tasks, transformers learn the abstract relations and generalize to the test set when trained by gradient descent on sufficiently large quantities of training data. This is in contrast to classical fully-connected networks, which we prove fail to…

AI Generated Robotic Content

Next Beware of AI-based deception detection, warns scientific community »

Previous « How generative AI will revolutionize supply chain

Share

Published by

AI Generated Robotic Content

Tags: ai/mlfaang

1 year ago

Recent Posts

Image

I’ll definitely try this one out later… oh… it’s already obsolete

submitted by /u/Dry-Resist-4426 [link] [comments]

8 hours ago

AI/ML News

From hallucinations to hardware: Lessons from a real-world computer vision project gone sideways

What we tried, what didn't work and how a combination of approaches eventually helped us…

9 hours ago

AI/ML News

OpenAI Loses 4 Key Researchers to Meta

Mark Zuckerberg has been working to poach talent from rival labs for his new superintelligence…

9 hours ago

Image

Inpainting style edits from prompt ONLY with the fp8 quant of Kontext, this is mindblowing in how simple it is

submitted by /u/OrangeFluffyCatLover [link] [comments]

1 day ago

FAANG

Evaluating Long Range Dependency Handling in Code Generation LLMs

As language models support larger and larger context sizes, evaluating their ability to make effective…

1 day ago

FAANG

AWS costs estimation using Amazon Q CLI and AWS Cost Analysis MCP

Managing and optimizing AWS infrastructure costs is a critical challenge for organizations of all sizes.…

1 day ago

L