Categories: FAANG

When can transformers reason with abstract symbols?

We investigate the capabilities of transformer models on relational reasoning tasks. In these tasks, models are trained on a set of strings encoding abstract relations, and are then tested out-of-distribution on data that contains symbols that did not appear in the training dataset. We prove that for any relational reasoning task in a large family of tasks, transformers learn the abstract relations and generalize to the test set when trained by gradient descent on sufficiently large quantities of training data. This is in contrast to classical fully-connected networks, which we prove fail to…
AI Generated Robotic Content

Recent Posts

Google’s new AI algorithm reduces memory 6x and increases speed 8x

https://arstechnica.com/ai/2026/03/google-says-new-turboquant-compression-can-lower-ai-memory-usage-without-sacrificing-quality/ submitted by /u/pheonis2 [link] [comments]

13 hours ago

LlamaAgents Builder: From Prompt to Deployed AI Agent in Minutes

Creating an AI agent for tasks like analyzing and processing documents autonomously used to require…

13 hours ago

To Infinity and Beyond: Tool-Use Unlocks Length Generalization in State Space Models

State Space Models (SSMs) have become the leading alternative to Transformers for sequence modeling. Their…

13 hours ago

How to build production-ready AI agents with Google-managed MCP servers

As ​​developers build AI agents with more sophisticated reasoning systems, they require higher-quality fuel–in the…

13 hours ago

AI Research Is Getting Harder to Separate From Geopolitics

A policy change announced by NeurIPS, the world’s leading AI research conference, drew widespread backlash…

14 hours ago

Brain-inspired AI hardware helps autonomous devices operate efficiently and independently

The human brain constantly makes decisions. It requires minimal power to move bodies in a…

14 hours ago