The Super Weight in Large Language Models

Recent work has shown a surprising result: a small fraction of Large Language Model (LLM) parameter outliers is disproportionately important to the quality of the model. LLMs contain billions of parameters, so even a small fraction, such as 0.01%, translates to hundreds of thousands of parameters. In this work, we present an even more surprising finding: pruning as few as a single parameter can destroy an LLM’s ability to generate text, increasing perplexity by three orders of magnitude and reducing zero-shot accuracy to the level of random guessing. We propose a data-free method for identifying such parameters…
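To make the scale concrete, here is a minimal sketch of the arithmetic and of what pruning a single parameter means. It is not the method proposed in the work, which the excerpt above does not describe. The 7-billion-parameter model size, the toy 8×8 weight matrix, the planted outlier value, and all function names are assumptions made purely for illustration.

```python
import math
import random


def outlier_count(total_params: int, fraction: float) -> int:
    """Number of parameters covered by a given fraction of the model."""
    return round(total_params * fraction)


def matvec(weights, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


def largest_magnitude_index(weights):
    """Return (row, col) of the entry with the largest absolute value."""
    positions = ((i, j) for i in range(len(weights)) for j in range(len(weights[0])))
    return max(positions, key=lambda ij: abs(weights[ij[0]][ij[1]]))


def prune_single_weight(weights, row, col):
    """Return a copy of the matrix with exactly one entry set to zero."""
    pruned = [r[:] for r in weights]
    pruned[row][col] = 0.0
    return pruned


# Assumption: a hypothetical 7-billion-parameter model; 0.01% is 0.0001.
n_outliers = outlier_count(7_000_000_000, 0.0001)  # 700,000 parameters

# Toy layer: small random weights plus one planted large-magnitude entry.
rng = random.Random(0)
W = [[rng.gauss(0.0, 0.02) for _ in range(8)] for _ in range(8)]
W[3][5] = 5.0  # hypothetical outlier, chosen only for this illustration

x = [1.0] * 8
row, col = largest_magnitude_index(W)
W_pruned = prune_single_weight(W, row, col)

# Compare the layer output before and after zeroing that single entry.
before = matvec(W, x)
after = matvec(W_pruned, x)
change = math.sqrt(sum((b - a) ** 2 for b, a in zip(before, after)))
relative_change = change / math.sqrt(sum(b * b for b in before))
print(n_outliers, (row, col), round(relative_change, 3))
```

In this toy setting one entry out of 64 accounts for almost all of the output norm, so zeroing it changes the output far more than zeroing any other single entry would. Whether and where such parameters occur in a real LLM is an empirical question that this sketch does not address.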
AI Generated Robotic Content
