The “Super Weight”: How Even a Single Parameter Can Determine a Large Language Model’s Behavior

A recent paper from Apple researchers, “The Super Weight in Large Language Models,” reveals that an extremely small subset of parameters in an LLM (in some cases, a single parameter) can exert disproportionate influence on the model’s overall functionality (see Figure 1). The work highlights the critical role of these “super weights” and their corresponding “super activations,” offering new insight into LLM architecture and opening avenues for efficient model compression. The paper provides full technical details and experimental results; in this post, we give a high-level overview of the key findings.
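To make the single-parameter claim concrete, below is a minimal sketch of the kind of ablation probe the finding implies: zero out one scalar weight and measure how perplexity on a probe sentence changes. This is not the paper’s detection method; the model name, target layer, and weight coordinate are illustrative assumptions, not the location of an actual super weight.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Illustrative setup: any small causal LM works; "gpt2" is an assumption.
model_name = "gpt2"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()

text = "The quick brown fox jumps over the lazy dog."
inputs = tokenizer(text, return_tensors="pt")

def perplexity() -> float:
    """Perplexity of the model on the probe text (lower is better)."""
    with torch.no_grad():
        loss = model(**inputs, labels=inputs["input_ids"]).loss
    return float(torch.exp(loss))

baseline = perplexity()

# Pick one weight to ablate. The paper reports super weights in the
# down-projection of early MLP layers; this coordinate is a placeholder.
param = model.transformer.h[2].mlp.c_proj.weight  # hypothetical target layer
row, col = 0, 0                                   # hypothetical coordinate

original = param.data[row, col].item()
param.data[row, col] = 0.0        # ablate a single scalar parameter
ablated = perplexity()
param.data[row, col] = original   # restore the model

print(f"baseline ppl: {baseline:.2f}, after zeroing one weight: {ablated:.2f}")
```

According to the paper, zeroing a true super weight degrades output quality dramatically, while zeroing a typical weight (even a large-magnitude one) changes perplexity only negligibly.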