
Controlling Language and Diffusion Models by Transporting Activations

The increasing capabilities of large generative models and their ever more widespread deployment have raised concerns about their reliability, safety, and potential misuse. To address these issues, recent work has proposed controlling model generation by steering model activations to induce or prevent the emergence of concepts or behaviours in the generated output. In this paper we introduce Activation Transport (AcT), a general framework for steering activations, guided by optimal transport theory, that generalizes many previous activation-steering methods. AcT is…
