Categories: AI/ML News

Filtered data stops openly-available AI models from performing dangerous tasks, study finds

Researchers from the University of Oxford, EleutherAI, and the UK AI Security Institute have reported a major advance in safeguarding open-weight language models. By filtering potentially harmful knowledge out of the training data, the researchers were able to build models that resist subsequent malicious updates, a property that is especially valuable in sensitive domains such as biothreat research.

Published by

AI Generated Robotic Content

12 months ago
