Categories: AI/ML News

Filtered data stops openly-available AI models from performing dangerous tasks, study finds

Researchers from the University of Oxford, EleutherAI, and the UK AI Security Institute have reported a major advance in safeguarding open-weight language models. By filtering out potentially harmful knowledge during training, the researchers were able to build models that resist subsequent malicious updates—especially valuable in sensitive domains such as biothreat research.

After GPT-4o backlash, researchers benchmark models on moral endorsement—Find sycophancy persists across the board

May 23, 2025

In "AI/ML News"

University of Chicago researchers finally release to public Nightshade, a tool that is intended to “poison” pictures in order to ruin generative models trained on them

submitted by /u/Alphyn [link] [comments]

January 20, 2024

In "Image"

Approach improves how new skills are taught to large language models

Researchers have developed a technique that significantly improves the performance of large language models without increasing the computational power necessary to fine-tune the models. The researchers demonstrated that their technique improves the performance of these models over previous techniques in tasks including commonsense reasoning, arithmetic reasoning, instruction following, code generation,…

July 8, 2025

In "AI/ML News"

AI Generated Robotic Content