Categories: FAANG

Training Software Engineering Agents and Verifiers with SWE-Gym

We present SWE-Gym, the first environment for training real-world software engineering (SWE) agents. SWE-Gym contains 2,438 real-world Python task instances, each comprising a codebase with an executable runtime environment, unit tests, and a task specified in natural language. We use SWE-Gym to train language-model-based SWE agents, achieving up to 19% absolute gains in resolve rate on the popular SWE-Bench Verified and Lite test sets. We also experiment with inference-time scaling through verifiers trained on agent trajectories sampled from SWE-Gym. When combined with our fine-tuned SWE…
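To make the two quantities in the abstract concrete, here is a minimal sketch of how a resolve rate and a verifier-guided selection step could be computed. It is not SWE-Gym's code: the `Trajectory` class, its fields, and the `best_of_n` helper are hypothetical names chosen for illustration, and the verifier is assumed to be any function that maps an agent attempt to a score.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass
class Trajectory:
    """One agent attempt at a task (hypothetical structure, not SWE-Gym's schema)."""
    task_id: str
    patch: str
    resolved: bool  # True if the task's unit tests pass with this patch applied


def resolve_rate(attempts: Sequence[Trajectory]) -> float:
    """Fraction of attempts that resolve their task; 0.0 for an empty input."""
    if not attempts:
        return 0.0
    return sum(t.resolved for t in attempts) / len(attempts)


def best_of_n(
    candidates: Sequence[Trajectory],
    verifier: Callable[[Trajectory], float],
) -> Trajectory:
    """Pick the candidate with the highest verifier score (assumed selection rule)."""
    return max(candidates, key=verifier)


# Toy usage: two sampled attempts at the same task and a stand-in verifier.
samples = [
    Trajectory("task-1", "patch A", resolved=False),
    Trajectory("task-1", "patch B", resolved=True),
]
toy_scores = {"patch A": 0.2, "patch B": 0.9}  # made-up scores for illustration
chosen = best_of_n(samples, lambda t: toy_scores[t.patch])
```

In this sketch, an "absolute gain" is simply the difference between two resolve rates in percentage points, as opposed to a relative improvement over the baseline.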


AI Generated Robotic Content


Published by

AI Generated Robotic Content

Tags: AI/ML, FAANG

4 months ago
