Categories: FAANG

Training Software Engineering Agents and Verifiers with SWE-Gym

We present SWE-Gym, the first environment for training real-world software engineering (SWE) agents. SWE-Gym contains 2,438 real-world Python task instances, each comprising a codebase with an executable runtime environment, unit tests, and a task specified in natural language. We use SWE-Gym to train language model based SWE agents, achieving up to 19% absolute gains in resolve rate on the popular SWE-Bench Verified and Lite test sets. We also experiment with inference-time scaling through verifiers trained on agent trajectories sampled from SWE-Gym. When combined with our fine-tuned SWE…
AI Generated Robotic Content

Recent Posts

Flux.2-Klein pipeline for real-time webcam stream processing in 30 FPS

I have built a pipeline based on the Flux.2-Klein-4B model that allows processing of a…

11 hours ago

Implementing Permission-Gated Tool Calling in Python Agents

AI agents have evolved beyond passive chatbots.

11 hours ago

Adaptive Parallel Reasoning: The Next Paradigm in Efficient Inference Scaling

Overview of adaptive parallel reasoning. What if a reasoning model could decide for itself when…

11 hours ago

Scaling ArchUnit with Nebula ArchRules

By John Burns and Emily YuanIntroductionAt Netflix, we operate using a polyrepo strategy with tens of…

11 hours ago

Halliburton enhances seismic workflow creation with Amazon Bedrock and Generative AI

Seismic data analysis is an essential component of energy exploration, but configuring complex processing workflows…

11 hours ago

Top Megelin Deals for Laser and LED Therapy Devices (2026)

This Mother's Day, Megelin is slashing prices on its best-selling laser and LED devices.

12 hours ago