FAANG

Training Software Engineering Agents and Verifiers with SWE-Gym

We present SWE-Gym, the first environment for training real-world software engineering (SWE) agents. SWE-Gym contains 2,438 real-world Python task instances,…

2 weeks ago

Iterative fine-tuning on Amazon Bedrock for strategic model improvement

Organizations often face challenges when implementing single-shot fine-tuning approaches for their generative AI models. The single-shot fine-tuning method involves selecting…

2 weeks ago

Announcing prompt management in the Vertex AI SDK

As generative AI applications grow in sophistication, development workflows become more fragmented. Although AI can be a force multiplier, teams…

2 weeks ago

Introducing Veo 3.1 and advanced creative capabilities

We’re rolling out significant updates to Veo that give people even more creative control.

2 weeks ago

Agentic RAG for Software Testing with Hybrid Vector-Graph and Multi-Agent Orchestration

We present an approach to software testing automation using Agentic Retrieval-Augmented Generation (RAG) systems for Quality Engineering (QE) artifact creation.…

2 weeks ago

Transforming enterprise operations: Four high-impact use cases with Amazon Nova

Since the launch of Amazon Nova at AWS re:Invent 2024, we have seen adoption trends across industries, with notable gains…

2 weeks ago

The ultimate prompting guide for Veo 3.1

If a picture is worth a thousand words, a video is worth a million. For creators, generative video holds the…

2 weeks ago

Build a device management agent with Amazon Bedrock AgentCore

The proliferation of Internet of Things (IoT) devices has transformed how we interact with our environments, from homes to industrial…

2 weeks ago

How AI can scale customer experience — online and IRL

Customer service teams at fast-growing companies face a challenging reality: customer inquiries are growing exponentially, but scaling human teams at…

2 weeks ago

FS-DFM: Fast and Accurate Long Text Generation with Few-Step Diffusion Language Models

Autoregressive language models (ARMs) deliver strong likelihoods, but are inherently serial: they generate one token per forward pass, which limits…

2 weeks ago