Adaptive Knowledge Distillation for Device-Directed Speech Detection

Device-directed speech detection (DDSD) is a binary classification task that separates the user’s queries to a voice assistant (VA) from background speech or side conversations. This is important for achieving naturalistic user experience. To this end, we propose knowledge distillation (KD) to enhance DDSD accuracy while ensuring efficient deployment. Specifically, we introduce a novel adaptive …

ML 17898 DTDC Chatbot 17Apr2025

The DIVA logistics agent, powered by Amazon Bedrock

DTDC is India’s leading integrated express logistics provider, operating the largest network of customer access points in the country. DTDC’s technology-driven logistics solutions cater to a wide range of customers across diverse industry verticals, making them a trusted partner in delivering excellence. DTDC Express Limited receives over 400,000 customer queries each month, ranging from tracking …

The Interspeech 2025 Speech Accessibility Project Challenge

While the last decade has witnessed significant advancements in Automatic Speech Recognition (ASR) systems, performance of these systems for individuals with speech disabilities remains inadequate, partly due to limited public training data. To bridge this gap, the 2025 Interspeech Speech Accessibility Project (SAP) Challenge was launched, utilizing over 400 hours of SAP data collected and …

Asana Q index Arch diagram

Pioneering AI workflows at scale: A deep dive into Asana AI Studio and Amazon Q index collaboration

Organizations today face a critical challenge: managing an ever-increasing volume of tasks and information across multiple systems. Although traditional task management tools help organize work, they often fall short in delivering the intelligence needed for truly efficient operations. Today, we’re excited to announce the integration of Asana AI Studio with Amazon Q index, bringing generative …

ML 19045 1

Build an AI assistant using Amazon Q Business with Amazon S3 clickable URLs

Organizations need user-friendly ways to build AI assistants that can reference enterprise documents while maintaining document security. This post shows how to use Amazon Q Business to create an AI assistant that provides clickable URLs to source documents stored in Amazon Simple Storage Service (Amazon S3), to support secure document access and verification. Amazon Q …

1 YdkRyqo

Redefining enterprise data with agents and AI-native foundations

The world is not just changing; it’s being re-engineered in real-time by data and AI. The way we interact with data is undergoing a fundamental transformation, moving beyond human-led analysis to a collaborative partnership with intelligent agents. This is the agentic shift, a new era where specialized AI agents work autonomously and cooperatively to unlock …

Ambisonics Super-Resolution Using A Waveform-Domain Neural Network

Ambisonics is a spatial audio format describing a sound field. First-order Ambisonics (FOA) is a popular format comprising only four channels. This limited channel count comes at the expense of spatial accuracy. Ideally one would be able to take the efficiency of a FOA format without its limitations. We have devised a data-driven spatial audio …