Improving Voice Trigger Detection with Metric Learning

Voice trigger detection is an important task that enables a voice assistant to be activated when a target user speaks a keyword phrase. A detector is typically trained on speech data without regard to speaker identity and then used for voice trigger detection. However, such a speaker-independent voice trigger detector typically suffers from performance degradation on speech from underrepresented groups, such as accented speakers. In this work, we propose a novel voice trigger detector that can use a small number of utterances from a target speaker to improve detection accuracy. Our proposed…
AI Generated Robotic Content
