Categories: FAANG

Improving Voice Trigger Detection with Metric Learning

Voice trigger detection is an important task, which enables activating a voice assistant when a target user speaks a keyword phrase. A detector is typically trained on speech data independent of speaker information and used for the voice trigger detection task. However, such a speaker independent voice trigger detector typically suffers from performance degradation on speech from underrepresented groups, such as accented speakers. In this work, we propose a novel voice trigger detector that can use a small number of utterances from a target speaker to improve detection accuracy. Our proposed…
AI Generated Robotic Content

Recent Posts

Understanding RAG Part VI: Effective Retrieval Optimization

Be sure to check out the previous articles in this series: •

20 hours ago

PR Agencies in the Age of AI

TL;DR We compared Grok 3 and o3-mini’s results on this topic. They both passed. Since…

20 hours ago

How Rocket Companies modernized their data science solution on AWS

This post was written with Dian Xu and Joel Hawkins of Rocket Companies. Rocket Companies…

20 hours ago

Optimizing image generation pipelines on Google Cloud: A practical guide

Generative AI diffusion models such as Stable Diffusion and Flux produce stunning visuals, empowering creators…

20 hours ago

Supergiant Games battles back accusations it is working around SAG-AFTRA strike

After a public callout, the developers of Hades took to social media to clarify that…

21 hours ago

DOGE Put Him in the Treasury Department. His Company Has Federal Contracts Worth Millions

Experts say the conflicts posed by Tom Krause’s dual roles are unprecedented in the modern…

21 hours ago