
Improving Voice Trigger Detection with Metric Learning

Voice trigger detection is an important task that activates a voice assistant when a target user speaks a keyword phrase. A detector is typically trained on speech data independent of speaker information and then used for the voice trigger detection task. However, such a speaker-independent voice trigger detector typically suffers from performance degradation on speech from underrepresented groups, such as accented speakers. In this work, we propose a novel voice trigger detector that can use a small number of utterances from a target speaker to improve detection accuracy. Our proposed…
AI Generated Robotic Content
