Categories: FAANG

Improving Voice Trigger Detection with Metric Learning

Voice trigger detection is an important task, which enables activating a voice assistant when a target user speaks a keyword phrase. A detector is typically trained on speech data independent of speaker information and used for the voice trigger detection task. However, such a speaker independent voice trigger detector typically suffers from performance degradation on speech from underrepresented groups, such as accented speakers. In this work, we propose a novel voice trigger detector that can use a small number of utterances from a target speaker to improve detection accuracy. Our proposed…
AI Generated Robotic Content

Recent Posts

AOL Will Shut Down Dial-Up Internet Access in September

The move will pinch users in rural or remote areas not yet served by broadband…

56 mins ago

Filtered data stops openly-available AI models from performing dangerous tasks, study finds

Researchers from the University of Oxford, EleutherAI, and the UK AI Security Institute have reported…

57 mins ago

UltraReal + Nice Girls LoRAs for Qwen-Image

TL;DR — I trained two LoRAs for Qwen-Image: Lenovo: my cross-model realism booster (I port…

24 hours ago

How to Interpret Your XGBoost Model: A Practical Guide to Feature Importance

One of the most widespread machine learning techniques is XGBoost (Extreme Gradient Boosting).

24 hours ago

Misty: UI Prototyping Through Interactive Conceptual Blending

UI prototyping often involves iterating and blending elements from examples such as screenshots and sketches,…

24 hours ago

Accelerating Video Quality Control at Netflix with Pixel Error Detection

By Leo Isikdogan, Jesse Korosi, Zile Liao, Nagendra Kamath, Ananya PoddarAt Netflix, we support the filmmaking…

24 hours ago