Categories: FAANG

Improving Voice Trigger Detection with Metric Learning

Voice trigger detection is an important task, which enables activating a voice assistant when a target user speaks a keyword phrase. A detector is typically trained on speech data independent of speaker information and used for the voice trigger detection task. However, such a speaker independent voice trigger detector typically suffers from performance degradation on speech from underrepresented groups, such as accented speakers. In this work, we propose a novel voice trigger detector that can use a small number of utterances from a target speaker to improve detection accuracy. Our proposed…
AI Generated Robotic Content

Recent Posts

Experimenting with Continuity Edits | Wan 2.2 + InfiniteTalk + Qwen Image Edit

Here is the Episode 3 of my AI sci-fi film experiment. Earlier episodes are posted…

14 hours ago

10 Python One-Liners Every Machine Learning Practitioner Should Know

Developing machine learning systems entails a well-established lifecycle, consisting of a series of stages from…

14 hours ago

Authenticate Amazon Q Business data accessors using a trusted token issuer

Since its general availability in 2024, Amazon Q Business (Amazon Q) has enabled independent software…

14 hours ago

From query to cart: Inside Target’s search bar overhaul with AlloyDB AI

Editor’s note: Target set out to modernize its digital search experience to better match guest…

14 hours ago

Automated Sextortion Spyware Takes Webcam Pics of Victims Watching Porn

A new specimen of “infostealer” malware offers a disturbing feature: It monitors a target's browser…

15 hours ago

A robot learns to handle bulky objects like humans do after just one lesson

For all their technological brilliance, from navigating distant planets to performing complex surgery, robots still…

15 hours ago