Categories: FAANG

Multichannel Voice Trigger Detection Based on Transform-average-concatenate

This paper was accepted at the workshop HSCMA at ICASSP 2024.
Voice triggering (VT) enables users to activate their devices by just speaking a trigger phrase. A front-end system is typically used to perform speech enhancement and/or separation, and produces multiple enhanced and/or separated signals. Since conventional VT systems take only single-channel audio as input, channel selection is performed. A drawback of this approach is that unselected channels are discarded, even if the discarded channels could contain useful information for VT. In this work, we propose multichannel acoustic…

How Multichannel Marketing and AI Elevate the Customer Experience

When we think of AI, our minds might drift to sophisticated robotics, self-driving cars, or the AI application du jour, ChatGPT. But, AI technology has also been making a big splash when it comes to marketing orchestration. From elevating the customer journey to powering personalization to transforming the customer experience,…

July 19, 2023

In "Text"

Voice Trigger System for Siri

August 12, 2023

In "FAANG"

Improving Voice Trigger Detection with Metric Learning

Voice trigger detection is an important task, which enables activating a voice assistant when a target user speaks a keyword phrase. A detector is typically trained on speech data independent of speaker information and used for the voice trigger detection task. However, such a speaker independent voice trigger detector typically…

September 3, 2022

In "FAANG"

AI Generated Robotic Content