Adaptive Knowledge Distillation for Device-Directed Speech Detection
Device-directed speech detection (DDSD) is a binary classification task that separates the user’s queries to a voice assistant (VA) from background speech or side conversations. This is important for achieving naturalistic user experience. To this end, we propose knowledge distillation (KD) to enhance DDSD accuracy while ensuring efficient deployment. Specifically, we introduce a novel adaptive …
Read more “Adaptive Knowledge Distillation for Device-Directed Speech Detection”