Categories: FAANG

Improvements to Embedding-Matching Acoustic-to-Word ASR Using Multiple-Hypothesis Pronunciation-Based Embeddings

In embedding-matching acoustic-to-word (A2W) ASR, every word in the vocabulary is represented by a fixed-dimension embedding vector that can be added or removed independently of the rest of the system. The approach is potentially an elegant solution for the dynamic out-of-vocabulary (OOV) words problem, where speaker- and context-dependent named entities like contact names must be incorporated into the ASR on-the-fly for every speech utterance at testing time. Challenges still remain, however, in improving the overall accuracy of embedding-matching A2W. In this paper, we contribute two methods…
AI Generated Robotic Content

Recent Posts

Weekly Showcase Thread September 15, 2024

A huge thank you to everyone who participated in our first Weekly Showcase! We saw…

18 hours ago

Suspected Trump Gunman Was Once Charged With Possession of a Weapon of Mass Destruction

"I figured he was either dead or in prison by now," says the charging officer…

19 hours ago

New research could make weird AI images a thing of the past

Generative artificial intelligence (AI) has notoriously struggled to create consistent images, often getting details like…

19 hours ago

What does it cost to build a conversational AI?

Integrating conversational AI can be incredibly useful. Just make sure you’re guided by what makes…

2 days ago

Bote Lowrider Aero Paddleboard Review: This SUP Knows What’s Up

Want a stand-up paddleboard? How about a kayak? Get both with the Bote Lowrider Aero,…

2 days ago

Automating Data Cleaning Processes with Pandas

Few data science projects are exempt from the necessity of cleaning data. Data cleaning encompasses…

3 days ago