Categories: FAANG

Corpus Synthesis for Zero-shot ASR Domain Adaptation using Large Language Models

While Automatic Speech Recognition (ASR) systems are widely used in many real-world applications, they often do not generalize well to new domains and need to be finetuned on data from these domains. However, target-domain data is usually not readily available in many scenarios. In this paper, we propose a new strategy for adapting ASR models to new target domains without any text or speech from those domains. To accomplish this, we propose a novel data synthesis pipeline that uses a Large Language Model (LLM) to generate a target domain text corpus, and a state-of-the-art controllable speech…
AI Generated Robotic Content

Recent Posts

AVERAGE COMFYUI USER

submitted by /u/james_za666 [link] [comments]

17 hours ago

Optimal Corpus Aware Training for Neural Machine Translation

Corpus Aware Training (CAT) leverages valuable corpus metadata during training by injecting corpus information into…

17 hours ago

Securely launch and scale your agents and tools on Amazon Bedrock AgentCore Runtime

Organizations are increasingly excited about the potential of AI agents, but many find themselves stuck…

17 hours ago

Applications Now Open for $60,000 NVIDIA Graduate Fellowship Awards

Bringing together the world’s brightest minds and the latest accelerated computing technology leads to powerful…

17 hours ago

Google adds limited chat personalization to Gemini, trails Anthropic and OpenAI in memory features

Google updated the Gemini app running of Gemini 2.5 Pro to reference all historical chats…

18 hours ago

OpenAI Designed GPT-5 to Be Safer. It Still Outputs Gay Slurs

The new version of ChatGPT explains why it won’t generate rule-breaking outputs. WIRED’s initial analysis…

18 hours ago