Categories: FAANG

Hypernetworks for Personalizing ASR to Atypical Speech

*Equal Contributors
Parameter-efficient fine-tuning (PEFT) for personalizing automatic speech recognition (ASR) has recently shown promise for adapting general population models to atypical speech. However, these approaches assume a priori knowledge of the atypical speech disorder being adapted for — the diagnosis of which requires expert knowledge that is not always available. Even given this knowledge, data scarcity and high inter/intra-speaker variability further limit the effectiveness of traditional fine-tuning. To circumvent these challenges, we first identify the minimal set of model…
AI Generated Robotic Content

Recent Posts

The Ninja Slushi Is Only $200: Early Amazon Prime Day Deal 2026

Two years after it turned Marg Monday into a daily, the Ninja Slushi is only…

2 hours ago

Building Browser-Using AI Agents in Python

Most AI agent tutorials start with an API.

2 hours ago

Building pay-per-intelligence for AI agents: How Ampersend uses Amazon Bedrock AgentCore Payments

This post was co-written with Kevin Jones from Ampersend (Edge & Node) and Chethan Shriyan…

3 hours ago

Embed the world: Multimodal AI for searchable aerial imagery at scale

Turning a library of aerial imagery into a natural-language-searchable knowledge base is a problem that…

4 hours ago

Introducing Web Search on Amazon Bedrock AgentCore

AI agents are changing how organizations find and act on information, but they share one…

3 days ago

The Most Promising Ebola Vaccine Has Been Sitting on the Shelf for 15 Years

Years after initial tests, researchers are now racing to see if a vaccine developed in…

3 days ago