Categories: FAANG

Navigating the Challenges and Opportunities of Synthetic Voices

We’re sharing lessons from a small scale preview of Voice Engine, a model for creating custom voices.

Compact Neural TTS Voices for Accessibility

Contemporary text-to-speech solutions for accessibility applications can typically be classified into two categories: (i) device-based statistical parametric speech synthesis (SPSS) or unit selection (USEL) and (ii) cloud-based neural TTS. SPSS and USEL offer low latency and low disk footprint at the expense of naturalness and audio quality. Cloud-based neural TTS…

January 31, 2025

In "FAANG"

Is your conversational AI setting the right tone?

Conversational AI is too artificial Nothing is more frustrating than calling a customer support line to be greeted by a monotone, robotic, automated voice. The voice on the other end of the phone is taking painfully long to read you the menu options. You’re two seconds away from either hanging…

September 27, 2022

In "FAANG"

Speech AI Year in Review

Almost anywhere you looked, AI-based speech technologies continued to blossom in 2022, from increased interest measured in Google Trends, to surprising medical advances that suggest speech patterns can help detect some illnesses, to the variety of digital services and devices that users control with their voices. At Google Cloud, we spent…

December 17, 2022

In "FAANG"

AI Generated Robotic Content