Emphasis Control for Parallel Neural TTS

Recent parallel neural text-to-speech (TTS) synthesis methods are able to generate speech with high fidelity while maintaining high performance. However, these systems often lack control over the output prosody, thus restricting the semantic information conveyable for a given text. This paper proposes a hierarchical parallel neural TTS system for prosodic emphasis control by learning a …

Optimizing shipping logistics in a time of change

Within logistics, shipping is a vast and delicate ecosystem. Over the last couple of years many people were directly impacted by complete production shutdowns, huge and unexpected swings in consumer demand, lack of labor at ports, a shortage of shipping containers… just to name a few! Addressing challenges with business analytics To help with some …

image2

A Multi-Axis Approach for Vision Transformer and MLP Models

Posted by Zhengzhong Tu and Yinxiao Li, Software Engineers, Google Research Convolutional neural networks have been the dominant machine learning architecture for computer vision since the introduction of AlexNet in 2012. Recently, inspired by the evolution of Transformers in natural language processing, attention mechanisms have been prominently incorporated into vision models. These attention methods boost …