Categories: FAANG

A Treatise On FST Lattice Based MMI Training

Maximum mutual information (MMI) has become one of the two de facto methods for sequence-level training of speech recognition acoustic models. This paper aims to isolate, identify and bring forward the implicit modelling decisions induced by the design implementation of standard finite state transducer (FST) lattice based MMI training framework. The paper particularly investigates the necessity to maintain a preselected numerator alignment and raises the importance of determinizing FST denominator lattices on the fly. The efficacy of employing on the fly FST lattice determinization is…

Space-Efficient Representation of Entity-centric Query Language Models

Virtual assistants make use of automatic speech recognition (ASR) to help users answer entity-centric queries. However, spoken entity recognition is a difficult problem, due to the large number of frequently-changing named entities. In addition, resources available for recognition are constrained when ASR is performed on-device. In this work, we investigate…

September 3, 2022

In "FAANG"

Researchers unleash machine learning in designing advanced lattice structures

Characterized by their intricate patterns and hierarchical designs, lattice structures hold immense potential for revolutionizing industries ranging from aerospace to biomedical engineering, due to their versatility and customizability. However, the complexity of these structures and the vast design space they encompass have posed significant hurdles for engineers and scientists, and…

August 23, 2024

In "AI/ML News"