Weighted finite-state transducers in speech recognition

15 years 7 months ago

Download www.cs.nyu.edu

We survey the use of weighted finite-state transducers (WFSTs) in speech recognition. We show that WFSTs provide a common and natural representation for HMM models, context-dependency, pronunciation dictionaries, grammars, and alternative recognition outputs. Furthermore, general transducer operations combine these representations flexibly and efficiently. Weighted determinization and minimization algorithms optimize their time and space requirements, and a weight pushing algorithm distributes the weights along the paths of a weighted transducer optimally for speech recognition. As an example, we describe a North American Business News (NAB) recognition system built using these techniques that combines the HMMs, full cross-word triphones, a lexicon of forty thousand words, and a large trigram grammar into a single weighted transducer that is only somewhat larger than the trigram word grammar and that runs NAB in real-time on a very simple decoder. In another example, we show that the ...

Mehryar Mohri, Fernando Pereira, Michael Riley

Real-time Traffic

Automated Reasoning | CSL 2002 | Speech Recognition | Transducer | Weighted Transducer |

claim paper

Post Info
More Details (n/a)

Added	18 Dec 2010
Updated	18 Dec 2010
Type	Journal
Year	2002
Where	CSL
Authors	Mehryar Mohri, Fernando Pereira, Michael Riley

Comments (0)

Sciweavers

Weighted finite-state transducers in speech recognition

Automated Reasoning | CSL 2002 | Speech Recognition | Transducer | Weighted Transducer |

Explore & Download

Productivity Tools

Sciweavers