
ICASSP 2009, IEEE

Unsupervised acoustic and language model training with small amounts of labelled data

We measure the effect of a weak language model, estimated from as little as 100k words of text, on unsupervised acoustic model training, and then explore the best method of using word confidences to estimate n-gram counts for unsupervised language model training. Even with only 100k words of text and 10 hours of training data, unsupervised acoustic modeling is robust, recovering 50% of the gain achieved by supervised training. For language model training, multiplying the word confidences together to form a weighted count gives the best result, reducing WER by 2% over the baseline language model and by 0.5% absolute over using unweighted transcripts. Oracle experiments show that a larger gain is possible, but better confidence estimation techniques are needed to identify correct n-grams.
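The confidence-weighted counting described in the abstract can be illustrated with a short sketch: each n-gram in the automatic transcripts contributes the product of its word confidences to its count, rather than a count of one. The function name, data format, and toy values below are illustrative assumptions, not taken from the paper.

from collections import defaultdict
from math import prod

def weighted_ngram_counts(hypotheses, n=3):
    # Accumulate confidence-weighted n-gram counts from automatic transcripts.
    # `hypotheses` is a list of utterances, each a list of (word, confidence)
    # pairs produced by the recognizer; this format is an assumption.
    # Each n-gram's count is incremented by the product of the confidences
    # of the words it spans, instead of by 1 as in ordinary counting.
    counts = defaultdict(float)
    for utterance in hypotheses:
        words = [w for w, _ in utterance]
        confs = [c for _, c in utterance]
        for i in range(len(words) - n + 1):
            ngram = tuple(words[i:i + n])
            counts[ngram] += prod(confs[i:i + n])
    return counts

# Toy example: bigram counts from two hypothesized utterances.
hyps = [
    [("the", 0.9), ("cat", 0.8), ("sat", 0.7)],
    [("the", 0.95), ("cat", 0.6), ("ran", 0.5)],
]
print(weighted_ngram_counts(hyps, n=2))
# ('the', 'cat') receives 0.9*0.8 + 0.95*0.6 = 1.29 instead of 2.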
Type: Conference
Year: 2009
Where: ICASSP
Authors: Scott Novotney, Richard M. Schwartz, Jeff Ma