
ICASSP 2009, IEEE

Unsupervised acoustic and language model training with small amounts of labelled data

We measure the effects of a weak language model, estimated from as little as 100k words of text, on unsupervised acoustic model training, and then explore the best method of using word confidences to estimate n-gram counts for unsupervised language model training. Even with 100k words of text and 10 hours of training data, unsupervised acoustic modeling is robust, recovering 50% of the gain achieved by supervised training. For language model training, multiplying the word confidences together to form a weighted count produces the largest reduction in WER: 2% over the baseline language model and 0.5% absolute over using unweighted transcripts. Oracle experiments show that a larger gain is possible, but better confidence estimation techniques are needed to identify correct n-grams.
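The confidence-weighted counting described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the transcript format (per-word confidence pairs) and the function name are assumptions. The idea is that each n-gram extracted from the automatic transcripts contributes a fractional count equal to the product of the confidences of its words, so hypotheses containing uncertain words contribute less to the language model.

```python
# Hypothetical sketch of confidence-weighted n-gram counting for
# unsupervised LM training. The (word, confidence) transcript format
# and function name are illustrative assumptions only.
from collections import defaultdict


def weighted_ngram_counts(transcripts, n=3):
    """Accumulate fractional n-gram counts from confidence-annotated
    ASR output: each n-gram's count is the product of its word
    confidences, so low-confidence words down-weight the n-gram."""
    counts = defaultdict(float)
    for utterance in transcripts:
        words = [w for w, _ in utterance]
        confs = [c for _, c in utterance]
        for i in range(len(words) - n + 1):
            ngram = tuple(words[i:i + n])
            weight = 1.0
            for c in confs[i:i + n]:
                weight *= c
            counts[ngram] += weight
    return counts


if __name__ == "__main__":
    # Two toy utterances with per-word confidences.
    hyp = [
        [("the", 0.9), ("cat", 0.8), ("sat", 0.7)],
        [("the", 0.95), ("cat", 0.6), ("ran", 0.4)],
    ]
    for ngram, count in weighted_ngram_counts(hyp, n=2).items():
        print(ngram, round(count, 3))
```

Multiplying the confidences penalizes any n-gram that contains even one uncertain word, which is consistent with the paper's finding that weighted counts outperform simply counting the unweighted automatic transcripts.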
Scott Novotney, Richard M. Schwartz, Jeff Ma
Added: 21 May 2010
Updated: 21 May 2010
Type: Conference
Year: 2009
Where: ICASSP
Authors: Scott Novotney, Richard M. Schwartz, Jeff Ma