Getting More Mileage from Web Text Sources for Conversational Speech Language Modeling using Class-Dependent Mixtures

Sources of training data suitable for language modeling of conversational speech are limited. In this paper, we show how training data can be supplemented with text from the web filtered to match the style and/or topic of the target recognition task, and that larger performance gains can be obtained from this data by using class-dependent interpolation of N-grams.
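
To make "class-dependent interpolation" concrete, here is a minimal sketch, not the authors' implementation. It assumes two bigram component models (one from conversational transcripts, one from filtered web text) and mixture weights that are looked up by a class assigned to the history word. The abstract does not say how classes are defined or how weights are estimated, so the class map, the weights, and all probabilities below are hypothetical toy values.

```python
from typing import Dict, Tuple

Bigram = Tuple[str, str]  # (history word, next word)

# Toy component models P_i(next | history). All numbers are made up.
p_conversational: Dict[Bigram, float] = {
    ("uh", "huh"): 0.60,
    ("the", "weather"): 0.05,
}
p_web: Dict[Bigram, float] = {
    ("uh", "huh"): 0.10,
    ("the", "weather"): 0.20,
}

# Hypothetical mapping from a history word to its class.
word_class: Dict[str, str] = {"uh": "FILLER", "the": "FUNCTION"}

# Hypothetical per-class mixture weights (conversational, web).
# Each pair sums to 1 so the mixture remains a probability distribution.
class_weights: Dict[str, Tuple[float, float]] = {
    "FILLER": (0.9, 0.1),    # trust conversational data after fillers
    "FUNCTION": (0.5, 0.5),  # weight both sources equally
    "OTHER": (0.7, 0.3),     # fallback for words with no assigned class
}


def interpolate(history: str, word: str) -> float:
    """Return the sum over components of lambda_i(class(history)) * P_i(word | history)."""
    cls = word_class.get(history, "OTHER")
    lam_conv, lam_web = class_weights[cls]
    key = (history, word)
    # Unseen bigrams get 0.0 here; a real model would back off or smooth.
    return lam_conv * p_conversational.get(key, 0.0) + lam_web * p_web.get(key, 0.0)
```

With a single global weight pair, every history would mix the two sources identically. Keying the weights on the history class lets the web component contribute more in contexts where it is a better match and less where conversational data dominates.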

Ivan Bulyko, Mari Ostendorf, Andreas Stolcke

NAACL 2003 | NAACL 2007 | Style And/or Topic | Target Recognition Task | Training Data |

Post Info

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2003
Where	NAACL
Authors	Ivan Bulyko, Mari Ostendorf, Andreas Stolcke
