Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

147

Voted

ACL
2010

127views Computational Linguistics» more ACL 2010»

Intelligent Selection of Language Model Training Data

15 years 4 months ago

Intelligent Selection of Language Model Training Data

Download www.aclweb.org

We address the problem of selecting nondomain-specific language model training data to build auxiliary language models for use in tasks such as machine translation. Our approach is based on comparing the cross-entropy, according to domainspecific and non-domain-specifc language models, for each sentence of the text source used to produce the latter language model. We show that this produces better language models, trained on less data, than both random data selection and two other previously proposed methods.

Robert C. Moore, William Lewis

Real-time Traffic

ACL 2010 | Auxiliary Language Models | Computational Linguistics | Language Models | Nondomain-specific Language Model |

claim paper

Related Content

» A PhonemeBased Student Model for Adaptive Spelling Training

» A knowledgebased approach for designing intelligent team training systems

» Twolevel clustering approach to training data instance selection A case study for the stee...

» Combining Competing Language Understanding Approaches in an Intelligent Tutoring System

» Language Modeling for Determiner Selection

» Modeling Users of Crisis Training Environments by Integrating Psychological and Physiologi...

» Neural Network Language Models for Translation with Limited Data

» Inferencing Bayesian Networks from Time Series Data Using Natural Selection

» An Intelligent Agent That Autonomously Learns How to Translate

Post Info
More Details (n/a)

Added	10 Feb 2011
Updated	10 Feb 2011
Type	Journal
Year	2010
Where	ACL
Authors	Robert C. Moore, William Lewis

Comments (0)