Sciweavers



NAACL
2010


Stream-based Translation Models for Statistical Machine Translation

Download www.aclweb.org

Typical statistical machine translation systems are trained with static parallel corpora. Here we account for scenarios with a continuous incoming stream of parallel training data. Such scenarios include daily governmental proceedings, sustained output from translation agencies, or crowd-sourced translations. We show that incorporating recent sentence pairs from the stream improves performance compared with a static baseline. Since frequent batch retraining is computationally demanding, we introduce a fast incremental alternative using an online version of the EM algorithm. To bound our memory requirements, we use a novel data structure and associated training regime. When compared to frequent batch retraining, our online time- and space-bounded model achieves the same performance with significantly less computational overhead.
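
To make the idea of an online EM update concrete, here is a minimal sketch of generic stepwise EM applied to IBM Model 1-style lexical translation probabilities t(f|e). It is an illustration under stated assumptions, not the authors' implementation: the names `batches` and `stepwise_em`, the parameters `batch_size`, `alpha`, and `floor`, the step-size schedule, and the omission of a NULL word are all hypothetical choices made here. It also makes no attempt to bound memory, which the abstract handles with a separate data structure.

```python
from collections import defaultdict
from itertools import islice


def batches(stream, size):
    """Yield successive lists of up to `size` sentence pairs from an iterable."""
    it = iter(stream)
    while True:
        batch = list(islice(it, size))
        if not batch:
            return
        yield batch


def stepwise_em(stream, batch_size=2, alpha=0.7, floor=1e-3):
    """Stepwise (online) EM over a stream of (e_tokens, f_tokens) pairs.

    Returns a function prob(f, e) that estimates t(f | e).
    All parameter names and defaults are assumptions for this sketch.
    """
    counts = {}  # (f, e) -> interpolated expected count
    totals = {}  # e -> sum over f of counts[(f, e)]

    def prob(f, e):
        # Pairs never seen together fall back to a small constant.
        if (f, e) in counts and totals.get(e, 0.0) > 0.0:
            return counts[(f, e)] / totals[e]
        return floor

    for k, batch in enumerate(batches(stream, batch_size)):
        # E-step on the current mini-batch only, using current parameters.
        fresh = defaultdict(float)
        for e_sent, f_sent in batch:
            if not e_sent:
                continue  # nothing to align to
            for f in f_sent:
                z = sum(prob(f, e) for e in e_sent)
                for e in e_sent:
                    fresh[(f, e)] += prob(f, e) / z

        # Interpolate old and new expected counts with a decaying step size.
        eta = (k + 2) ** -alpha
        for key in counts:
            counts[key] *= 1.0 - eta
        for key, value in fresh.items():
            counts[key] = counts.get(key, 0.0) + eta * value

        # M-step: refresh the per-word normalisers.
        totals.clear()
        for (f, e), value in counts.items():
            totals[e] = totals.get(e, 0.0) + value

    return prob
```

Each mini-batch is processed once and then discarded, so the cost of an update depends on the batch size, not on how much data has already been seen. That property is what makes this family of updates attractive as an alternative to full batch retraining.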

Abby Levenberg, Chris Callison-Burch, Miles Osborne

Computational Linguistics | Continuous Incoming Stream | Frequent Batch | NAACL 2010 | Static Parallel Corpora

Related Content

» Mixing Multiple Translation Models in Statistical Machine Translation

» Given Bilingual Terminology in Statistical Machine Translation MWE-Sensitive Word Alignment ...

» Translation Model Pruning via Usage Statistics for Statistical Machine Translation

» Towards a Unified Approach to Memory- and Statistical-Based Machine Translation

» N-Gram-Based Statistical Machine Translation versus Syntax Augmented Machine Translation Com...

» Combining Word-Level and Character-Level Models for Machine Translation Between Closely-Relat...

» Feature-Rich Statistical Translation of Noun Phrases

» Chunk-Based Statistical Translation

» MISTRAL: a Statistical Machine Translation Decoder for Speech Recognition Lattices

» Fast and Scalable Decoding with Language Model Look-Ahead for Phrase-based Statistical Machi...

Post Info

Added	14 Feb 2011
Updated	14 Feb 2011
Type	Conference
Year	2010
Where	NAACL
Authors	Abby Levenberg, Chris Callison-Burch, Miles Osborne
