Sciweavers

17 search results - page 1 / 4
» Improved Machine Translation Performance via Parallel Senten...
ACL
2006
15 years 3 months ago
Extracting Parallel Sub-Sentential Fragments from Non-Parallel Corpora
We present a novel method for extracting parallel sub-sentential fragments from comparable, non-parallel bilingual corpora. By analyzing potentially similar sentence pairs using a...
Dragos Stefan Munteanu, Daniel Marcu
NAACL
2010
15 years 8 days ago
Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment
The quality of a statistical machine translation (SMT) system is heavily dependent upon the amount of parallel sentences used in training. In recent years, there have been several...
Jason R. Smith, Chris Quirk, Kristina Toutanova
ACL
2004
15 years 3 months ago
Statistical Machine Translation with Word- and Sentence-Aligned Parallel Corpora
The parameters of statistical translation models are typically estimated from sentence-aligned parallel corpora. We show that significant improvements in the alignment and transla...
Chris Callison-Burch, David Talbot, Miles Osborne
COLING
2010
14 years 9 months ago
An Empirical Study on Web Mining of Parallel Data
This paper presents an empirical approach to mining parallel corpora. Conventional approaches use a readily available collection of comparable, nonparallel corpora to extract par...
Gum-Won Hong, Chi-Ho Li, Ming Zhou, Hae-Chang Rim