Sciweavers

242 search results - page 44 / 49
» Improving Statistical Machine Translation with Monolingual C...
Sort
View
ACL
2008
15 years 4 months ago
Combining Multiple Resources to Improve SMT-based Paraphrasing Model
This paper proposes a novel method that exploits multiple resources to improve statistical machine translation (SMT) based paraphrasing. In detail, a phrasal paraphrase table and ...
Shiqi Zhao, Cheng Niu, Ming Zhou, Ting Liu, Sheng ...
EMNLP
2008
15 years 4 months ago
Improved Sentence Alignment on Parallel Web Pages Using a Stochastic Tree Alignment Model
Parallel web pages are important source of training data for statistical machine translation. In this paper, we present a new approach to sentence alignment on parallel web pages....
Lei Shi, Ming Zhou
ACL
2006
15 years 4 months ago
Extracting Parallel Sub-Sentential Fragments from Non-Parallel Corpora
We present a novel method for extracting parallel sub-sentential fragments from comparable, non-parallel bilingual corpora. By analyzing potentially similar sentence pairs using a...
Dragos Stefan Munteanu, Daniel Marcu
COLING
2010
14 years 10 months ago
An Empirical Study on Web Mining of Parallel Data
This paper1 presents an empirical approach to mining parallel corpora. Conventional approaches use a readily available collection of comparable, nonparallel corpora to extract par...
Gum-Won Hong, Chi-Ho Li, Ming Zhou, Hae-Chang Rim
CLEF
2009
Springer
15 years 4 months ago
Multilingual Query Expansion for CLEF Adhoc-TEL
In this paper we will briefly describe the approaches taken by the Cheshire (Berkeley) Group for the CLEF Adhoc-TEL 2009 tasks (Mono and Bilingual retrieval). Recognizing that man...
Ray R. Larson