Sciweavers

242 search results - page 44 / 49
» Improving Statistical Machine Translation with Monolingual C...
Sort
View
ACL
2008
13 years 9 months ago
Combining Multiple Resources to Improve SMT-based Paraphrasing Model
This paper proposes a novel method that exploits multiple resources to improve statistical machine translation (SMT) based paraphrasing. In detail, a phrasal paraphrase table and ...
Shiqi Zhao, Cheng Niu, Ming Zhou, Ting Liu, Sheng ...
EMNLP
2008
13 years 9 months ago
Improved Sentence Alignment on Parallel Web Pages Using a Stochastic Tree Alignment Model
Parallel web pages are important source of training data for statistical machine translation. In this paper, we present a new approach to sentence alignment on parallel web pages....
Lei Shi, Ming Zhou
ACL
2006
13 years 9 months ago
Extracting Parallel Sub-Sentential Fragments from Non-Parallel Corpora
We present a novel method for extracting parallel sub-sentential fragments from comparable, non-parallel bilingual corpora. By analyzing potentially similar sentence pairs using a...
Dragos Stefan Munteanu, Daniel Marcu
COLING
2010
13 years 2 months ago
An Empirical Study on Web Mining of Parallel Data
This paper1 presents an empirical approach to mining parallel corpora. Conventional approaches use a readily available collection of comparable, nonparallel corpora to extract par...
Gum-Won Hong, Chi-Ho Li, Ming Zhou, Hae-Chang Rim
CLEF
2009
Springer
13 years 8 months ago
Multilingual Query Expansion for CLEF Adhoc-TEL
In this paper we will briefly describe the approaches taken by the Cheshire (Berkeley) Group for the CLEF Adhoc-TEL 2009 tasks (Mono and Bilingual retrieval). Recognizing that man...
Ray R. Larson