Sciweavers

» Finding translations for low-frequency words in comparable c...
BMCBI
2008
13 years 6 months ago
Comparative analysis of five protein-protein interaction corpora
Background: Growing interest in the application of natural language processing methods to biomedical text has led to an increasing number of corpora and methods targeting protein-...
Sampo Pyysalo, Antti Airola, Juho Heimonen, Jari B...
ACL
2008
13 years 8 months ago
Unsupervised Translation Induction for Chinese Abbreviations using Monolingual Corpora
Chinese abbreviations are widely used in modern Chinese texts. Compared with English abbreviations (which are mostly acronyms and truncations), the formation of Chinese abbreviati...
Zhifei Li, David Yarowsky
LREC
2010
13 years 8 months ago
Linguistically Motivated Unsupervised Segmentation for Machine Translation
In this paper we use statistical machine translation and morphology information from two different morphological analyzers to try to improve translation quality by linguistically ...
Mark Fishel, Harri Kirik
EMNLP
2009
13 years 4 months ago
Improved Statistical Machine Translation Using Monolingually-Derived Paraphrases
Untranslated words still constitute a major problem for Statistical Machine Translation (SMT), and current SMT systems are limited by the quantity of parallel training texts. Augm...
Yuval Marton, Chris Callison-Burch, Philip Resnik
ACL
2010
13 years 4 months ago
Improving Statistical Machine Translation with Monolingual Collocation
This paper proposes to use monolingual collocations to improve Statistical Machine Translation (SMT). We make use of the collocation probabilities, which are estimated from monoli...
Zhan-yi Liu, Haifeng Wang, Hua Wu, Sheng Li