Sciweavers

51 search results - page 9 / 11
» Automatic Filtering of Bilingual Corpora for Statistical Mac...
Sort
View
ACL
2006
13 years 9 months ago
Weakly Supervised Named Entity Transliteration and Discovery from Multilingual Comparable Corpora
Named Entity recognition (NER) is an important part of many natural language processing tasks. Current approaches often employ machine learning techniques and require supervised d...
Alexandre Klementiev, Dan Roth
LREC
2010
213views Education» more  LREC 2010»
13 years 9 months ago
Active Learning and Crowd-Sourcing for Machine Translation
In recent years, corpus based approaches to machine translation have become predominant, with Statistical Machine Translation (SMT) being the most actively progressing area. Succe...
Vamshi Ambati, Stephan Vogel, Jaime G. Carbonell
ICCPOL
2009
Springer
14 years 4 days ago
Constructing Parallel Corpus from Movie Subtitles
Abstract. This paper describes a methodology for constructing aligned German-Chinese corpora from movie subtitles. The corpora will be used to train a special machine translation s...
Han Xiao, Xiaojie Wang
EMNLP
2009
13 years 5 months ago
Collocation Extraction Using Monolingual Word Alignment Method
Statistical bilingual word alignment has been well studied in the context of machine translation. This paper adapts the bilingual word alignment algorithm to monolingual scenario ...
Zhan-yi Liu, Haifeng Wang, Hua Wu, Sheng Li
LREC
2010
178views Education» more  LREC 2010»
13 years 9 months ago
Data Issues in English-to-Hindi Machine Translation
Statistical machine translation to morphologically richer languages is a challenging task and more so if the source and target languages differ in word order. Current state-of-the...
Ondrej Bojar, Pavel Stranák, Daniel Zeman