Sciweavers

498 search results - page 94 / 100
» Statistical Machine Translation by Parsing
EMNLP
2008
13 years 9 months ago
Improved Sentence Alignment on Parallel Web Pages Using a Stochastic Tree Alignment Model
Parallel web pages are an important source of training data for statistical machine translation. In this paper, we present a new approach to sentence alignment on parallel web pages....
Lei Shi, Ming Zhou
ACL
2003
13 years 8 months ago
tRuEcasIng
Truecasing is the process of restoring case information to badly-cased or noncased text. This paper explores truecasing issues and proposes a statistical, language modeling based ...
Lucian Vlad Lita, Abraham Ittycheriah, Salim Rouko...
CIKM
2009
Springer
14 years 2 months ago
Cross-language linking of news stories on the web using interlingual topic modelling
We have studied the problem of linking event information across different languages without the use of translation systems or dictionaries. The linking is based on interlingua in...
Wim De Smet, Marie-Francine Moens
CIVR
2004
Springer
14 years 26 days ago
Using Maximum Entropy for Automatic Image Annotation
In this paper, we propose the use of the Maximum Entropy approach for the task of automatic image annotation. Given labeled training data, Maximum Entropy is a statistical techniqu...
Jiwoon Jeon, R. Manmatha
NIPS
2007
13 years 9 months ago
Mining Internet-Scale Software Repositories
Large repositories of source code create new challenges and opportunities for statistical machine learning. Here we first develop Sourcerer, an infrastructure for the automated c...
Erik Linstead, Paul Rigor, Sushil Krishna Bajracha...