Sciweavers

125 search results - page 17 / 25
» Learning Translation Templates From Bilingual Text
Sort
View
WWW
2009
ACM
14 years 7 months ago
Extracting article text from the web with maximum subsequence segmentation
Much of the information on the Web is found in articles from online news outlets, magazines, encyclopedias, review collections, and other sources. However, extracting this content...
Jeff Pasternack, Dan Roth
ICML
1998
IEEE
14 years 7 months ago
Learning a Language-Independent Representation for Terms from a Partially Aligned Corpus
Cross-language latent semantic indexing is a method that learns useful languageindependent vector representations of terms through a statistical analysis of a documentaligned text...
Michael L. Littman, Fan Jiang, Greg A. Keim
SIGIR
2003
ACM
13 years 12 months ago
Transliteration of proper names in cross-language applications
Translation of proper names is generally recognized as a significant problem in many multi-lingual text and speech processing applications. Even when large bilingual lexicons use...
Paola Virga, Sanjeev Khudanpur
LREC
2008
155views Education» more  LREC 2008»
13 years 8 months ago
Using Reordering in Statistical Machine Translation based on Alignment Block Classification
Statistical Machine Translation (SMT) is based on alignment models which learn from bilingual corpora the word correspondences between source and target language. These models are...
Marta R. Costa-Jussà, José A. R. Fon...
PAMI
2006
117views more  PAMI 2006»
13 years 6 months ago
Metric Learning for Text Documents
High dimensional structured data such as text and images is often poorly understood and misrepresented in statistical modeling. The standard histogram representation suffers from ...
Guy Lebanon