Sciweavers

76 search results - page 6 / 16
» Pseudo-Aligned Multilingual Corpora
Sort
View
NAACL
1994
13 years 9 months ago
Multilingual Speech Databases at LDC
As multilingual products and technology grow in importance, the Linguistic Data Consortium (LDC) intends to provide the resources needed for research and development activities, e...
John J. Godfrey
ECIR
2010
Springer
13 years 9 months ago
Extracting Multilingual Topics from Unaligned Comparable Corpora
Topic models have been studied extensively in the context of monolingual corpora. Though there are some attempts to mine topical structure from cross-lingual corpora, they require ...
Jagadeesh Jagarlamudi, Hal Daumé III
ACL
2004
13 years 9 months ago
Creating Multilingual Translation Lexicons with Regional Variations Using Web Corpora
The purpose of this paper is to automatically create multilingual translation lexicons with regional variations. We propose a transitive translation approach to determine translat...
Pu-Jen Cheng, Wen-Hsiang Lu, Jei-Wen Teng, Lee-Fen...
LREC
2010
141views Education» more  LREC 2010»
13 years 9 months ago
MULTEXT-East Version 4: Multilingual Morphosyntactic Specifications, Lexicons and Corpora
The paper presents the fourth, "Mondilex" edition of the MULTEXT-East language resources, a multilingual dataset for language engineering research and development, focus...
Tomaz Erjavec
ACL
2006
13 years 9 months ago
Weakly Supervised Named Entity Transliteration and Discovery from Multilingual Comparable Corpora
Named Entity recognition (NER) is an important part of many natural language processing tasks. Current approaches often employ machine learning techniques and require supervised d...
Alexandre Klementiev, Dan Roth