Sciweavers

76 search results - page 10 / 16
» Pseudo-Aligned Multilingual Corpora
LREC
2008
13 years 9 months ago
An Automatic Close Copy Speech Synthesis Tool for Large-Scale Speech Corpus Evaluation
The production of rich multilingual speech corpus resources on a large scale is a requirement for many linguistic, phonetic and technological tasks, in both research and applicati...
Dafydd Gibbon, Jolanta Bachan
LREC
2010
13 years 9 months ago
Syntactic Dependencies for Multilingual and Multilevel Corpus Annotation
The relevance of syntactic dependency annotated corpora is nowadays unquestioned. However, a broad debate on the optimal set of dependency relation tags has not yet taken place. As...
Simon Mille, Leo Wanner
ACL
2006
13 years 9 months ago
Exploiting Comparable Corpora and Bilingual Dictionaries for Cross-Language Text Categorization
Cross-language Text Categorization is the task of assigning semantic classes to documents written in a target language (e.g. English) while the system is trained using labeled doc...
Alfio Massimiliano Gliozzo, Carlo Strapparava
ACL
2006
13 years 9 months ago
Accurate Collocation Extraction Using a Multilingual Parser
This paper focuses on the use of advanced techniques of text analysis as support for collocation extraction. A hybrid system is presented that combines statistical methods and mul...
Violeta Seretan, Eric Wehrli
LREC
2010
13 years 9 months ago
Wikicorpus: A Word-Sense Disambiguated Multilingual Wikipedia Corpus
This article presents a new freely available trilingual corpus (Catalan, Spanish, English) that contains large portions of the Wikipedia and has been automatically enriched with l...
Samuel Reese, Gemma Boleda, Montse Cuadros, Lluí...