Sciweavers

76 search results - page 8 / 16
» Pseudo-Aligned Multilingual Corpora
ICASSP
2010
IEEE
13 years 8 months ago
Framework for cross-language automatic phonetic segmentation
Annotation of large multilingual corpora remains a challenge to the data-driven approach to speech research, especially for under-resourced languages. This paper presents crosslan...
Udochukwu Kalu Ogbureke, Julie Carson-Berndsen
CLEF
2010
Springer
13 years 8 months ago
Creating a Persian-English Comparable Corpus
Multilingual corpora are valuable resources for cross-language information retrieval and are available in many language pairs. However, the Persian language does not have rich multi...
Homa Baradaran Hashemi, Azadeh Shakery, Heshaam Fe...
LREC
2008
13 years 9 months ago
Glossa: a Multilingual, Multimodal, Configurable User Interface
We describe a web-based corpus query system, Glossa, which combines the expressiveness of regular query languages with the user-friendliness of a graphical interface. Since corpus...
Lars Nygaard, Joel Priestley, Anders Nøkles...
LREC
2010
13 years 9 months ago
Automatic Acquisition of Parallel Corpora from Websites with Dynamic Content
Parallel corpora are indispensable resources for a variety of multilingual natural language processing tasks. This paper presents a technique for fully automatic construction of c...
Yulia Tsvetkov, Shuly Wintner
LREC
2008
13 years 9 months ago
Speaker Recognition: Building the Mixer 4 and 5 Corpora
The original Mixer corpus was designed to satisfy developing commercial and forensic needs. The resulting Mixer corpora, Phases 1 through 5, have evolved to support an increasing...
Linda Brandschain, Christopher Cieri, David Graff,...