Sciweavers

LREC
2008
122views Education» more  LREC 2008»
13 years 9 months ago
LILA: Cellular Telephone Speech Databases from Asia
The goal of the LILA project was the collection of speech databases over cellular telephone networks of five languages in three Asian countries. Three languages were recorded in I...
Eric Sanders, Asunción Moreno, Herbert Trop...
LREC
2008
81views Education» more  LREC 2008»
13 years 9 months ago
Annotation Tool Development for Large-Scale Corpus Creation Projects at the Linguistic Data Consortium
The Linguistic Data Consortium (LDC) creates a variety of linguistic resources
Kazuaki Maeda, Haejoong Lee, Shawn Medero, Julie M...
LREC
2008
122views Education» more  LREC 2008»
13 years 9 months ago
Rapid Deployment of a New METIS Language Pair: Catalan-English
We show here the viability of a rapid deployment of a new language pair within the METIS architecture. Contrarily to other SMT or EBMT systems, the METIS architecture allows us to...
Toni Badia, Maite Melero, Oriol Valentín
LREC
2008
92views Education» more  LREC 2008»
13 years 9 months ago
Annotating "tense" in a Tense-less Language
In the context of Natural Language Processing, annotation is about recovering implicit information that is useful for natural language applications. In this paper we describe a &q...
Nianwen Xue, Hua Zhong, Kai-Yun Chen
LREC
2008
147views Education» more  LREC 2008»
13 years 9 months ago
Word Segmentation of Vietnamese Texts: a Comparison of Approaches
We present in this paper a comparison between three segmentation systems for the Vietnamese language. Indeed, the majority of Vietnamese words is built by semantic composition fro...
Quang Thang Dinh, Hong Phuong Le, Thi Minh Huyen N...
LREC
2008
121views Education» more  LREC 2008»
13 years 9 months ago
Holy Moses! Leveraging Existing Tools and Resources for Entity Translation
Recently, there has been an emphasis on creating shared resources for natural language processing applications. This has resulted in the development of high-quality tools and data...
Jean Tavernier, Rosa Cowan, Michelle Vanni
LREC
2008
80views Education» more  LREC 2008»
13 years 9 months ago
Phone Segmentation Tool with Integrated Pronunciation Lexicon and Czech Phonetically Labelled Reference Database
Phonetic segmentation is the procedure which is used in many applications of speech processing, both as a subpart of automated systems or as the tool for an interactive work. In t...
Petr Pollák, Jan Volín, Radek Skarni...
LREC
2008
94views Education» more  LREC 2008»
13 years 9 months ago
LX-Service: Web Services of Language Technology for Portuguese
In the present paper we report on the development of a cluster of web services of language technology for Portuguese that we named as LXService. These web services permit the dire...
António Branco, Francisco Costa, Pedro Mart...
LREC
2008
256views Education» more  LREC 2008»
13 years 9 months ago
The ATIS Sign Language Corpus
Systems that automatically process sign language rely on appropriate data. We therefore present the ATIS sign language corpus that is based on the domain of air travel information...
Jan Bungeroth, Daniel Stein, Philippe Dreuw, Herma...