Sciweavers

LREC
2010
Ngram Search Engine with Patterns Combining Token, POS, Chunk and NE Information
We developed a search tool for ngrams extracted from a very large corpus (the current system uses the entire Wikipedia, which has...
Satoshi Sekine, Kapil Dalwani
LREC
2010
PDTB XML: the XMLization of the Penn Discourse TreeBank 2.0
The current study presents a conversion and unification of the Penn Discourse TreeBank 2.0 under the XML format. The converted corpus allows for a simultaneous search for syntacti...
Xuchen Yao, Irina V. Borisova, Mehwish Alam
LREC
2010
Multilingual Voice Creation Toolkit for the MARY TTS Platform
This paper describes an open source voice creation toolkit that supports the creation of unit selection and HMM-based voices, for the MARY (Modular Architecture for Research on sp...
Sathish Pammi, Marcela Charfuelan, Marc Schrö...
LREC
2010
MULTEXT-East Version 4: Multilingual Morphosyntactic Specifications, Lexicons and Corpora
The paper presents the fourth, "Mondilex" edition of the MULTEXT-East language resources, a multilingual dataset for language engineering research and development, focus...
Tomaz Erjavec
LREC
2010
Indexing Methods for Faster and More Effective Person Name Search
This paper compares several indexing methods for person names extracted from text, developed for an information retrieval system with requirements for fast approximate matching of...
Mark Arehart
LREC
2010
LAF/GrAF-grounded Representation of Dependency Structures
This paper shows that a LAF/GrAF-based annotation schema can be used for the adequate representation of syntactic dependency structures in many languages. We first argue that ther...
Yoshihiko Hayashi, Thierry Declerck, Chiharu Naraw...
LREC
2010
Resources for Controlled Languages for Alert Messages and Protocols in the European Perspective
This paper is concerned with resources for controlled languages for alert messages and protocols in the European perspective. These resources have been produced as the outcome of ...
Sylviane Cardey, Krzysztof Bogacki, Xavier Blanco,...
LREC
2010
Using Dialogue Corpora to Extend Information Extraction Patterns for Natural Language Understanding of Dialogue
This paper examines how Natural Language Processing (NLP) resources and online dialogue corpora can be used to extend coverage of Information Extraction (IE) templates in a Spoken Di...
Roberta Catizone, Alexiei Dingli, Robert J. Gaizau...
LREC
2010
The Demo / Kemo Corpus: A Principled Approach to the Study of Cross-cultural Differences in the Vocal Expression and Perception
This paper presents the Demo / Kemo corpus of Dutch and Korean emotional speech. The corpus has been specifically developed for the purpose of cross-linguistic comparison, and is ...
Martijn Goudbeek, Mirjam Broersma