Sciweavers

LREC
2010
MLIF: A Metamodel to Represent and Exchange Multilingual Textual Information
The fast evolution of language technology has produced pressing needs in standardization. The multiplicity of language resources representation levels and the specialization of th...
Samuel Cruz-Lara, Gil Francopoulo, Laurent Romary,...
LREC
2010
The SignSpeak Project - Bridging the Gap Between Signers and Speakers
The SignSpeak project will be the first step to approach sign language recognition and translation at a scientific level already reached in similar research fields such as automat...
Philippe Dreuw, Hermann Ney, Gregorio Martinez, On...
LREC
2010
A General Method for Creating a Bilingual Transliteration Dictionary
Transliteration is the rendering in one language of terms from another language (and, possibly, another writing system), approximating spelling and/or phonetic equivalents between...
Amit Kirschenbaum, Shuly Wintner
LREC
2010
Is Sentiment a Property of Synsets? Evaluating Resources for Sentiment Classification using Machine Learning
Existing approaches to classifying documents by sentiment include machine learning with features created from n-grams and part of speech. This paper explores a different approach ...
Aleksander Wawer
DGO
2007
D-HOTM: distributed higher order text mining
We present D-HOTM, a framework for Distributed Higher Order Text Mining based on named entities extracted from textual data that are stored in distributed relational databases. Unl...
William M. Pottenger
LREC
2010
A Flexible Representation of Heterogeneous Annotation Data
This paper describes a new flexible representation for the annotation of complex structures of metadata over heterogeneous data collections containing text and other types of medi...
Richard Johansson, Alessandro Moschitti
LREC
2010
How Large a Corpus Do We Need: Statistical Method Versus Rule-based Method
We investigate the impact of input data scale in corpus-based learning using a study style of Zipf's law. In our research, Chinese word segmentation is chosen as the study ca...
Hai Zhao, Yan Song, Chunyu Kit
LREC
2010
Two-level Annotation of Utterance-units in Japanese Dialogs: An Empirically Emerged Scheme
In this paper, we propose a scheme for annotating utterance-level units in Japanese dialogs, which emerged from an analysis of the interrelationship among four schemes, i) inter-p...
Yasuharu Den, Hanae Koiso, Takehiko Maruyama, Kiku...
LREC
2010
Transcription Methods for Consistency, Volume and Efficiency
This paper describes recent efforts at Linguistic Data Consortium at the University of Pennsylvania to create manual transcripts as a shared resource for human language technology...
Meghan Lammie Glenn, Stephanie Strassel, Haejoong ...
LREC
2010
A Derivational Rephrasing Experiment for Question Answering
In Knowledge Management, variations in information expressions have proven a real challenge. In particular, classical semantic relations (e.g. synonymy) do not connect words with ...
Bernard Jacquemin