Sciweavers

LREC
2008
88views Education» more  LREC 2008»
13 years 9 months ago
An Infrastructure, Tools and Methodology for Evaluation of Multicultural Name Matching Systems
This paper describes a Name Matching Evaluation Laboratory that is a joint effort across multiple projects. The lab houses our evaluation infrastructure as well as multiple name m...
Keith J. Miller, Mark Arehart, Catherine Ball, Joh...
LREC
2008
220views Education» more  LREC 2008»
13 years 9 months ago
Introducing DRS (The Digital Replay System): a Tool for the Future of Corpus Linguistic Research and Analysis
This paper outlines the new resource technologies, products and applications that have been constructed during the development of a multi-modal (MM hereafter) corpus tool on the D...
Dawn Knight, Paul Tennent
LREC
2008
119views Education» more  LREC 2008»
13 years 9 months ago
What's in a Colour? Studying and Contrasting Colours with COMPARA
In this paper we present contrastive colour studies done using COMPARA, the largest edited parallel corpus in the world (as far as we know). The studies were the result of semanti...
Diana Santos, Maria do Rosário Silva, Susan...
LREC
2008
160views Education» more  LREC 2008»
13 years 9 months ago
Automatic extraction of subcategorization frames for Italian
Subcategorization is a kind of knowledge which can be considered as crucial in several NLP tasks, such as Information Extraction or parsing, but the collection of very large resou...
Dino Ienco, Serena Villata, Cristina Bosco
LREC
2008
154views Education» more  LREC 2008»
13 years 9 months ago
Benchmark Databases for Video-Based Automatic Sign Language Recognition
A new, linguistically annotated, video database for automatic sign language recognition is presented. The new RWTH-BOSTON-400 corpus, which consists of 843 sentences, several spea...
Philippe Dreuw, Carol Neidle, Vassilis Athitsos, S...
LREC
2008
119views Education» more  LREC 2008»
13 years 9 months ago
Assessing the Costs of Machine-Assisted Corpus Annotation through a User Study
Fixed, limited budgets often constrain the amount of expert annotation that can go into the construction of annotated corpora. Estimating the cost of annotation is the first step ...
Eric K. Ringger, Marc Carmen, Robbie Haertel, Kevi...
LREC
2008
114views Education» more  LREC 2008»
13 years 9 months ago
Ontology Search with the OntoSelect Ontology Library
OntoSelect is a dynamic web-based ontology library that harvests, analyzes and organizes ontologies published on the Semantic Web. OntoSelect allows searching as well as browsing ...
Paul Buitelaar, Thomas Eigner
LREC
2008
135views Education» more  LREC 2008»
13 years 9 months ago
CORP-ORAL: Spontaneous Speech Corpus for European Portuguese
Research activity on the Portuguese language for speech synthesis and recognition has suffered from a considerable lack of human and material resources. This has raised some obsta...
Fabíola Santos, Tiago Freitas
LREC
2008
98views Education» more  LREC 2008»
13 years 9 months ago
Producing a Test Collection for Patent Machine Translation in the Seventh NTCIR Workshop
In aiming at research and development on machine translation, we produced a test collection for Japanese-English machine translation in the seventh NTCIR Workshop. This paper desc...
Atsushi Fujii, Masao Utiyama, Mikio Yamamoto, Take...