Sciweavers

LREC
2008
134views Education» more  LREC 2008»
13 years 9 months ago
Dependency-Based Relation Mining for Biomedical Literature
We describe techniques for the automatic detection of relationships among domain entities (e.g. genes, proteins, diseases) mentioned in the biomedical literature. Our approach is ...
Fabio Rinaldi, Gerold Schneider, Kaarel Kaljurand,...
LREC
2008
97views Education» more  LREC 2008»
13 years 9 months ago
Towards a Reference Corpus of Web Genres for the Evaluation of Genre Identification Systems
We present initial results from an international and multi-disciplinary research collaboration that aims at the construction of a reference corpus of web genres. The primary appli...
Georg Rehm, Marina Santini, Alexander Mehler, Pave...
LREC
2008
131views Education» more  LREC 2008»
13 years 9 months ago
Learning Morphology with Morfette
Morfette is a modular, data-driven, probabilistic system which learns to perform joint morphological tagging and lemmatization from morphologically annotated corpora. The system i...
Grzegorz Chrupala, Georgiana Dinu, Josef van Genab...
LREC
2008
120views Education» more  LREC 2008»
13 years 9 months ago
The U.S. Policy Agenda Legislation Corpus Volume 1 - a Language Resource from 1947 - 1998
We introduce the corpus of United States Congressional bills from 1947 to 1998 for use by language research communities. The U.S. Policy Agenda Legislation Corpus Volume 1 (USPALC...
Stephen Purpura, John Wilkerson, Dustin Hillard
LREC
2008
100views Education» more  LREC 2008»
13 years 9 months ago
Ensuring Semantic Interoperability on Lexical Resources
In this paper, we describe a unifying approach to tackle data heterogeneity issues for lexica and related resources. We present LEXUS, our software that implements the Lexical Mar...
Marc Kemps-Snijders, Claus Zinn, Jacquelijn Ringer...
LREC
2008
155views Education» more  LREC 2008»
13 years 9 months ago
Exploring and Enriching a Language Resource Archive via the Web
The "download first, then process paradigm" is still the predominant working method amongst the research community. The web-based paradigm, however, offers many advantag...
Marc Kemps-Snijders, Alexander Klassmann, Claus Zi...
LREC
2008
106views Education» more  LREC 2008»
13 years 9 months ago
A Corpus for Cross-Document Co-reference
This paper describes a newly created text corpus of news articles that has been annotated for cross-document co-reference. Being able to robustly resolve references to entities ac...
David Day, Janet Hitzeman, Michael L. Wick, Keith ...
LREC
2008
95views Education» more  LREC 2008»
13 years 9 months ago
Application of Resource-based Machine Translation to Real Business Scenes
As huge quantities of documents have become available, services using natural language processing technologies trained by huge corpora have emerged, such as information retrieval ...
Hitoshi Isahara, Masao Utiyama, Eiko Yamamoto, Aki...
LREC
2008
158views Education» more  LREC 2008»
13 years 9 months ago
Linguistic Description and Automatic Extraction of Definitions from German Court Decisions
This paper discusses the use of computational linguistic technology to extract definitions from a large corpus of German court decisions. We present a corpus-based survey of defin...
Stephan Walter
LREC
2008
111views Education» more  LREC 2008»
13 years 9 months ago
The ATCOSIM Corpus of Non-Prompted Clean Air Traffic Control Speech
Air traffic control (ATC) is based on voice communication between pilots and controllers and uses a highly task and domain specific language. Due to this very reason, spoken langu...
Konrad Hofbauer, Stefan Petrik, Horst Hering