Sciweavers

COLING
2010
13 years 7 months ago
Broad Coverage Multilingual Deep Sentence Generation with a Stochastic Multi-Level Realizer
Most of the known stochastic sentence generators use syntactically annotated corpora, performing the projection to the surface in one stage. However, in full-fledged text generati...
Bernd Bohnet, Leo Wanner, Simon Mille, Alicia Burg...
EMNLP
2010
13 years 10 months ago
WikiWars: A New Corpus for Research on Temporal Expressions
The reliable extraction of knowledge from text requires an appropriate treatment of the time at which reported events take place. Unfortunately, there are very few annotated data ...
Pawel P. Mazur, Robert Dale
ACL
2010
13 years 10 months ago
Temporal Information Processing of a New Language: Fast Porting with Minimal Resources
We describe the semi-automatic adaptation of a TimeML annotated corpus from English to Portuguese, a language for which TimeML annotated data was not available yet. In order to va...
Francisco Costa, António Branco
CORR
2006
Springer
14 years 14 days ago
Building a resource for studying translation shifts
This paper describes an interdisciplinary approach which brings together the fields of corpus linguistics and translation studies. It presents ongoing work on the creation of a co...
Lea Cyrus
BMCBI
2007
14 years 16 days ago
BioInfer: a corpus for information extraction in the biomedical domain
Background: Lately, there has been a great interest in the application of information extraction methods to the biomedical domain, in particular, to the extraction of relationship...
Sampo Pyysalo, Filip Ginter, Juho Heimonen, Jari B...
LREC
2008
14 years 1 months ago
AnCora: Multilevel Annotated Corpora for Catalan and Spanish
This paper presents AnCora, a multilingual corpus annotated at different linguistic levels consisting of 500,000 words in Catalan (AnCora-Ca) and in Spanish (AnCora-Es). At presen...
Mariona Taulé, Maria Antònia Martí...
LREC
2008
14 years 1 months ago
Exploiting Multiply Annotated Corpora in Biomedical Information Extraction Tasks
This paper discusses the problem of utilising multiply annotated data in training biomedical information extraction systems. Two corpora, annotated with entities and relations, an...
Barry Haddow, Beatrice Alex
LREC
2010
14 years 1 months ago
The NOMCO Multimodal Nordic Resource - Goals and Characteristics
This paper presents the multimodal corpora that are being collected and annotated in the Nordic NOMCO project. The corpora will be used to study communicative phenomena such as fe...
Patrizia Paggio, Jens Allwood, Elisabeth Ahlsen, K...
LREC
2010
14 years 1 months ago
Building a Gold Standard for Event Detection in Croatian
This paper describes the process of building a newspaper corpus annotated with events described in specific documents. The main difference to the corpora built as part of the TDT ...
Nikola Ljubesic, Tomislava Lauc, Damir Boras
DILS
2007
Springer
14 years 6 months ago
Using Annotations from Controlled Vocabularies to Find Meaningful Associations
This paper presents the LSLink (or Life Science Link) methodology that provides users with a set of tools to explore the rich Web of interconnected and annotated objects in multipl...
Woei-Jyh Lee, Louiqa Raschid, Padmini Srinivasan, ...