Sciweavers

118 search results - page 16 / 24
» BioInfer: a corpus for information extraction in the biomedi...
Sort
View
BMCBI
2008
139views more  BMCBI 2008»
13 years 7 months ago
Abbreviation definition identification based on automatic precision estimates
Background: The rapid growth of biomedical literature presents challenges for automatic text processing, and one of the challenges is abbreviation identification. The presence of ...
Sunghwan Sohn, Donald C. Comeau, Won Kim, W. John ...
CORR
2006
Springer
84views Education» more  CORR 2006»
13 years 7 months ago
The JRC-Acquis: A multilingual aligned parallel corpus with 20+ languages
We present a new, unique and freely available parallel corpus containing European Union (EU) documents of mostly legal nature. It is available in all 20 official EU languages, wit...
Ralf Steinberger, Bruno Pouliquen, Anna Widiger, C...
BMCBI
2008
171views more  BMCBI 2008»
13 years 7 months ago
Comparative analysis of five protein-protein interaction corpora
Background: Growing interest in the application of natural language processing methods to biomedical text has led to an increasing number of corpora and methods targeting protein-...
Sampo Pyysalo, Antti Airola, Juho Heimonen, Jari B...
SIGMOD
2010
ACM
201views Database» more  SIGMOD 2010»
13 years 7 months ago
I4E: interactive investigation of iterative information extraction
Information extraction systems are increasingly being used to mine structured information from unstructured text documents. A commonly used unsupervised technique is to build iter...
Anish Das Sarma, Alpa Jain, Divesh Srivastava
RANLP
2003
13 years 8 months ago
A framework for named entity recognition in the open domain
In this paper, a system for Named Entity Recognition in the Open domain (NERO) is described. It is concerned with recognition of various types of entity, types that will be approp...
Richard J. Evans