Sciweavers

WSDM
2012
ACM
12 years 4 months ago
Selecting actions for resource-bounded information extraction using reinforcement learning
Given a database with missing or uncertain content, our goal is to correct and fill the database by extracting specific information from a large corpus such as the Web, and to d...
Pallika H. Kanani, Andrew K. McCallum
INLG
2010
Springer
13 years 6 months ago
Extracting Parallel Fragments from Comparable Corpora for Data-to-text Generation
Building NLG systems, in particular statistical ones, requires parallel data (paired inputs and outputs) which do not generally occur naturally. In this paper, we investigate the ...
Anja Belz, Eric Kow
ITBAM
2010
13 years 6 months ago
MEDCollector: Multisource Epidemic Data Collector
This paper analyzes the requirements and presents a novel approach to the development of a system for epidemiological data collection and integration based on the principles of int...
João Zamite, Fabrício A. B. Silva, F...
WWW
2003
ACM
14 years 9 months ago
Annotating Web pages for the needs of Web Information Extraction Applications
This paper outlines our approach to the creation of annotated corpora for the purposes of Web Information Extraction, and presents the Web Annotation tool. This tool enables the a...
Georgios Sigletos, Dimitra Farmakiotou, Konstantin...
BMCBI
2005
13 years 8 months ago
A sentence sliding window approach to extract protein annotations from biomedical articles
Background: Within the emerging field of text mining and statistical natural language processing (NLP) applied to biomedical articles, a broad variety of techniques have been deve...
Martin Krallinger, Maria Padron, Alfonso Valencia