Sciweavers

152 search results - page 6 / 31
» Redundancy-Driven Web Data Extraction and Integration
Sort
View
FGCS
2007
108views more  FGCS 2007»
13 years 7 months ago
From bioinformatic web portals to semantically integrated Data Grid networks
We propose a semi-automated method for redeploying bioinformatic databases indexed in a Web portal as a decentralized, semantically integrated and service-oriented Data Grid. We g...
Adriana Budura, Philippe Cudré-Mauroux, Kar...
ICDIM
2008
IEEE
14 years 2 months ago
A geo-temporal Web gazetteer integrating data from multiple sources
This paper presents a geo-temporal gazetteer Web service that provides access to names of places and historical periods, together with the associated geotemporal information. With...
Hugo Manguinhas, Bruno Martins, José Luis B...
PODS
2004
ACM
189views Database» more  PODS 2004»
14 years 7 months ago
The Lixto Data Extraction Project - Back and Forth between Theory and Practice
We present the Lixto project, which is both a research project in database theory and a commercial enterprise that develops Web data extraction (wrapping) and Web service definiti...
Georg Gottlob, Christoph Koch, Robert Baumgartner,...
WWW
2009
ACM
14 years 8 months ago
Incorporating site-level knowledge to extract structured data from web forums
Web forums have become an important data resource for many web applications, but extracting structured data from unstructured web forum pages is still a challenging task due to bo...
Jiang-Ming Yang, Rui Cai, Yida Wang, Jun Zhu, Lei ...
KDD
2002
ACM
138views Data Mining» more  KDD 2002»
14 years 8 months ago
Learning to match and cluster large high-dimensional data sets for data integration
Part of the process of data integration is determining which sets of identifiers refer to the same real-world entities. In integrating databases found on the Web or obtained by us...
William W. Cohen, Jacob Richman