Sciweavers

2677 search results - page 33 / 536
» Extracting Structured Data from Web Pages
Sort
View
UIST
2006
ACM
14 years 2 months ago
Enabling web browsers to augment web sites' filtering and sorting functionalities
Existing augmentations of web pages are mostly small cosmetic changes (e.g., removing ads) and minor addition of third-party content (e.g., product prices from competing sites). N...
David F. Huynh, Robert C. Miller, David R. Karger
LREC
2010
216views Education» more  LREC 2010»
13 years 10 months ago
BlogBuster: A Tool for Extracting Corpora from the Blogosphere
This paper presents BlogBuster, a tool for extracting a corpus from the blogosphere. The topic of cleaning arbitrary web pages with the goal of extracting a corpus from web data, ...
Georgios Petasis, Dimitrios Petasis
ISEC
2001
Springer
180views ECommerce» more  ISEC 2001»
14 years 1 months ago
i-Cube: A Tool-Set for the Dynamic Extraction and Integration of Web Data Content
Over the past decade the Internet has evolved into the largest public community in the world. It provides a wealth of data content and services in almost every field of science, t...
Frankie Poon, Kostas Kontogiannis
WWW
2001
ACM
14 years 9 months ago
Finding Related Web Pages Based on Connectivity Information from a Search Engine
This paper proposes a method for finding related Web pages based on connectivity information of hyperlinks. As claimed by Kumar, a complete bipartite graph of Web pages can be reg...
Tsuyoshi Murata
IJMMS
2008
108views more  IJMMS 2008»
13 years 8 months ago
Ontology-based information extraction and integration from heterogeneous data sources
In this paper we present the design, implementation and evaluation of SOBA, a system for ontology-based information extraction from heterogeneous data resources, including plain t...
Paul Buitelaar, Philipp Cimiano, Anette Frank, Mat...