Sciweavers

391 search results - page 23 / 79
» Finding and Extracting Data Records from Web Pages
Sort
View
WIDM
2004
ACM
14 years 1 months ago
Stylistic and lexical co-training for web block classification
Many applications which use web data extract information from a limited number of regions on a web page. As such, web page division into blocks and the subsequent block classifica...
Chee How Lee, Min-Yen Kan, Sandra Lai
KDD
2002
ACM
148views Data Mining» more  KDD 2002»
14 years 9 months ago
Discovering informative content blocks from Web documents
In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...
Shian-Hua Lin, Jan-Ming Ho
ADC
2006
Springer
130views Database» more  ADC 2006»
14 years 2 months ago
A two-phase rule generation and optimization approach for wrapper generation
Web information extraction is a fundamental issue for web information management and integrations. A common approach is to use wrappers to extract data from web pages or documents...
Yanan Hao, Yanchun Zhang
PVLDB
2010
114views more  PVLDB 2010»
13 years 6 months ago
ObjectRunner: Lightweight, Targeted Extraction and Querying of Structured Web Data
We present in this paper ObjectRunner, a system for extracting, integrating and querying structured data from the Web. Our system harvests real-world items from template-based HTM...
Talel Abdessalem, Bogdan Cautis, Nora Derouiche
GIR
2007
ACM
14 years 10 days ago
Geo-tagging for imprecise regions of different sizes
Extracting geographical information from various web sources is likely to be important for a variety of applications. One such use for this information is to enable the study of v...
Robert Pasley, Paul Clough, Mark Sanderson