Search Sciweavers | Sciweavers

391 search results - page 23 / 79

» Finding and Extracting Data Records from Web Pages


WIDM
2004
ACM

Internet Technology

Stylistic and lexical co-training for web block classification


Download www.comp.nus.edu.sg

Many applications which use web data extract information from a limited number of regions on a web page. As such, web page division into blocks and the subsequent block classifica...

Chee How Lee, Min-Yen Kan, Sandra Lai



KDD
2002
ACM

Data Mining

Discovering informative content blocks from Web documents


Download www.cs.ualberta.ca

In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...

Shian-Hua Lin, Jan-Ming Ho



ADC
2006
Springer

Database

A two-phase rule generation and optimization approach for wrapper generation


Download crpit.com

Web information extraction is a fundamental issue for web information management and integrations. A common approach is to use wrappers to extract data from web pages or documents...

Yanan Hao, Yanchun Zhang



PVLDB
2010


ObjectRunner: Lightweight, Targeted Extraction and Querying of Structured Web Data


Download www.comp.nus.edu.sg

We present in this paper ObjectRunner, a system for extracting, integrating and querying structured data from the Web. Our system harvests real-world items from template-based HTM...

Talel Abdessalem, Bogdan Cautis, Nora Derouiche



GIR
2007
ACM

Information Technology

Geo-tagging for imprecise regions of different sizes


Download dis.shef.ac.uk

Extracting geographical information from various web sources is likely to be important for a variety of applications. One such use for this information is to enable the study of v...

Robert Pasley, Paul Clough, Mark Sanderson

