Sciweavers

591 search results - page 25 / 119
» Extracting Route Directions from Web Pages
Sort
View
ADC
2006
Springer
130views Database» more  ADC 2006»
14 years 3 months ago
A two-phase rule generation and optimization approach for wrapper generation
Web information extraction is a fundamental issue for web information management and integrations. A common approach is to use wrappers to extract data from web pages or documents...
Yanan Hao, Yanchun Zhang
ICMCS
2005
IEEE
89views Multimedia» more  ICMCS 2005»
14 years 3 months ago
Semantic Knowledge Building for Image Database by Analyzing Web Page Contents
In this paper, we present a method of semantic knowledge building for image database by extracting semantic meanings from Web page contents. The novelty of our method is that it i...
Yung-Kwang Lai, Song Liu, Liang-Tien Chia, Syin Ch...
WIDM
2003
ACM
14 years 3 months ago
Datarover: a taxonomy based crawler for automated data extraction from data-intensive websites
The advent of e-commerce has created a trend that brought thousands of catalogs online. Most of these websites are “taxonomy-directed”. A Web site is said to be ``taxonomydire...
Hasan Davulcu, S. Koduri, Saravanakumar Nagarajan
KDD
2002
ACM
148views Data Mining» more  KDD 2002»
14 years 10 months ago
Discovering informative content blocks from Web documents
In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...
Shian-Hua Lin, Jan-Ming Ho
WWW
2007
ACM
14 years 10 months ago
U-REST: an unsupervised record extraction system
In this paper, we describe a system that can extract record structures from web pages with no direct human supervision. Records are commonly occurring HTML-embedded data tuples th...
Yuan Kui Shen, David R. Karger