Sciweavers

468 search results - page 1 / 94
» Automatic Data Extraction from Data-Rich Web Pages
Sort
View
DASFAA
2005
IEEE
123views Database» more  DASFAA 2005»
13 years 8 months ago
Automatic Data Extraction from Data-Rich Web Pages
Abstract. Extracting data from web pages using wrappers is a fundamental problem arising in a large variety of applications of vast practical interests. In this paper, we propose a...
Dongdong Hu, Xiaofeng Meng
CIKM
1998
Springer
13 years 11 months ago
Ontology-Based Extraction and Structuring of Information from Data-Rich Unstructured Documents
We present a new approach to extracting information from unstructured documents based on an application ontology that describes a domain of interest. Starting with such an ontolog...
David W. Embley, Douglas M. Campbell, Randy D. Smi...
SYNASC
2006
IEEE
211views Algorithms» more  SYNASC 2006»
14 years 25 days ago
HTML Pattern Generator--Automatic Data Extraction from Web Pages
Existing methods of information extraction from HTML documents include manual approach, supervised learning and automatic techniques. The manual method has high precision and reca...
Mirel Cosulschi, Adrian Giurca, Bogdan Udrescu, Ni...
AUSDM
2006
Springer
160views Data Mining» more  AUSDM 2006»
13 years 10 months ago
Extraction of Flat and Nested Data Records from Web Pages
This paper deals with studies the problem of identification and extraction of flat and nested data records from a given web page. With the explosive growth of information sources ...
Siddu P. Algur, P. S. Hiremath