Sciweavers

SPIRE
1999
Springer

Top-down Extraction of Semi-Structured Data

In this paper, we propose an innovative approach to extracting semi-structured data from Web sources. The idea is to collect a couple of example objects from the user and to use this information to extract new objects from new pages or texts. We propose a top-down strategy that extracts complex objects by decomposing them into less complex objects until atomic objects have been extracted. Through experimentation, we demonstrate that, with a small number of given examples, our strategy is able to extract most of the objects present in a Web source given as input.
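The top-down idea in the abstract can be illustrated with a small sketch. This is not the authors' algorithm: the `Node` class, the `extract` function, and the use of hand-written regular expressions in place of patterns learned from user examples are all assumptions made for illustration only. It shows only the decomposition step, where a complex object is located first and then split into simpler parts until atomic values remain.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Node:
    """Hypothetical object type; a node with no children is atomic."""
    name: str
    pattern: str  # regex locating this object inside its parent's text
    children: list = field(default_factory=list)


def extract(node, text):
    """Find each occurrence of `node` in `text`, then recurse into its parts."""
    results = []
    for match in re.finditer(node.pattern, text):
        fragment = match.group(0)
        if not node.children:
            # Atomic object: keep the matched text itself.
            results.append(fragment.strip())
        else:
            # Complex object: decompose the fragment into its components.
            obj = {}
            for child in node.children:
                found = extract(child, fragment)
                obj[child.name] = found[0] if len(found) == 1 else found
            results.append(obj)
    return results


# Toy input: one record per line, each with a title and a year.
book = Node("book", r"[^\n]+", [
    Node("title", r"^[^,]+"),
    Node("year", r"\d{4}"),
])
page = "First Title, 1999\nSecond Title, 2000"
records = extract(book, page)
```

Here `records` holds one dictionary per line of `page`, each with a `title` and a `year` entry. In the approach the abstract describes, the patterns would come from the user's example objects rather than being written by hand.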
Added 05 Aug 2010
Updated 05 Aug 2010
Type Conference
Year 1999
Where SPIRE
Authors Berthier A. Ribeiro-Neto, Alberto H. F. Laender, Altigran Soares da Silva