Sciweavers

COOPIS
1997
IEEE

Semi-Automatic Wrapper Generation for Internet Information Sources

14 years 4 months ago
Semi-Automatic Wrapper Generation for Internet Information Sources
To simplify the task of obtaining information from the vast number of information sources that are available on the World Wide Web (WWW), we are building tools to build information mediators for extracting and integrating data from multiple Web sources. In a mediator based approach, wrappers are built around individual information sources, that provide translation between the mediator query language and the individual source. We present an approach for semi-automatically generating wrappers for structured internet sources. The key idea is to exploit formatting information in Web pages from the source to hypothesize the underlying structure of a page. From this structure the system generates a wrapper thatfacilitatesquerying ofa source and possibly integrating it with other sources. We demonstrate the ease with which we are able to build wrappers for a number of Web sources using our implemented wrapper generation toolkit.
Naveen Ashish, Craig A. Knoblock
Added 05 Aug 2010
Updated 05 Aug 2010
Type Conference
Year 1997
Where COOPIS
Authors Naveen Ashish, Craig A. Knoblock
Comments (0)