Sciweavers

468 search results - page 2 / 94
» Automatic Data Extraction from Data-Rich Web Pages
Sort
View
SIGMOD
2003
ACM
190views Database» more  SIGMOD 2003»
14 years 3 days ago
Extracting Structured Data from Web Pages
Many web sites contain large sets of pages generated using a common template or layout. For example, Amazon lays out the author, title, comments, etc. in the same way in all its b...
Arvind Arasu, Hector Garcia-Molina
WISE
2005
Springer
14 years 13 days ago
NET - A System for Extracting Web Data from Flat and Nested Data Records
This paper studies automatic extraction of structured data from Web pages. Each of such pages may contain several groups of structured data records. Existing automatic methods stil...
Bing Liu, Yanhong Zhai
VLDB
2001
ACM
144views Database» more  VLDB 2001»
13 years 11 months ago
RoadRunner: Towards Automatic Data Extraction from Large Web Sites
The paper investigates techniques for extracting data from HTML sites through the use of automatically generated wrappers. To automate the wrapper generation and the data extracti...
Valter Crescenzi, Giansalvatore Mecca, Paolo Meria...
ER
2001
Springer
148views Database» more  ER 2001»
13 years 11 months ago
On the Automatic Extraction of Data from the Hidden Web
An increasing amount of Web data is accessible only by filling out HTML forms to query an underlying data source. While this is most welcome from a user perspective (queries are e...
Stephen W. Liddle, Sai Ho Yau, David W. Embley
DEXAW
2004
IEEE
130views Database» more  DEXAW 2004»
13 years 10 months ago
Data Extraction from Web Data Sources
This paper provides an explanation of the basic data structures used in a new page analysis technique to create wrappers (data extractors) for the result pages produced by web sit...
Jerome Robinson