Sciweavers

203 search results - page 6 / 41
» Conceptual-Model-Based Data Extraction from Multiple-Record ...
WWW
2009
ACM
14 years 8 months ago
Incorporating site-level knowledge to extract structured data from web forums
Web forums have become an important data resource for many web applications, but extracting structured data from unstructured web forum pages is still a challenging task due to bo...
Jiang-Ming Yang, Rui Cai, Yida Wang, Jun Zhu, Lei ...
UIST
1998
ACM
13 years 11 months ago
Internet Scrapbook: Automating Web Browsing Tasks by Demonstration
This paper describes a programming-by-demonstration system, called Internet Scrapbook, which allows users with little programming skill to automate repetitive browsing tasks. With...
Atsushi Sugiura, Yoshiyuki Koseki
VLDB
2001
ACM
14 years 19 hours ago
RoadRunner: Towards Automatic Data Extraction from Large Web Sites
The paper investigates techniques for extracting data from HTML sites through the use of automatically generated wrappers. To automate the wrapper generation and the data extracti...
Valter Crescenzi, Giansalvatore Mecca, Paolo Meria...
ER
2001
Springer
14 years 2 days ago
On the Automatic Extraction of Data from the Hidden Web
An increasing amount of Web data is accessible only by filling out HTML forms to query an underlying data source. While this is most welcome from a user perspective (queries are e...
Stephen W. Liddle, Sai Ho Yau, David W. Embley
WWW
2011
ACM
13 years 2 months ago
Web information extraction using Markov logic networks
In this paper, we consider the problem of extracting structured data from web pages taking into account both the content of individual attributes as well as the structure of pages...
Sandeepkumar Satpal, Sahely Bhadra, Sundararajan S...