Sciweavers

2677 search results - page 40 / 536
Extracting Structured Data from Web Pages
ACMICEC
2006
ACM
14 years 2 months ago
From HTML documents to web tables and rules
We present a browser-extending Semantic Web extraction system that maps HTML documents to tables and, where possible, to rules. First, the basic data extractor ViPER distills and ...
Kai Simon, Georg Lausen, Harold Boley
JAIR
2008
13 years 8 months ago
Creating Relational Data from Unstructured and Ungrammatical Data Sources
In order for agents to act on behalf of users, they will have to retrieve and integrate vast amounts of textual data on the World Wide Web. However, much of the useful data on the...
Matthew Michelson, Craig A. Knoblock
RULEML
2004
Springer
14 years 2 months ago
Rule Learning for Feature Values Extraction from HTML Product Information Sheets
The Web is now a huge information repository with a rich semantic structure that, however, is primarily addressed to human understanding rather than automated processing by a compu...
Costin Badica, Amelia Badica
PKDD
2007
Springer
14 years 2 months ago
Site-Independent Template-Block Detection
Detection of template and noise blocks in web pages is an important step in improving the performance of information retrieval and content extraction. Of the many approaches propos...
Aleksander Kolcz, Wen-tau Yih
IJCAI
2003
13 years 10 months ago
Visual Programming of Web Data Aggregation Applications
Most of the information needs today can be satisfied by searching and browsing the Web. However, repetitive tasks such as monitoring information on Web sites should be done autom...
Robert Baumgartner, Georg Gottlob, Marcus Herzog