Sciweavers

81 search results - page 4 / 17
» Estimating web site readability using content extraction
Sort
View
EWMF
2003
Springer
14 years 22 days ago
Mining Web Sites Using Wrapper Induction, Named Entities, and Post-processing
This paper presents a novel method for extracting information from collections of Web pages across different sites. Our method uses a standard wrapper induction algorithm and explo...
Georgios Sigletos, Georgios Paliouras, Constantine...
PKDD
2007
Springer
120views Data Mining» more  PKDD 2007»
14 years 1 months ago
Site-Independent Template-Block Detection
Detection of template and noise blocks in web pages is an important step in improving the performance of information retrieval and content extraction. Of the many approaches propos...
Aleksander Kolcz, Wen-tau Yih
IJHIS
2006
95views more  IJHIS 2006»
13 years 7 months ago
A hybrid system for concept-based web usage mining
A web site should be easy to browse by visitors. However, sometimes the reality is quite different. Situations like several unrelated topics in a single web page may lead to confus...
Sebastián A. Ríos, Juan D. Vel&aacut...
DILS
2009
Springer
14 years 2 months ago
Site-Wide Wrapper Induction for Life Science Deep Web Databases
We present a novel approach to automatic information extraction from Deep Web Life Science databases using wrapper induction. Traditional wrapper induction techniques focus on lear...
Saqib Mir, Steffen Staab, Isabel Rojas
IADIS
2003
13 years 9 months ago
An Examination of the Relationship between Information Provided and Sales When Dealing Online
This study stems from a suggestion in the literature (Lohse and Spiller, 1999) that for some products an increase in the amount of information presented on a web site has a negati...
Aritz Lopez Trueba, Thomas Chesney