Sciweavers

1947 search results - page 19 / 390
» On the Automatic Extraction of Data from the Hidden Web
IADIS
2003
13 years 10 months ago
Data Extraction from Web Database Query Result Pages via Tagsets and Integer Sequences
The World Wide Web is a collection of databases as well as web sites. Databases associated with web sites provide public access via query forms on web pages. They constitute an en...
Jerome Robinson
KDD
2010
ACM
244 views, Data Mining
14 years 16 days ago
Connecting the dots between news articles
The process of extracting useful knowledge from large datasets has become one of the most pressing problems in today’s society. The problem spans entire sectors, from scientists...
Dafna Shahaf, Carlos Guestrin
IDEAS
2005
IEEE
142 views, Database
14 years 2 months ago
Automatically Maintaining Wrappers for Web Sources
A substantial subset of the web data follows some kind of underlying structure. Nevertheless, HTML does not contain any schema or semantic information about the data it represents...
Juan Raposo, Alberto Pan, Manuel Álvarez, J...
IICAI
2003
13 years 10 months ago
Web Usage Mining: Extraction, Maintenance and Behaviour Trends
With the growing popularity of the web, large volumes of data are gathered automatically by Web Servers and collected into access log files. Analysis of such files is generally cal...
Pierre-Alain Laur, Maguelonne Teisseire, Pascal Po...
EDBT
2011
ACM
222 views, Database
13 years 4 days ago
The hidden web, XML and the Semantic Web: scientific data management perspectives
The World Wide Web no longer consists just of HTML pages. Our work sheds light on a number of trends on the Internet that go beyond simple Web pages. The hidden Web provides a wea...
Fabian M. Suchanek, Aparna S. Varde, Richi Nayak, ...