Sciweavers

820 search results - page 100 / 164
» Deep web data extraction
Sort
View
SIGIR
2011
ACM
13 years 23 days ago
Pseudo test collections for learning web search ranking functions
Test collections are the primary drivers of progress in information retrieval. They provide a yardstick for assessing the effectiveness of ranking functions in an automatic, rapi...
Nima Asadi, Donald Metzler, Tamer Elsayed, Jimmy L...
SIGMOD
1999
ACM
114views Database» more  SIGMOD 1999»
14 years 2 months ago
A Layered Architecture for Querying Dynamic Web Content
The design of webbases, database systems for supporting Webbased applications, is currently an active area of research. In this paper, we propose a 3-layer architecture for design...
Hasan Davulcu, Juliana Freire, Michael Kifer, I. V...
WEBDB
1999
Springer
196views Database» more  WEBDB 1999»
14 years 2 months ago
Web Ecology: Recycling HTML Pages as XML Documents Using W4F
In this paper we present the World-Wide Web Wrapper Factory (W4F), a Java toolkit to generate wrappers for Web data sources. Some key features of W4F are an expressive language to...
Arnaud Sahuguet, Fabien Azavant
GIR
2007
ACM
14 years 1 months ago
Facilitating situation assessment through gir with multi-scale open source web documents
In this paper, we present our preliminary work on a Geographic Information Retrieval (GIR) system that utilizes loosely coupled web services and Google EarthTM (GE) to retrieve, e...
Brian M. Tomaszewski, Chi-Chun Pan, Prasenjit Mitr...
EUSFLAT
2003
145views Fuzzy Logic» more  EUSFLAT 2003»
13 years 11 months ago
Proximity fuzzy clustering for web context analysis
This study extends the web classification approach through a proximity-based fuzzy clustering sensible to the influence of the page. The proximity-based fuzzy clustering works in ...
Vincenzo Loia, Witold Pedrycz, Sabrina Senatore