Sciweavers

Deep web data extraction
SIGIR
2011
ACM
14 years 5 months ago
Pseudo test collections for learning web search ranking functions
Test collections are the primary drivers of progress in information retrieval. They provide a yardstick for assessing the effectiveness of ranking functions in an automatic, rapi...
Nima Asadi, Donald Metzler, Tamer Elsayed, Jimmy L...
SIGMOD
1999
ACM
15 years 6 months ago
A Layered Architecture for Querying Dynamic Web Content
The design of webbases, database systems for supporting Web-based applications, is currently an active area of research. In this paper, we propose a 3-layer architecture for design...
Hasan Davulcu, Juliana Freire, Michael Kifer, I. V...
WEBDB
1999
Springer
15 years 6 months ago
Web Ecology: Recycling HTML Pages as XML Documents Using W4F
In this paper we present the World-Wide Web Wrapper Factory (W4F), a Java toolkit to generate wrappers for Web data sources. Some key features of W4F are an expressive language to...
Arnaud Sahuguet, Fabien Azavant
GIR
2007
ACM
15 years 6 months ago
Facilitating situation assessment through GIR with multi-scale open source web documents
In this paper, we present our preliminary work on a Geographic Information Retrieval (GIR) system that utilizes loosely coupled web services and Google Earth™ (GE) to retrieve, e...
Brian M. Tomaszewski, Chi-Chun Pan, Prasenjit Mitr...
EUSFLAT
2003
15 years 3 months ago
Proximity fuzzy clustering for web context analysis
This study extends the web classification approach through a proximity-based fuzzy clustering sensitive to the influence of the page. The proximity-based fuzzy clustering works in ...
Vincenzo Loia, Witold Pedrycz, Sabrina Senatore