Sciweavers

232 search results - page 26 / 47
» Query-related data extraction of hidden web documents
COMAD
2009
15 years 4 months ago
Querying for relations from the semi-structured Web
We present a class of web queries whose result is a multi-column relation instead of a collection of unstructured documents as in standard web search. The user specifies the query...
Sunita Sarawagi
WWW
2004
ACM
16 years 4 months ago
An efficient and systematic method to generate xslt stylesheets for different wireless pervasive devices
It is a tedious and cumbersome process to directly update a WML document for the wireless Web because its content is composed of both data and presentation. Thus, XML is used to hand...
Thomas Kwok, Thao Nguyen, Linh Lam, Kakan Roy
WWW
2010
ACM
15 years 10 months ago
Not so creepy crawler: easy crawler generation with standard xml queries
Web crawlers are increasingly used for focused tasks such as the extraction of data from Wikipedia or the analysis of social networks like last.fm. In these cases, pages are far m...
Franziska von dem Bussche, Klara A. Weiand, Benedi...
WISE
2000
Springer
15 years 7 months ago
Modelling the Webspace of an Intranet
Searching the internet using the currently available search engines is not satisfactory. The techniques used there focus on the extraction of relevant information directly from the doc...
Roelof van Zwol, Peter M. G. Apers
ICWE
2007
Springer
15 years 9 months ago
Fixing Weakly Annotated Web Data Using Relational Models
In this paper, we present a fast and scalable Bayesian model for improving weakly annotated data – which is typically generated by a (semi) automated information extraction (IE) ...
Fatih Gelgi, Srinivas Vadrevu, Hasan Davulcu