Sciweavers

2190 search results - page 53 / 438
» Unweaving a web of documents
Sort
View
WEBDB
1999
Springer
196views Database» more  WEBDB 1999»
14 years 2 months ago
Web Ecology: Recycling HTML Pages as XML Documents Using W4F
In this paper we present the World-Wide Web Wrapper Factory (W4F), a Java toolkit to generate wrappers for Web data sources. Some key features of W4F are an expressive language to...
Arnaud Sahuguet, Fabien Azavant
EUSFLAT
2003
100views Fuzzy Logic» more  EUSFLAT 2003»
13 years 11 months ago
Evaluating the informative quality of web documents using fuzzy linguistic techniques
Recommender systems evaluate and filter the great amount of information available on the Web to assist people in their search processes. A fuzzy linguistic evaluation method of We...
Enrique Herrera-Viedma, Eduardo Peis, Jesus Canelo...
CIKM
2005
Springer
14 years 3 months ago
Document quality models for web ad hoc retrieval
The quality of document content, which is an issue that is usually ignored for the traditional ad hoc retrieval task, is a critical issue for Web search. Web pages have a huge var...
Yun Zhou, W. Bruce Croft
ICPR
2008
IEEE
14 years 11 months ago
Clustering of short commercial documents for the web
Document clustering techniques have been applied in several areas, with the web as one of the most recent and influent. Both general-purpose and text-oriented techniques exist and...
Elisabetta Binaghi, Ignazio Gallo, Moreno Carullo,...
IJMSO
2008
149views more  IJMSO 2008»
13 years 10 months ago
Categorisation of web documents using extraction ontologies
: Automatically recognising which HTML documents on the Web contain items of interest for a user is non-trivial. As a step toward solving this problem, we propose an approach based...
Li Xu, David W. Embley