Sciweavers

8316 search results - page 118 / 1664
» Web Document Modeling
Sort
View
TMM
2002
140views more  TMM 2002»
13 years 10 months ago
Narrowing the semantic gap - improved text-based web document retrieval using visual features
In this paper, we present the results of our work that seek to negotiate the gap between low-level features and high-level concepts in the domain of web document retrieval. This wo...
Rong Zhao, William I. Grosky
ICCS
2009
Springer
14 years 5 months ago
Frequent Itemset Mining for Clustering Near Duplicate Web Documents
A vast amount of documents in the Web have duplicates, which is a challenge for developing efficient methods that would compute clusters of similar documents. In this paper we use ...
Dmitry I. Ignatov, Sergei O. Kuznetsov
WEBDB
1999
Springer
196views Database» more  WEBDB 1999»
14 years 3 months ago
Web Ecology: Recycling HTML Pages as XML Documents Using W4F
In this paper we present the World-Wide Web Wrapper Factory (W4F), a Java toolkit to generate wrappers for Web data sources. Some key features of W4F are an expressive language to...
Arnaud Sahuguet, Fabien Azavant
EUSFLAT
2003
100views Fuzzy Logic» more  EUSFLAT 2003»
14 years 4 days ago
Evaluating the informative quality of web documents using fuzzy linguistic techniques
Recommender systems evaluate and filter the great amount of information available on the Web to assist people in their search processes. A fuzzy linguistic evaluation method of We...
Enrique Herrera-Viedma, Eduardo Peis, Jesus Canelo...
IJMSO
2008
149views more  IJMSO 2008»
13 years 10 months ago
Categorisation of web documents using extraction ontologies
: Automatically recognising which HTML documents on the Web contain items of interest for a user is non-trivial. As a step toward solving this problem, we propose an approach based...
Li Xu, David W. Embley