Search Sciweavers | Sciweavers

8316 search results - page 118 / 1664

» Web Document Modeling

127

Voted

TMM
2002

140views more TMM 2002»

Narrowing the semantic gap - improved text-based web document retrieval using visual features

15 years 2 months ago

Download www.cs.sunysb.edu

In this paper, we present the results of our work that seek to negotiate the gap between low-level features and high-level concepts in the domain of web document retrieval. This wo...

Rong Zhao, William I. Grosky

claim paper

Read More »

115

click to vote

ICCS
2009
Springer

107views Applied Computing» more ICCS 2009»

Frequent Itemset Mining for Clustering Near Duplicate Web Documents

15 years 9 months ago

Download www.mendeley.com

A vast amount of documents in the Web have duplicates, which is a challenge for developing eﬃcient methods that would compute clusters of similar documents. In this paper we use ...

Dmitry I. Ignatov, Sergei O. Kuznetsov

claim paper

Read More »

144

click to vote

WEBDB
1999
Springer

196views Database» more WEBDB 1999»

Web Ecology: Recycling HTML Pages as XML Documents Using W4F

15 years 6 months ago

Download db.cis.upenn.edu

In this paper we present the World-Wide Web Wrapper Factory (W4F), a Java toolkit to generate wrappers for Web data sources. Some key features of W4F are an expressive language to...

Arnaud Sahuguet, Fabien Azavant

claim paper

Read More »

click to vote

EUSFLAT
2003

100views Fuzzy Logic» more EUSFLAT 2003»

Evaluating the informative quality of web documents using fuzzy linguistic techniques

15 years 3 months ago

Download www.eusflat.org

Recommender systems evaluate and filter the great amount of information available on the Web to assist people in their search processes. A fuzzy linguistic evaluation method of We...

Enrique Herrera-Viedma, Eduardo Peis, Jesus Canelo...

claim paper

Read More »

133

Voted

IJMSO
2008

149views more IJMSO 2008»

Categorisation of web documents using extraction ontologies

15 years 2 months ago

Download www.deg.byu.edu

: Automatically recognising which HTML documents on the Web contain items of interest for a user is non-trivial. As a step toward solving this problem, we propose an approach based...

Li Xu, David W. Embley

claim paper

Read More »

« Prev « First page 118 / 1664 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers