Search Sciweavers | Sciweavers

52 search results - page 5 / 11

» Effective compression for the web: exploiting document linka...

147

click to vote

WWW
2005
ACM

183views Internet Technology» more WWW 2005»

Improving Web search efficiency via a locality based static pruning method

16 years 6 months ago

Download www2005.org

The unarguably fast, and continuous, growth of the volume of indexed (and indexable) documents on the Web poses a great challenge for search engines. This is true regarding not on...

Edleno Silva de Moura, Célia Francisca dos ...

claim paper

Read More »

160

click to vote

WWW
2003
ACM

139views Internet Technology» more WWW 2003»

Detecting Near-replicas on the Web by Content and Hyperlink Analysis

16 years 6 months ago

Download nautilus.dii.unisi.it

The presence of replicas or near-replicas of documents is very common on the Web. Documents may be replicated completely or partially for different reasons (versions, mirrors, etc...

Ernesto Di Iorio, Michelangelo Diligenti, Marco Go...

claim paper

Read More »

194

click to vote

CACM
1998

110views more CACM 1998»

Viewing WISs as Database Applications

15 years 5 months ago

Download www.cs.toronto.edu

abstraction for modeling these problems is to view the Web as a collection of (usually small and heterogeneous) databases, and to view programs that extract and process Web data au...

Gustavo O. Arocena, Alberto O. Mendelzon

claim paper

Read More »

137

click to vote

WISE
2009
Springer

126views Internet Technology» more WISE 2009»

Entry Pairing in Inverted File

16 years 3 months ago

Download www.di.unipi.it

Abstract. This paper proposes to exploit content and usage information to rearrange an inverted index for a full-text IR system. The idea is to merge the entries of two frequently ...

Hoang Thanh Lam, Raffaele Perego, Nguyen Thoi Minh...

claim paper

Read More »

205

click to vote

WEBI
2005
Springer

216views Internet Technology» more WEBI 2005»

A Semi-Supervised Document Clustering Algorithm Based on EM

15 years 11 months ago

Download www.dii.unisi.it

Document clustering is a very hard task in Automatic Text Processing since it requires to extract regular patterns from a document collection without a priori knowledge on the cat...

Leonardo Rigutini, Marco Maggini

claim paper

Read More »

« Prev « First page 5 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers