Search Sciweavers | Sciweavers

77 search results - page 11 / 16

» Pairwise Document Similarity in Large Collections with MapRe...

WSDM
2010
ACM

Data Mining

Learning URL patterns for webpage de-duplication

14 years 2 months ago

Download www.wsdm-conference.org

Presence of duplicate documents in the World Wide Web adversely affects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...

Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...

CPM
2000
Springer

Combinatorics

Identifying and Filtering Near-Duplicate Documents

13 years 11 months ago

Download www.cs.princeton.edu

Abstract. The mathematical concept of document resemblance captures well the informal notion of syntactic similarity. The resemblance can be estimated using a fixed size “sketch...

Andrei Z. Broder

SEKE
2010
Springer

Software Engineering

Incremental Construction of Topic Hierarchies using Hierarchical Term Clustering

13 years 5 months ago

Download www.labic.icmc.usp.br

Topic hierarchies are very useful for managing, searching and browsing large repositories of text documents. The hierarchical clustering methods are used to support the constructi...

Ricardo M. Marcacini, Solange O. Rezende

ACL
1992

Computational Linguistics

SEXTANT: Exploring Unexplored Contexts for Semantic Extraction from Syntactic Analysis

13 years 8 months ago

Download www.aclweb.org

For a very long time, it has been considered that the only way of automatically extracting similar groups of words from a text collection for which no semantic information exists ...

Gregory Grefenstette

CIKM
2005
Springer

Information Technology

Query expansion using term relationships in language models for information retrieval

14 years 1 months ago

Download www.iro.umontreal.ca

Language Modeling (LM) has been successfully applied to Information Retrieval (IR). However, most of the existing LM approaches only rely on term occurrences in documents, queries...

Jing Bai, Dawei Song, Peter Bruza, Jian-Yun Nie, G...
