document clustering | Sciweavers

176

IPM
2002

92views more IPM 2002»

The effectiveness of query-specific hierarchic clustering in information retrieval

15 years 6 months ago

Hierarchic document clustering has been widely applied to Information Retrieval (IR) on the grounds of its potential improved effectiveness over inverted file search. However, pre...

Anastasios Tombros, Robert Villa, C. J. van Rijsbe...

claim paper

Read More »

169

click to vote

SDM
2003
SIAM

134views Data Mining» more SDM 2003»

Hierarchical Document Clustering using Frequent Itemsets

15 years 7 months ago

Download www.cs.sfu.ca

A major challenge in document clustering is the extremely high dimensionality. For example, the vocabulary for a document set can easily be thousands of words. On the other hand, ...

Benjamin C. M. Fung, Ke Wang, Martin Ester

claim paper

Read More »

216

click to vote

ACST
2006

274views Computer Science» more ACST 2006»

Distributed hierarchical document clustering

15 years 8 months ago

Download nsm1.nsm.iup.edu

This paper investigates the applicability of distributed clustering technique, called RACHET [1], to organize large sets of distributed text data. Although the authors of RACHET c...

Debzani Deb, M. Muztaba Fuad, Rafal A. Angryk

claim paper

Read More »

159

click to vote

LWA
2007

157views Software Engineering» more LWA 2007»

Multi-objective Frequent Termset Clustering

15 years 8 months ago

Download www-ai.cs.uni-dortmund.de

Large, high dimensional data spaces, are still a challenge for current data clustering methods. Frequent Termset (FTS) clustering is a technique developed to cope with these chall...

Andreas Kaspari, Michael Wurst

claim paper

Read More »

128

click to vote

LREC
2008

98views Education» more LREC 2008»

Ping-pong Document Clustering using NMF and Linkage-Based Refinement

15 years 8 months ago

Download www.lrec-conf.org

This paper proposes a ping-pong document clustering method using NMF and the linkage based refinement alternately, in order to improve the clustering result of NMF. The use of NMF...

Hiroyuki Shinnou, Minoru Sasaki

claim paper

Read More »

178

click to vote

CIKM
2008
Springer

108views Information Technology» more CIKM 2008»

Integrating clustering and multi-document summarization to improve document understanding

15 years 8 months ago

Download www.nec-labs.com

Document understanding techniques such as document clustering and multi-document summarization have been receiving much attention in recent years. Current document clustering meth...

Dingding Wang, Shenghuo Zhu, Tao Li, Yun Chi, Yiho...

claim paper

Read More »

173

click to vote

CIKM
2008
Springer

141views Information Technology» more CIKM 2008»

Winnowing-based text clustering

15 years 8 months ago

Download www.dc.fi.udc.es

We present an approach to document clustering based on winnowing fingerprints that achieved good values of effectiveness with considerable save in memory space and computation tim...

Javier Parapar, Alvaro Barreiro

claim paper

Read More »

158

click to vote

CIKM
2008
Springer

115views Information Technology» more CIKM 2008»

An extension of PLSA for document clustering

15 years 8 months ago

Download eprints.pascal-network.org

In this paper we propose an extension of the PLSA model in which an extra latent variable allows the model to cocluster documents and terms simultaneously. We show on three datase...

Young-Min Kim, Jean-François Pessiot, Massi...

claim paper

Read More »

165

click to vote

CIKM
2006
Springer

156views Information Technology» more CIKM 2006»

Incremental hierarchical clustering of text documents

15 years 10 months ago

Download www.cs.cmu.edu

Incremental hierarchical text document clustering algorithms are important in organizing documents generated from streaming on-line sources, such as, Newswire and Blogs. However, ...

Nachiketa Sahoo, Jamie Callan, Ramayya Krishnan, G...

claim paper

Read More »

200

click to vote

AIRS
2006
Springer

183views Information Technology» more AIRS 2006»

A Novel Ant-Based Clustering Approach for Document Clustering

15 years 10 months ago

Download people.kmi.open.ac.uk

Recently, much research has been proposed using nature inspired algorithms to perform complex machine learning tasks. Ant Colony Optimization (ACO) is one such algorithm based on s...

Yulan He, Siu Cheung Hui, Yongxiang Sim

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers