Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

145

IRAL
2003
ACM

144views Information Technology» more IRAL 2003»

Keyword-based document clustering

15 years 12 months ago

Keyword-based document clustering

Download www.aclweb.org

1 Document clustering is an aggregation of related documents to a cluster based on the similarity evaluation task between documents and the representatives of clusters. Terms and their discriminating features of terms are the clue to the clustering and the discriminating features are based on the term and document frequencies. Feature selection method on the basis of frequency statistics has a limitation to the enhancement of the clustering algorithm because it does not consider the contents of the cluster objects. In this paper, we adopt a content-based analytic approach to refine the similarity computation and propose a keyword-based clustering algorithm. Experimental results show that content-based keyword weighting outperforms frequency-based weighting method.

Seung-Shik Kang

Real-time Traffic

Clustering Algorithm | Document | Document Clustering | Information Retrieval | IRAL 2003 |

claim paper

Related Content

» Effectiveness of KeywordBased Display and Selection of Retrieval Results for Interactive S...

» Accelerating Dynamic Web Content Delivery Using KeywordBased Fragment Detection

» A Study of ChunkBased and KeywordBased Approaches for Generating Headlines

» MOVE A Large Scale KeywordBased Content Filtering and Dissemination System

» Document Clustering with Grouping and Chaining Algorithms

» Event Detection and Tracking in Social Streams

» Information retrieval on mind maps what could it be good for

» Integrating clustering and multidocument summarization to improve document understanding

» Multilingual Document Clustering Using Wikipedia as External Knowledge

» Chinese Keyword Spotting Using KnowledgeBased Clustering

Post Info
More Details (n/a)

Added	05 Jul 2010
Updated	05 Jul 2010
Type	Conference
Year	2003
Where	IRAL
Authors	Seung-Shik Kang

Comments (0)