Sciweavers

106 search results - page 6 / 22
» Document Representation and Dimension Reduction for Text Clu...
Sort
View
JCP
2007
149views more  JCP 2007»
13 years 7 months ago
Partitional Clustering Techniques for Multi-Spectral Image Segmentation
Abstract— Analyzing unknown data sets such as multispectral images often requires unsupervised techniques. Data clustering is a well known and widely used approach in such cases....
Danielle Nuzillard, Cosmin Lazar
KDD
2009
ACM
243views Data Mining» more  KDD 2009»
14 years 8 months ago
Exploiting Wikipedia as external knowledge for document clustering
In traditional text clustering methods, documents are represented as "bags of words" without considering the semantic information of each document. For instance, if two ...
Xiaohua Hu, Xiaodan Zhang, Caimei Lu, E. K. Park, ...
BMCBI
2007
163views more  BMCBI 2007»
13 years 7 months ago
A coherent graph-based semantic clustering and summarization approach for biomedical literature and a new summarization evaluati
Background: A huge amount of biomedical textual information has been produced and collected in MEDLINE for decades. In order to easily utilize biomedical information in the free t...
Illhoi Yoo, Xiaohua Hu, Il-Yeol Song
CSDA
2006
85views more  CSDA 2006»
13 years 7 months ago
Two-way Poisson mixture models for simultaneous document classification and word clustering
An approach to simultaneous document classification and word clustering is developed using a two-way mixture model of Poisson distributions. Each document is represented by a vect...
Jia Li, Hongyuan Zha
ICDM
2008
IEEE
147views Data Mining» more  ICDM 2008»
14 years 2 months ago
Clustering Documents with Active Learning Using Wikipedia
Wikipedia has been applied as a background knowledge base to various text mining problems, but very few attempts have been made to utilize it for document clustering. In this pape...
Anna Huang, David N. Milne, Eibe Frank, Ian H. Wit...