Sciweavers

106 search results - page 5 / 22
» Document Representation and Dimension Reduction for Text Clu...
Sort
View
PAKDD
2009
ACM
127views Data Mining» more  PAKDD 2009»
14 years 2 months ago
Clustering Documents Using a Wikipedia-Based Concept Representation
Abstract. This paper shows how Wikipedia and the semantic knowledge it contains can be exploited for document clustering. We first create a concept-based document representation b...
Anna Huang, David N. Milne, Eibe Frank, Ian H. Wit...
BMCBI
2010
243views more  BMCBI 2010»
13 years 7 months ago
Comparative study of unsupervised dimension reduction techniques for the visualization of microarray gene expression data
Background: Visualization of DNA microarray data in two or three dimensional spaces is an important exploratory analysis step in order to detect quality issues or to generate new ...
Christoph Bartenhagen, Hans-Ulrich Klein, Christia...
ICDM
2003
IEEE
138views Data Mining» more  ICDM 2003»
14 years 25 days ago
Ontologies Improve Text Document Clustering
Text document clustering plays an important role in providing intuitive navigation and browsing mechanisms by organizing large sets of documents into a small number of meaningful ...
Andreas Hotho, Steffen Staab, Gerd Stumme
SDM
2007
SIAM
182views Data Mining» more  SDM 2007»
13 years 9 months ago
Distance Preserving Dimension Reduction for Manifold Learning
Manifold learning is an effective methodology for extracting nonlinear structures from high-dimensional data with many applications in image analysis, computer vision, text data a...
Hyunsoo Kim, Haesun Park, Hongyuan Zha
CASCON
2006
150views Education» more  CASCON 2006»
13 years 9 months ago
Exploring a new space of features for document classification: figure clustering
Automatic document classification is an important step in organizing and mining documents. Information in documents is often conveyed using both text and images that complement ea...
Nawei Chen, Hagit Shatkay, Dorothea Blostein