Sciweavers

ICDAR
2009
IEEE
14 years 3 months ago
Robust Recognition of Documents by Fusing Results of Word Clusters
The word error rate of any optical character recognition system (OCR) is usually substantially below its component or character error rate. This is especially true of Indic langua...
Venkat Rasagna, Anand Kumar 0002, C. V. Jawahar, R...
TKDE
2011
13 years 3 months ago
Locally Consistent Concept Factorization for Document Clustering
Previous studies have demonstrated that document clustering performance can be improved significantly in lower dimensional linear subspaces. Recently, matrix factorization base...
Deng Cai, Xiaofei He, Jiawei Han
WEBI
2007
Springer
14 years 2 months ago
Pairwise Constraints-Guided Non-negative Matrix Factorization for Document Clustering
Nonnegative Matrix Factorization (NMF) has been proven to be effective in text mining. However, since NMF is a well-known unsupervised components analysis technique, the existing ...
Yujiu Yang, Bao-Gang Hu
IV
2008
IEEE
14 years 3 months ago
CAT: A Hierarchical Image Browser Using a Rectangle Packing Technique
The recent revolution of digital camera technology has resulted in much larger collections of images. Image browsing techniques thus become increasingly important for overview and...
Ai Gomi, Reiko Miyazaki, Takayuki Itoh, Jia Li
ICDE
2007
IEEE
14 years 3 months ago
Document Representation and Dimension Reduction for Text Clustering
Increasingly large text datasets and the high dimensionality associated with natural language create a great challenge in text mining. In this research, a systematic study is cond...
M. Mahdi Shafiei, Singer Wang, Roger Zhang, Evange...