document clustering | Sciweavers

SIGIR
2005
ACM

Relation between PLSA and NMF and implications

16 years 2 days ago

Non-negative Matrix Factorization (NMF, [5]) and Probabilistic Latent Semantic Analysis (PLSA, [4]) have been successfully applied to a number of text analysis tasks such as docum...

Éric Gaussier, Cyril Goutte

ICAIL
2005
ACM

Effective Document Clustering for Large Heterogeneous Law Firm Collections

16 years 2 days ago

Download glaros.dtc.umn.edu

Computational resources for research in legal environments have historically implied remote access to large databases of legal documents such as case law, statutes, law reviews an...

Jack G. Conrad, Khalid Al-Kofahi, Ying Zhao, Georg...

SIGIR
2006
ACM

Feature diversity in cluster ensembles for robust document clustering

16 years 14 days ago

Download serpens.salleurl.edu

The performance of document clustering systems depends on employing optimal text representations, which are not only difficult to determine beforehand, but also may vary from one ...

Xavier Sevillano, Germán Cobo, Francesc Al...

JCDL
2006
ACM

A comprehensive comparison study of document clustering for a biomedical digital library MEDLINE

16 years 14 days ago

Download www.ischool.drexel.edu

Document clustering has been used for better document retrieval, document browsing, and text mining in digital library. In this paper, we perform a comprehensive comparison study ...

Illhoi Yoo, Xiaohua Hu

ICDM
2006
IEEE

High Quality, Efficient Hierarchical Document Clustering Using Closed Interesting Itemsets

16 years 17 days ago

Download www.cs.columbia.edu

High dimensionality remains a significant challenge for document clustering. Recent approaches used frequent itemsets and closed frequent itemsets to reduce dimensionality, and to...

Hassan H. Malik, John R. Kender

ICDE
2006
IEEE

Novelty-based Incremental Document Clustering for On-line Documents

16 years 17 days ago

Download www.db.itc.nagoya-u.ac.jp

Document clustering has been used as a core technique in managing vast amount of data and providing needed information. In on-line environments, generally new information gains mo...

Sophoin Khy, Yoshiharu Ishikawa, Hiroyuki Kitagawa

CBMS
2006
IEEE

Biomedical Ontology MeSH Improves Document Clustering Qualify on MEDLINE Articles: A Comparison Study

16 years 17 days ago

Download www.cis.drexel.edu

Document clustering has been used for better document retrieval, document browsing, and text mining. In this paper, we investigate if biomedical ontology MeSH improves the cluster...

Illhoi Yoo, Xiaohua Hu

ICDM
2007
IEEE

GDClust: A Graph-Based Document Clustering Technique

16 years 25 days ago

Download www.cs.montana.edu

This paper introduces a new technique of document clustering based on frequent senses. The proposed system, GDClust (Graph-Based Document Clustering) works with frequent senses ra...

M. Shahriar Hossain, Rafal A. Angryk

DASFAA
2007
IEEE

A Comparative Study of Ontology Based Term Similarity Measures on PubMed Document Clustering

16 years 26 days ago

Download www.pages.drexel.edu

Recent research shows that ontology as background knowledge can improve document clustering quality with its concept hierarchy knowledge. Previous studies take term semantic simila...

Xiaodan Zhang, Liping Jing, Xiaohua Hu, Michael K....

ICDM
2008
IEEE

Clustering Documents with Active Learning Using Wikipedia

16 years 29 days ago

Download www.cs.waikato.ac.nz

Wikipedia has been applied as a background knowledge base to various text mining problems, but very few attempts have been made to utilize it for document clustering. In this pape...

Anna Huang, David N. Milne, Eibe Frank, Ian H. Wit...
