Search Sciweavers | Sciweavers

328 search results - page 13 / 66

» A Multi-level Approach for Document Clustering

ECIR
2008
Springer

185 views, Information Technology

Clustering Template Based Web Documents

15 years 8 months ago

Download www.informatik.uni-mainz.de

More and more documents on the World Wide Web are based on templates. On a technical level this causes those documents to have a quite similar source code and DOM tree structure. G...

Thomas Gottron

KDD
2009
ACM

243 views, Data Mining

Exploiting Wikipedia as external knowledge for document clustering

16 years 7 months ago

Download www-ai.cs.uni-dortmund.de

In traditional text clustering methods, documents are represented as "bags of words" without considering the semantic information of each document. For instance, if two ...

Xiaohua Hu, Xiaodan Zhang, Caimei Lu, E. K. Park, ...

CSDA
2006

85 views

Two-way Poisson mixture models for simultaneous document classification and word clustering

15 years 7 months ago

Download www.stat.psu.edu

An approach to simultaneous document classification and word clustering is developed using a two-way mixture model of Poisson distributions. Each document is represented by a vect...

Jia Li, Hongyuan Zha

SIGIR
2002
ACM

152 views, Information Technology

Unsupervised document classification using sequential information maximization

15 years 6 months ago

Download www.cs.huji.ac.il

We present a novel sequential clustering algorithm which is motivated by the Information Bottleneck (IB) method. In contrast to the agglomerative IB algorithm, the new sequential ...

Noam Slonim, Nir Friedman, Naftali Tishby

TKDE
2011

280 views

Locally Consistent Concept Factorization for Document Clustering

15 years 2 months ago

Download people.cs.uchicago.edu

Previous studies have demonstrated that document clustering performance can be improved significantly in lower dimensional linear subspaces. Recently, matrix factorization base...

Deng Cai, Xiaofei He, Jiawei Han
