Search Sciweavers | Sciweavers

832 search results - page 22 / 167

» Document clustering with committees

144

click to vote

CSDA
2006

85views more CSDA 2006»

Two-way Poisson mixture models for simultaneous document classification and word clustering

15 years 6 months ago

Download www.stat.psu.edu

An approach to simultaneous document classification and word clustering is developed using a two-way mixture model of Poisson distributions. Each document is represented by a vect...

Jia Li, Hongyuan Zha

claim paper

Read More »

174

click to vote

HIS
2003

131views Information Technology» more HIS 2003»

Evolving Better Stoplists for Document Clustering and Web Intelligence

15 years 7 months ago

Download www.macs.hw.ac.uk

: Text classification, document clustering and similar document analysis tasks are currently the subject of significant global research, since such areas underpin web intelligence,...

Mark P. Sinka, David Corne

claim paper

Read More »

160

click to vote

ECIR
2008
Springer

185views Information Technology» more ECIR 2008»

Clustering Template Based Web Documents

15 years 7 months ago

Download www.informatik.uni-mainz.de

More and more documents on the World Wide Web are based on templates. On a technical level this causes those documents to have a quite similar source code and DOM tree structure. G...

Thomas Gottron

claim paper

Read More »

164

click to vote

AIMSA
2008
Springer

118views Artificial Intelligence» more AIMSA 2008»

Using Text Segmentation to Enhance the Cluster Hypothesis

16 years 17 days ago

Download www.info.univ-angers.fr

An alternative way to tackle Information Retrieval, called Passage Retrieval, considers text fragments independently rather than assessing global relevance of documents. In such a ...

Sylvain Lamprier, Tassadit Amghar, Bernard Levrat,...

claim paper

Read More »

179

click to vote

KDD
2009
ACM

243views Data Mining» more KDD 2009»

Exploiting Wikipedia as external knowledge for document clustering

16 years 6 months ago

Download www-ai.cs.uni-dortmund.de

In traditional text clustering methods, documents are represented as "bags of words" without considering the semantic information of each document. For instance, if two ...

Xiaohua Hu, Xiaodan Zhang, Caimei Lu, E. K. Park, ...

claim paper

Read More »

« Prev « First page 22 / 167 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers