Search Sciweavers | Sciweavers

» Distributional Clustering of Words for Text Classification

PRIS
2004

Effect of Feature Smoothing Methods in Text Classification Tasks

Abstract. The number of features to be considered in a text classification system is given by the size of the vocabulary and this is normally in the range of the tens or hundreds o...

David Vilar, Hermann Ney, Alfons Juan, Enrique Vid...

ACST
2006

Distributed hierarchical document clustering

This paper investigates the applicability of a distributed clustering technique, called RACHET [1], to organize large sets of distributed text data. Although the authors of RACHET c...

Debzani Deb, M. Muztaba Fuad, Rafal A. Angryk

ACL
1994

A Corpus-Based Approach to Automatic Compound Extraction

An automatic compound retrieval method is proposed to extract compounds within a text message. It uses n-gram mutual information, relative frequency count and parts of speech as t...

Keh-Yih Su, Ming-Wen Wu, Jing-Shin Chang

ICML
2006
IEEE

Clustering documents with an exponential-family approximation of the Dirichlet compound multinomial distribution

The Dirichlet compound multinomial (DCM) distribution, also called the multivariate Polya distribution, is a model for text documents that takes into account burstiness: the fact ...

Charles Elkan

TREC
2007

WIM at TREC 2007

This paper introduces the four tracks in which WIM-Lab, Fudan University took part at TREC 2007. For the spam track, a multi-centre model was proposed considering the characteristi...

Jun Xu, Jing Yao, Jiaqian Zheng, Qi Sun, Junyu Niu
