Text Classification | Sciweavers

166

CORR
2010
Springer

215views Education» more CORR 2010»

Text Classification using the Concept of Association Rule of Data Mining

15 years 6 months ago

As the amount of online text increases, the demand for text classification to aid the analysis and management of text is increasing. Text is cheap, but information, in the form of...

Chowdhury Mofizur Rahman, Ferdous Ahmed Sohel, Par...

claim paper

Read More »

185

click to vote

SIGIR
2010
ACM

155views Information Technology» more SIGIR 2010»

SED: supervised experimental design and its application to text classification

15 years 6 months ago

Download www.cse.ust.hk

In recent years, active learning methods based on experimental design achieve state-of-the-art performance in text classification applications. Although these methods can exploit ...

Yi Zhen, Dit-Yan Yeung

claim paper

Read More »

163

click to vote

PRIS
2004

129views Pattern Recognition» more PRIS 2004»

Effect of Feature Smoothing Methods in Text Classification Tasks

15 years 8 months ago

Download www-i6.informatik.rwth-aachen.de

Abstract. The number of features to be considered in a text classification system is given by the size of the vocabulary and this is normally in the range of the tens or hundreds o...

David Vilar, Hermann Ney, Alfons Juan, Enrique Vid...

claim paper

Read More »

169

click to vote

CLIN
2001

184views Computational Linguistics» more CLIN 2001»

Accurate Stemming of Dutch for Text Classification

15 years 8 months ago

Download odur.let.rug.nl

This paper investigates the use of stemming for classification of Dutch (email) texts. We introduce a stemmer, which combines dictionary lookup (implemented efficiently as a finit...

Tanja Gaustad, Gosse Bouma

claim paper

Read More »

171

click to vote

CASCON
2001

148views Education» more CASCON 2001»

Email classification with co-training

15 years 8 months ago

Download www.site.uottawa.ca

The main problems in text classification are lack of labeled data, as well as the cost of labeling the unlabeled data. We address these problems by exploring co-training - an algo...

Svetlana Kiritchenko, Stan Matwin

claim paper

Read More »

187

click to vote

FLAIRS
2006

119views Artificial Intelligence» more FLAIRS 2006»

Using Web Searches on Important Words to Create Background Sets for LSI Classification

15 years 8 months ago

Download www.cs.csi.cuny.edu

The world wide web has a wealth of information that is related to almost any text classification task. This paper presents a method for mining the web to improve text classificati...

Sarah Zelikovitz, Marina Kogan

claim paper

Read More »

175

Voted

FLAIRS
2004

175views Artificial Intelligence» more FLAIRS 2004»

Automatic Generation of Background Text to Aid Classification

15 years 8 months ago

Download www.cs.csi.cuny.edu

We illustrate that Web searches can often be utilized to generate background text for use with text classification. This is the case because there are frequently many pages on the...

Sarah Zelikovitz, Robert Hafner

claim paper

Read More »

165

click to vote

DMIN
2006

114views Data Mining» more DMIN 2006»

Towards Using Fewer Features for Text Classification

15 years 8 months ago

Download ww1.ucmss.com

Abstract-- Text classification or categorization is a conventional classification problem applied to the text domain. In the cases when statistical classification methods are used,...

Yuan Yuan, Tianyang Gu

claim paper

Read More »

185

click to vote

ICMLA
2008

131views Machine Learning» more ICMLA 2008»

Text Classification Using Tree Kernels and Linguistic Information

15 years 8 months ago

Download www.di.uevora.pt

Standard Machine Learning approaches to text classification use the bag-of-words representation of documents to deceive the classification target function. Typical linguistic stru...

Teresa Gonçalves, Paulo Quaresma

claim paper

Read More »

195

click to vote

ECIR
2008
Springer

103views Information Technology» more ECIR 2008»

Semi-supervised Document Classification with a Mislabeling Error Model

15 years 8 months ago

Download eprints.pascal-network.org

Abstract. This paper investigates a new extension of the Probabilistic Latent Semantic Analysis (PLSA) model [6] for text classification where the training set is partially labeled...

Anastasia Krithara, Massih-Reza Amini, Jean-Michel...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers