Search Sciweavers | Sciweavers

192 search results - page 11 / 39

» Language sensitive text classification

213

click to vote

KDD
2008
ACM

199views Data Mining» more KDD 2008»

Building semantic kernels for text classification using wikipedia

16 years 7 months ago

Download cs.gmu.edu

Document classification presents difficult challenges due to the sparsity and the high dimensionality of text data, and to the complex semantics of the natural language. The tradi...

Pu Wang, Carlotta Domeniconi

claim paper

Read More »

165

Voted

PRIS
2004

129views Pattern Recognition» more PRIS 2004»

Effect of Feature Smoothing Methods in Text Classification Tasks

15 years 8 months ago

Download www-i6.informatik.rwth-aachen.de

Abstract. The number of features to be considered in a text classification system is given by the size of the vocabulary and this is normally in the range of the tens or hundreds o...

David Vilar, Hermann Ney, Alfons Juan, Enrique Vid...

claim paper

Read More »

160

click to vote

ICML
1999
IEEE

195views Machine Learning» more ICML 1999»

Feature Engineering for Text Classification

15 years 11 months ago

Download gking.harvard.edu

Most research in text classification to date has used a “bag of words” representation in which each feature corresponds to a single word. This paper examines some alternative ...

Sam Scott, Stan Matwin

claim paper

Read More »

179

click to vote

LREC
2010

196views Education» more LREC 2010»

Event Models for Historical Perspectives: Determining Relations between High and Low Level Events in Text, Based on the Classifi

15 years 8 months ago

Download www.lrec-conf.org

In this paper, we report on a study that was performed within the "Semantics of History" project on how descriptions of historical events are realized in different types...

Agata Cybulska, Piek Vossen

claim paper

Read More »

173

click to vote

CICLING
2005
Springer

247views Natural Language Processing» more CICLING 2005»

Disentangling from Babylonian Confusion - Unsupervised Language Identification

16 years 6 days ago

Download wortschatz.uni-leipzig.de

: This work presents an unsupervised solution to language identification. The method sorts multilingual text corpora on the basis of sentences into the different languages that are...

Christian Biemann, Sven Teresniak

claim paper

Read More »

« Prev « First page 11 / 39 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers