Sciweavers

192 search results - page 11 / 39
» Language sensitive text classification
Sort
View
KDD
2008
ACM
199views Data Mining» more  KDD 2008»
14 years 8 months ago
Building semantic kernels for text classification using wikipedia
Document classification presents difficult challenges due to the sparsity and the high dimensionality of text data, and to the complex semantics of the natural language. The tradi...
Pu Wang, Carlotta Domeniconi
PRIS
2004
13 years 9 months ago
Effect of Feature Smoothing Methods in Text Classification Tasks
Abstract. The number of features to be considered in a text classification system is given by the size of the vocabulary and this is normally in the range of the tens or hundreds o...
David Vilar, Hermann Ney, Alfons Juan, Enrique Vid...
ICML
1999
IEEE
14 years 9 hour ago
Feature Engineering for Text Classification
Most research in text classification to date has used a “bag of words” representation in which each feature corresponds to a single word. This paper examines some alternative ...
Sam Scott, Stan Matwin
LREC
2010
196views Education» more  LREC 2010»
13 years 9 months ago
Event Models for Historical Perspectives: Determining Relations between High and Low Level Events in Text, Based on the Classifi
In this paper, we report on a study that was performed within the "Semantics of History" project on how descriptions of historical events are realized in different types...
Agata Cybulska, Piek Vossen
CICLING
2005
Springer
14 years 1 months ago
Disentangling from Babylonian Confusion - Unsupervised Language Identification
: This work presents an unsupervised solution to language identification. The method sorts multilingual text corpora on the basis of sentences into the different languages that are...
Christian Biemann, Sven Teresniak