Sciweavers

51 search results - page 4 / 11
» Classifying High-Dimensional Text and Web Data Using Very Sh...
IDA
2006
Springer
13 years 7 months ago
Sequential patterns for text categorization
Text categorization is a well-known task based essentially on statistical approaches using neural networks, Support Vector Machines and other machine learning algorithms. Texts are...
Simon Jaillet, Anne Laurent, Maguelonne Teisseire
KDD
2002
ACM
14 years 8 months ago
Combining clustering and co-training to enhance text classification using unlabelled data
In this paper, we present a new co-training strategy that makes use of unlabelled data. It trains two predictors in parallel, with each predictor labelling the unlabelled data for...
Bhavani Raskutti, Herman L. Ferrá, Adam Kow...
LREC
2008
13 years 9 months ago
Automatic Identification of Temporal Information in Tourism Web Pages
This paper presents our work on the detection of temporal information in web pages. The pages examined within the scope of this study were taken from the tourism sector and the te...
Stéphanie Weiser, Philippe Laublet, Jean-Lu...
MICAI
2007
Springer
14 years 1 month ago
Taking Advantage of the Web for Text Classification with Imbalanced Classes
A problem of supervised approaches for text classification is that they commonly require high-quality training data to construct an accurate classifier. Unfortunately, in many real...
Rafael Guzmán-Cabrera, Manuel Montes-y-G...
COMPSAC
2005
IEEE
14 years 1 month ago
Recovering "Lack of Words" in Text Categorization for Item Banks
PKIP, Patterned Keywords in Phrase, is our feature selection approach to text categorization (TC) for item banks. An item bank is a collection of textual data in which each item c...
Atorn Nuntiyagul, Nick Cercone, Kanlaya Naruedomku...