Sciweavers

315 search results - page 42 / 63
» Text classification from positive and unlabeled documents
DRR
2004
13 years 10 months ago
A nonparametric classifier for unsegmented text
Symbolic Indirect Correlation (SIC) is a new classification method for unsegmented patterns. SIC requires two levels of comparisons. First, the feature sequences from an unknown q...
George Nagy, Ashutosh Joshi, Mukkai S. Krishnamoor...
ICDE
2010
IEEE
14 years 7 months ago
Viewing a World of Annotations through AnnoVIP
The proliferation of electronic content has notably led to the emergence of large corpora of interrelated structured documents (such as HTML and XML Web pages) and semantic annot...
Konstantinos Karanasos, Spyros Zoupanos
LREC
2010
13 years 9 months ago
Evaluating a Text Mining Based Educational Search Portal
In this paper, we present the main features of a text mining based search engine for the UK Educational Evidence Portal available at the UK National Centre for Text Mining (NaCTeM...
Sophia Ananiadou, John McNaught, James Thomas, Mar...
LREC
2010
13 years 9 months ago
Bigorna -- A Toolkit for Orthography Migration Challenges
Languages are born, evolve and, eventually, die. During this evolution their spelling rules (and sometimes the syntactic and semantic ones) change, putting old documents out of us...
José João Almeida, André Sant...
IAJIT
2011
13 years 2 months ago
Multilayer model for Arabic text compression
This article describes a multilayer model-based approach for text compression. It uses linguistic information to develop a multilayer decomposition model of the text in order to ...
Arafat Awajan