Search Sciweavers | Sciweavers

315 search results - page 13 / 63

» Text classification from positive and unlabeled documents

CIT
2005
Springer

226 views, Information Technology

Simple Classification into Large Topic Ontology of Web Documents

15 years 5 months ago

Download eprints.pascal-network.org

The paper presents an approach to classifying Web documents into a large topic ontology. The main emphasis is on having a simple approach appropriate for handling a large ontology an...

Marko Grobelnik, Dunja Mladenic

SIGIR
2010
ACM

137 views, Information Technology

Combining coregularization and consensus-based self-training for multilingual text categorization

15 years 9 months ago

Download webia.lip6.fr

We investigate the problem of learning document classifiers in a multilingual setting, from collections where labels are only partially available. We address this problem in the ...

Massih-Reza Amini, Cyril Goutte, Nicolas Usunier

KDD
2009
ACM

269 views, Data Mining

Extracting discriminative concepts for domain adaptation in text mining

16 years 6 months ago

Download 140.123.102.14

One common predictive modeling challenge in text mining problems is that the training data and the operational (testing) data are drawn from different underlying distributi...

Bo Chen, Wai Lam, Ivor Tsang, Tak-Lam Wong

ICDAR
2003
IEEE

191 views, Document Analysis

Document page similarity based on layout visual saliency: Application to query by example and document classification

15 years 11 months ago

Download www.cse.salford.ac.uk

In this paper we propose to define a measure of visual similarity to compare different pages in a corpus. This measure is based on the analysis of the visual layout saliency of th...

Véronique Eglin, Stéphane Bres

DAWAK
2008
Springer

126 views, Information Technology

Document-Base Extraction for Single-Label Text Classification

15 years 7 months ago

Download www.csc.liv.ac.uk

Many text mining applications, especially when investigating Text Classification (TC), require experiments to be performed using common text collections, such that results can be co...

Yanbo J. Wang, Robert Sanderson, Frans Coenen, Pau...
