Sciweavers

315 search results - page 19 / 63
» Text classification from positive and unlabeled documents
Sort
View
SIGIR
2008
ACM
13 years 8 months ago
Topic-bridged PLSA for cross-domain text classification
In many Web applications, such as blog classification and newsgroup classification, labeled data are in short supply. It often happens that obtaining labeled data in a new domain ...
Gui-Rong Xue, Wenyuan Dai, Qiang Yang, Yong Yu
SIGIR
2005
ACM
14 years 2 months ago
Text classification with kernels on the multinomial manifold
Support Vector Machines (SVMs) have been very successful in text classification. However, the intrinsic geometric structure of text data has been ignored by standard kernels commo...
Dell Zhang, Xi Chen, Wee Sun Lee
WWW
2007
ACM
14 years 9 months ago
Efficient training on biased minimax probability machine for imbalanced text classification
The Biased Minimax Probability Machine (BMPM) constructs a classifier which deals with the imbalanced learning tasks. In this paper, we propose a Second Order Cone Programming (SO...
Xiang Peng, Irwin King
ICDAR
2009
IEEE
14 years 3 months ago
Text Line Segmentation Based on Morphology and Histogram Projection
Text extraction is an important phase in document recognition systems. In order to segment text from a page document it is necessary to detect all the possible manuscript text reg...
Rodolfo P. dos Santos, Gabriela S. Clemente, Ing R...
NIPS
2000
13 years 10 months ago
Text Classification using String Kernels
We propose a novel approach for categorizing text documents based on the use of a special kernel. The kernel is an inner product in the feature space generated by all subsequences...
Huma Lodhi, John Shawe-Taylor, Nello Cristianini, ...