Sciweavers

483 search results - page 81 / 97
» Sampling the Web as Training Data for Text Classification
Sort
View
IDMS
1998
Springer
107views Multimedia» more  IDMS 1998»
15 years 6 months ago
Classifying Objectionable Websites Based on Image Content
This paper describes IBCOW Image-based Classi cation of Objectionable Websites, a system capable of classifying a website as objectionable or benign based on image content. The sys...
James Ze Wang, Jia Li, Gio Wiederhold, Oscar Firsc...
111
Voted
KDD
2003
ACM
157views Data Mining» more  KDD 2003»
16 years 2 months ago
Cross-training: learning probabilistic mappings between topics
Classification is a well-established operation in text mining. Given a set of labels A and a set DA of training documents tagged with these labels, a classifier learns to assign l...
Sunita Sarawagi, Soumen Chakrabarti, Shantanu Godb...
AAAI
2010
15 years 4 months ago
Multi-Task Active Learning with Output Constraints
Many problems in information extraction, text mining, natural language processing and other fields exhibit the same property: multiple prediction tasks are related in the sense th...
Yi Zhang 0010
128
Voted
FLAIRS
2006
15 years 3 months ago
Corpus Based Unsupervised Labeling of Documents
Text categorization involves mapping of documents to a fixed set of labels. A similar but equally important problem is that of assigning labels to large corpora. With a deluge of ...
Delip Rao, Deepak P, Deepak Khemani
ICASSP
2008
IEEE
15 years 9 months ago
Mutual features for robust identification and verification
Noisy or distorted video/audio training sets represent constant challenges in automated identification and verification tasks. We propose the method of Mutual Interdependence An...
Heiko Claussen, Justinian Rosca, Robert I. Damper