Sciweavers

» Semi-Supervised Text Classification Using Positive and Unlab...
EMNLP
2010
13 years 5 months ago
Negative Training Data Can be Harmful to Text Classification
This paper studies the effects of training data on binary text classification and postulates that negative training data is not needed and may even be harmful for the task. Tradit...
Xiaoli Li, Bing Liu, See-Kiong Ng
KDD
2004
ACM
14 years 8 months ago
An objective evaluation criterion for clustering
We propose and test an objective criterion for evaluation of clustering performance: How well does a clustering algorithm run on unlabeled data aid a classification algorithm? The...
Arindam Banerjee, John Langford
SIGIR
2008
ACM
13 years 7 months ago
Learning from labeled features using generalized expectation criteria
It is difficult to apply machine learning to new domains because often we lack labeled problem instances. In this paper, we provide a solution to this problem that leverages domai...
Gregory Druck, Gideon S. Mann, Andrew McCallum
AAAI
2008
13 years 9 months ago
Text Categorization with Knowledge Transfer from Heterogeneous Data Sources
Multi-category classification of short dialogues is a common task performed by humans. When assigning a question to an expert, a customer service operator tries to classify the cu...
Rakesh Gupta, Lev-Arie Ratinov
INFORMATICALT
2010
13 years 4 months ago
Statistical Classification of Scientific Publications
The problem of automatic classification of scientific texts is considered. Methods based on statistical analysis of probabilistic distributions of scientific terms in texts are dis...
Vaidas Balys, Rimantas Rudzkis