Sciweavers

2827 search results - page 137 / 566
» Marking Text Documents
Sort
View
ISCI
2007
122views more  ISCI 2007»
13 years 8 months ago
On the strength of hyperclique patterns for text categorization
The use of association patterns for text categorization has attracted great interest and a variety of useful methods have been developed. However, the key characteristics of patte...
Tieyun Qian, Hui Xiong, Yuanzhen Wang, Enhong Chen
SIGIR
2010
ACM
13 years 12 months ago
Combining coregularization and consensus-based self-training for multilingual text categorization
We investigate the problem of learning document classifiers in a multilingual setting, from collections where labels are only partially available. We address this problem in the ...
Massih-Reza Amini, Cyril Goutte, Nicolas Usunier
KDD
2004
ACM
210views Data Mining» more  KDD 2004»
14 years 8 months ago
Probabilistic author-topic models for information discovery
We propose a new unsupervised learning technique for extracting information from large text collections. We model documents as if they were generated by a two-stage stochastic pro...
Mark Steyvers, Padhraic Smyth, Michal Rosen-Zvi, T...
TREC
2003
13 years 9 months ago
Overview of the TREC 2003 Question Answering Track
The TREC 2003 question answering track contained two tasks, the passages task and the main task. In the passages task, systems returned a single text snippet in response to factoi...
Ellen M. Voorhees
KDD
2007
ACM
124views Data Mining» more  KDD 2007»
14 years 2 months ago
Hierarchical mixture models: a probabilistic analysis
Mixture models form one of the most widely used classes of generative models for describing structured and clustered data. In this paper we develop a new approach for the analysis...
Mark Sandler