Sciweavers

483 search results - page 75 / 97
» Sampling the Web as Training Data for Text Classification
ICML
2006
IEEE
16 years 3 months ago
The rate adapting Poisson model for information retrieval and object recognition
Probabilistic modelling of text data in the bag-of-words representation has been dominated by directed graphical models such as pLSI, LDA, NMF, and discrete PCA. Recently, state of...
Peter V. Gehler, Alex Holub, Max Welling
KDD
2005
ACM
16 years 2 months ago
Deriving marketing intelligence from online discussion
Weblogs and message boards provide online forums for discussion that record the voice of the public. Woven into this mass of discussion is a wide range of opinion and commentary a...
Natalie S. Glance, Matthew Hurst, Kamal Nigam, Mat...
SIGIR
2005
ACM
15 years 8 months ago
A Markov random field model for term dependencies
This paper develops a general, formal framework for modeling term dependencies via Markov random fields. The model allows for arbitrary text features to be incorporated as eviden...
Donald Metzler, W. Bruce Croft
SEMWEB
2009
Springer
15 years 9 months ago
Policy-Aware Content Reuse on the Web
The Web allows users to share their work very effectively, leading to the rapid re-use and remixing of content on the Web, including text, images, and videos. Scientific research d...
Oshani Seneviratne, Lalana Kagal, Tim Berners-Lee
AAAI
2006
15 years 4 months ago
Semi-supervised Multi-label Learning by Constrained Non-negative Matrix Factorization
We present a novel framework for multi-label learning that explicitly addresses the challenge arising from the large number of classes and a small size of training data. The key a...
Yi Liu, Rong Jin, Liu Yang