Sciweavers

110 search results - page 19 / 22
» A Comparison of Two Document Clustering Approaches for Clust...
Sort
View
NIPS
2008
13 years 9 months ago
Semi-supervised Learning with Weakly-Related Unlabeled Data: Towards Better Text Categorization
The cluster assumption is exploited by most semi-supervised learning (SSL) methods. However, if the unlabeled data is merely weakly related to the target classes, it becomes quest...
Liu Yang, Rong Jin, Rahul Sukthankar
JMLR
2010
107views more  JMLR 2010»
13 years 2 months ago
Modeling Knowledge Worker Activity
This paper describes an approach to constructing a probabilistic process model representing knowledge worker activity out of a log of primitive events, such as e-mails, web page v...
Tadej Stajner, Dunja Mladenic
CLEF
2010
Springer
13 years 8 months ago
Web Person Name Disambiguation by Relevance Weighting of Extended Feature Sets
Abstract. This paper describes our approach to the Person Name Disambiguation clustering task in the Third Web People Search Evaluation Campaign(WePS3). The method focuses on two a...
Chong Long, Lei Shi
KDD
2007
ACM
124views Data Mining» more  KDD 2007»
14 years 1 months ago
Hierarchical mixture models: a probabilistic analysis
Mixture models form one of the most widely used classes of generative models for describing structured and clustered data. In this paper we develop a new approach for the analysis...
Mark Sandler
SDM
2009
SIAM
129views Data Mining» more  SDM 2009»
14 years 4 months ago
Multi-topic Based Query-Oriented Summarization.
Query-oriented summarization aims at extracting an informative summary from a document collection for a given query. It is very useful to help users grasp the main information rel...
Dewei Chen, Jie Tang, Limin Yao