Sciweavers

95 search results - page 14 / 19
» A cross-collection mixture model for comparative text mining
Sort
View
SIGIR
2008
ACM
13 years 7 months ago
Latent dirichlet allocation based multi-document summarization
Extraction based Multi-Document Summarization Algorithms consist of choosing sentences from the documents using some weighting mechanism and combining them into a summary. In this...
Rachit Arora, Balaraman Ravindran
KDD
2006
ACM
201views Data Mining» more  KDD 2006»
14 years 8 months ago
Clustering based large margin classification: a scalable approach using SOCP formulation
This paper presents a novel Second Order Cone Programming (SOCP) formulation for large scale binary classification tasks. Assuming that the class conditional densities are mixture...
J. Saketha Nath, Chiranjib Bhattacharyya, M. Naras...
KDD
2002
ACM
118views Data Mining» more  KDD 2002»
14 years 8 months ago
SECRET: a scalable linear regression tree algorithm
Recently there has been an increasing interest in developing regression models for large datasets that are both accurate and easy to interpret. Regressors that have these properti...
Alin Dobra, Johannes Gehrke
KDD
2002
ACM
155views Data Mining» more  KDD 2002»
14 years 8 months ago
SyMP: an efficient clustering approach to identify clusters of arbitrary shapes in large data sets
We propose a new clustering algorithm, called SyMP, which is based on synchronization of pulse-coupled oscillators. SyMP represents each data point by an Integrate-and-Fire oscill...
Hichem Frigui
CICLING
2007
Springer
13 years 11 months ago
Rule-Based Protein Term Identification with Help from Automatic Species Tagging
In biomedical articles, terms often refer to different protein entities. For example, an arbitrary occurrence of term p53 might denote thousands of proteins across a number of spec...
Xinglong Wang