Sciweavers

KDD
2008
ACM
111views Data Mining» more  KDD 2008»
14 years 9 months ago
Fast logistic regression for text categorization with variable-length n-grams
Gökhan H. Bakir, Georgiana Ifrim, Gerhard Wei...
KDD
2008
ACM
174views Data Mining» more  KDD 2008»
14 years 9 months ago
Using predictive analysis to improve invoice-to-cash collection
Sai Zeng, Prem Melville, Christian A. Lang, Ioana ...
KDD
2008
ACM
153views Data Mining» more  KDD 2008»
14 years 9 months ago
Text classification, business intelligence, and interactivity: automating C-Sat analysis for services industry
Text classification has matured as a research discipline over the last decade. Independently, business intelligence over structured databases has long been a source of insights fo...
Shantanu Godbole, Shourya Roy
KDD
2008
ACM
115views Data Mining» more  KDD 2008»
14 years 9 months ago
Topical query decomposition
We introduce the problem of query decomposition, where we are given a query and a document retrieval system, and we want to produce a small set of queries whose union of resulting...
Francesco Bonchi, Carlos Castillo, Debora Donato, ...
KDD
2008
ACM
155views Data Mining» more  KDD 2008»
14 years 9 months ago
Factorization meets the neighborhood: a multifaceted collaborative filtering model
Recommender systems provide users with personalized suggestions for products or services. These systems often rely on Collaborating Filtering (CF), where past transactions are ana...
Yehuda Koren
KDD
2008
ACM
130views Data Mining» more  KDD 2008»
14 years 9 months ago
Unsupervised feature selection for principal components analysis
Christos Boutsidis, Michael W. Mahoney, Petros Dri...
KDD
2008
ACM
183views Data Mining» more  KDD 2008»
14 years 9 months ago
A bayesian mixture model with linear regression mixing proportions
Classic mixture models assume that the prevalence of the various mixture components is fixed and does not vary over time. This presents problems for applications where the goal is...
Xiuyao Song, Chris Jermaine, Sanjay Ranka, John Gu...
KDD
2008
ACM
135views Data Mining» more  KDD 2008»
14 years 9 months ago
DiMaC: a disguised missing data cleaning tool
In some applications such as filling in a customer information form on the web, some missing values may not be explicitly represented as such, but instead appear as potentially va...
Ming Hua, Jian Pei
KDD
2008
ACM
128views Data Mining» more  KDD 2008»
14 years 9 months ago
Scaling up text classification for large file systems
: We combine the speed and scalability of information retrieval with the generally superior classification accuracy offered by machine learning, yielding a two-phase text classifie...
George Forman, Shyamsundar Rajaram
KDD
2008
ACM
142views Data Mining» more  KDD 2008»
14 years 9 months ago
Efficient ticket routing by resolution sequence mining
IT problem management calls for quick identification of resolvers to reported problems. The efficiency of this process highly depends on ticket routing--transferring problem ticke...
Qihong Shao, Yi Chen, Shu Tao, Xifeng Yan, Nikos A...