Sciweavers

KDD
2007
ACM
139views Data Mining» more  KDD 2007»
14 years 9 months ago
Making generative classifiers robust to selection bias
Andrew T. Smith, Charles Elkan
KDD
2007
ACM
192views Data Mining» more  KDD 2007»
14 years 9 months ago
Active exploration for learning rankings from clickthrough data
We address the task of learning rankings of documents from search engine logs of user behavior. Previous work on this problem has relied on passively collected clickthrough data. ...
Filip Radlinski, Thorsten Joachims
KDD
2007
ACM
146views Data Mining» more  KDD 2007»
14 years 9 months ago
Event summarization for system management
Wei Peng, Charles Perng, Tao Li, Haixun Wang
KDD
2007
ACM
184views Data Mining» more  KDD 2007»
14 years 9 months ago
GraphScope: parameter-free mining of large time-evolving graphs
How can we find communities in dynamic networks of social interactions, such as who calls whom, who emails whom, or who sells to whom? How can we spot discontinuity timepoints in ...
Jimeng Sun, Christos Faloutsos, Spiros Papadimitri...
KDD
2007
ACM
132views Data Mining» more  KDD 2007»
14 years 9 months ago
A scalable modular convex solver for regularized risk minimization
A wide variety of machine learning problems can be described as minimizing a regularized risk functional, with different algorithms using different notions of risk and different r...
Choon Hui Teo, Alex J. Smola, S. V. N. Vishwanatha...
KDD
2007
ACM
176views Data Mining» more  KDD 2007»
14 years 9 months ago
Mining correlated bursty topic patterns from coordinated text streams
Previous work on text mining has almost exclusively focused on a single stream. However, we often have available multiple text streams indexed by the same set of time points (call...
Xuanhui Wang, ChengXiang Zhai, Xiao Hu, Richard Sp...
KDD
2007
ACM
149views Data Mining» more  KDD 2007»
14 years 9 months ago
Distributed classification in peer-to-peer networks
This work studies the problem of distributed classification in peer-to-peer (P2P) networks. While there has been a significant amount of work in distributed classification, most o...
Ping Luo, Hui Xiong, Kevin Lü, Zhongzhi Shi
KDD
2007
ACM
156views Data Mining» more  KDD 2007»
14 years 9 months ago
Estimating rates of rare events at multiple resolutions
We consider the problem of estimating occurrence rates of rare events for extremely sparse data, using pre-existing hierarchies to perform inference at multiple resolutions. In pa...
Deepak Agarwal, Andrei Z. Broder, Deepayan Chakrab...
KDD
2007
ACM
178views Data Mining» more  KDD 2007»
14 years 9 months ago
Real-time ranking with concept drift using expert advice
In many practical applications, one is interested in generating a ranked list of items using information mined from continuous streams of data. For example, in the context of comp...
Hila Becker, Marta Arias
KDD
2007
ACM
137views Data Mining» more  KDD 2007»
14 years 9 months ago
Characterising the difference
Characterising the differences between two databases is an often occurring problem in Data Mining. Detection of change over time is a prime example, comparing databases from two b...
Jilles Vreeken, Matthijs van Leeuwen, Arno Siebes