Sciweavers

PAKDD
2010
ACM
167views Data Mining» more  PAKDD 2010»
14 years 3 months ago
Resource-Bounded Information Extraction: Acquiring Missing Feature Values on Demand
We present a general framework for the task of extracting specific information “on demand” from a large corpus such as the Web under resource-constraints. Given a database wit...
Pallika Kanani, Andrew McCallum, Shaohan Hu
PAKDD
2010
ACM
182views Data Mining» more  PAKDD 2010»
14 years 4 months ago
Computation of Ratios of Secure Summations in Multi-party Privacy-Preserving Latent Dirichlet Allocation
In this paper, we focus our attention on the problem of computing the ratio of two numbers, both of which are the summations of the private numbers distributed in different parties...
Bin Yang, Hiroshi Nakagawa
PAKDD
2010
ACM
175views Data Mining» more  PAKDD 2010»
14 years 4 months ago
EigenSpokes: Surprising Patterns and Scalable Community Chipping in Large Graphs
Abstract. We report a surprising, persistent pattern in large sparse social graphs, which we term EigenSpokes. We focus on large Mobile Call graphs, spanning about 186K nodes and m...
B. Aditya Prakash, Ashwin Sridharan, Mukund Seshad...
PAKDD
2010
ACM
275views Data Mining» more  PAKDD 2010»
14 years 4 months ago
Anonymizing Transaction Data by Integrating Suppression and Generalization
Abstract. Privacy protection in publishing transaction data is an important problem. A key feature of transaction data is the extreme sparsity, which renders any single technique i...
Junqiang Liu, Ke Wang
PAKDD
2010
ACM
212views Data Mining» more  PAKDD 2010»
14 years 5 months ago
Fast Perceptron Decision Tree Learning from Evolving Data Streams
Abstract. Mining of data streams must balance three evaluation dimensions: accuracy, time and memory. Excellent accuracy on data streams has been obtained with Naive Bayes Hoeffdi...
Albert Bifet, Geoffrey Holmes, Bernhard Pfahringer...
PAKDD
2010
ACM
117views Data Mining» more  PAKDD 2010»
14 years 5 months ago
BASSET: Scalable Gateway Finder in Large Graphs
Given a social network, who is the best person to introduce you to, say, Chris Ferguson, the poker champion? Or, given a network of people and skills, who is the best person to he...
Hanghang Tong, Spiros Papadimitriou, Christos Falo...
PAKDD
2010
ACM
151views Data Mining» more  PAKDD 2010»
14 years 5 months ago
Ensemble Learning Based on Multi-Task Class Labels
Abstract. It is well known that diversity among component classifiers is crucial for constructing a strong ensemble. Most existing ensemble methods achieve this goal through resam...
Qing Wang, Liang Zhang
PAKDD
2010
ACM
189views Data Mining» more  PAKDD 2010»
14 years 5 months ago
Subsequence Matching of Stream Synopses under the Time Warping Distance
In this paper, we propose a method for online subsequence matching between histogram-based stream synopsis structures under the dynamic warping distance. Given a query synopsis pat...
Su-Chen Lin, Mi-Yen Yeh, Ming-Syan Chen
PAKDD
2010
ACM
130views Data Mining» more  PAKDD 2010»
14 years 5 months ago
A New Framework for Dissimilarity and Similarity Learning
Adam Woznica, Alexandros Kalousis
PAKDD
2010
ACM
174views Data Mining» more  PAKDD 2010»
14 years 5 months ago
Classification and Pattern Discovery of Mood in Weblogs
Thin Nguyen, Dinh Q. Phung, Brett Adams, Tran The ...