Sciweavers

CORR
2011
Springer

Link Prediction by De-anonymization: How We Won the Kaggle Social Network Challenge

13 years 3 months ago
Link Prediction by De-anonymization: How We Won the Kaggle Social Network Challenge
— This paper describes the winning entry to the IJCNN 2011 Social Network Challenge run by Kaggle.com. The goal of the contest was to promote research on realworld link prediction, and the dataset was a graph obtained by crawling the popular Flickr social photo sharing website, with user identities scrubbed. By de-anonymizing much of the competition test set using our own Flickr crawl, we were able to effectively game the competition. Our attack represents a new application of de-anonymization to gaming machine learning contests, suggesting changes in how future competitions should be run. We introduce a new simulated annealing-based weighted graph matching algorithm for the seeding step of deanonymization. We also show how to combine de-anonymization with link prediction—the latter is required to achieve good performance on the portion of the test set not de-anonymized— for example by training the predictor on the de-anonymized portion of the test set, and combining probabilisti...
Arvind Narayanan, Elaine Shi, Benjamin I. P. Rubin
Added 19 Aug 2011
Updated 19 Aug 2011
Type Journal
Year 2011
Where CORR
Authors Arvind Narayanan, Elaine Shi, Benjamin I. P. Rubinstein
Comments (0)