Optimal randomization for privacy preserving data mining

14 years 6 months ago

Download www.cs.uiuc.edu

Randomization is an economical and eﬃcient approach for privacy preserving data mining (PPDM). In order to guarantee the performance of data mining and the protection of individual privacy, optimal randomization schemes need to be employed. This paper demonstrates the construction of optimal randomization schemes for privacy preserving density estimation. We propose a general framework for randomization using mixture models. The impact of randomization on data mining is quantiﬁed by performance degradation and mutual information loss, while privacy and privacy loss are quantiﬁed by interval-based metrics. Two diﬀerent types of problems are deﬁned to identify optimal randomization for PPDM. Illustrative examples and simulation results are reported. Categories and Subject Descriptors H.2.8 [Database Management]: Database Applications— Data mining; H.2.0 [Database Management]: General— Security, integrity, and protection General Terms Theory, Algorithms, Security Keywords m...

Michael Yu Zhu, Lei Liu

Real-time Traffic

Data Mining | KDD 2004 | Optimal Randomization | Optimal Randomization Schemes | Randomization |

claim paper

Post Info
More Details (n/a)

Added	02 Jul 2010
Updated	02 Jul 2010
Type	Conference
Year	2004
Where	KDD
Authors	Michael Yu Zhu, Lei Liu

Comments (0)

Sciweavers

Optimal randomization for privacy preserving data mining

Data Mining | KDD 2004 | Optimal Randomization | Optimal Randomization Schemes | Randomization |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers