Privacy via the Johnson-Lindenstrauss Transform

Krishnaram Kenthapadi; Aleksandra Korolova; Ilya Mironov; Nina Mishra

doi:10.29012/jpc.v5i1.625

PDF

Published: Aug 1, 2013

DOI: https://doi.org/10.29012/jpc.v5i1.625

Keywords:

differential privacy, Johnson-Lindenstrauss, sketching

Krishnaram Kenthapadi

Microsoft Research, Mountain View, CA

Aleksandra Korolova

Stanford University, Stanford, CA

Ilya Mironov

Microsoft Research, Mountain View, CA

Nina Mishra

Microsoft Research, Mountain View, CA

Abstract

Suppose that party A collects private information about its users, where each user's data is represented as a bit vector. Suppose that party B has a proprietary data mining algorithm that requires estimating the distance between users, such as clustering or nearest neighbors. We ask if it is possible for party A to publish some information about each user so that B can estimate the distance between users without being able to infer any private bit of a user. Our method involves projecting each user's representation into a random, lower-dimensional space via a sparse Johnson-Lindenstrauss transform and then adding Gaussian noise to each entry of the lower-dimensional representation. We show that the method preserves differential privacy---where the more privacy is desired, the larger the variance of the Gaussian noise. Further, we show how to approximate the true distances between users via only the lower-dimensional, perturbed data. Finally, we consider other perturbation methods such as randomized response and draw comparisons to sketch-based methods. While the goal of releasing user-specific data to third parties is more broad than preserving distances, this work shows that distance computations with privacy is an achievable goal.

How to Cite

Kenthapadi, Krishnaram, Aleksandra Korolova, Ilya Mironov, and Nina Mishra. 2013. “Privacy via the Johnson-Lindenstrauss Transform”. Journal of Privacy and Confidentiality 5 (1). https://doi.org/10.29012/jpc.v5i1.625.

Issue

Vol. 5 No. 1 (2013)

Section

Articles

Copyright is retained by the authors. By submitting to this journal, the author(s) license the article under the Creative Commons License – Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0), unless choosing a more lenient license (for instance, public domain). For situations not allowed under CC BY-NC-ND, short sections of text, not to exceed two paragraphs, may be quoted without explicit permission provided that full credit, including © notice, is given to the source.

Authors of articles published by the journal grant the journal the right to store the articles in its databases for an unlimited period of time and to distribute and reproduce the articles electronically.

Most read articles by the same author(s)

Aleksandra Korolova, Privacy Violations Using Microtargeted Ads: A Case Study , Journal of Privacy and Confidentiality: Vol. 3 No. 1 (2011)
Brendan Avent, Aleksandra Korolova, David Zeber, Torgeir Hovden, Benjamin Livshits, BLENDER: Enabling Local Search with a Hybrid Differential Privacy Model , Journal of Privacy and Confidentiality: Vol. 9 No. 2 (2019): Differential Privacy, including Special Issue on the Theory and Practice of Differential Privacy 2017
Shubha U. Nabar, Nina Mishra, Releasing Private Contingency Tables , Journal of Privacy and Confidentiality: Vol. 2 No. 1 (2010)

Make a Submission

about2

The Journal of Privacy and Confidentiality is an open-access multi-disciplinary journal whose purpose is to facilitate the coalescence of research methodologies and activities in the areas of privacy, confidentiality, and disclosure limitation. The JPC seeks to publish a wide range of research and review papers, not only from academia, but also from government (especially official statistical agencies) and industry, and to serve as a forum for exchange of views, discussion, and news. For more information, see the About the Journal page.