Learning in a Large Function Space: Privacy-Preserving Mechanisms for SVM Learning

Benjamin I. P. Rubinstein; Peter L. Bartlett; Ling Huang; Nina Taft

doi:10.29012/jpc.v4i1.612

PDF

Published: Jul 20, 2012

DOI: https://doi.org/10.29012/jpc.v4i1.612

Keywords:

Support Vector Machines, Privacy-Preserving Learning, Lower Bounds

Benjamin I. P. Rubinstein

Microsoft Research, Mountain View, CA

https://orcid.org/0000-0002-2947-6980

Peter L. Bartlett

Division of Computer Science and Department of Statistics, University of California, Berkeley

Ling Huang

Intel Labs, Berkeley, CA

Nina Taft

Technicolor, Palo Alto, CA

Abstract

The ubiquitous need for analyzing privacy-sensitive information—including health records, personal communications, product ratings and social network data—is driving significant interest in privacy-preserving data analysis across several research communities. This paper explores the release of Support Vector Machine (SVM) classifiers while preserving the privacy of training data. The SVM is a popular machine learning method that maps data to a high-dimensional feature space before learning a linear decision boundary. We present efficient mechanisms for finite-dimensional feature mappings and for (potentially infinite-dimensional) mappings with translation-invariant kernels. In the latter case, our mechanism borrows a technique from large-scale learning to learn in a finite-dimensional feature space whose inner-product uniformly approximates the desired feature space inner-product (the desired kernel) with high probability. Differential privacy is established using algorithmic stability, a property used in learning theory to bound generalization error. Utility—when the private classifier is pointwise close to the non-private classifier with high probability—is proven using smoothness of regularized empirical risk minimization with respect to small perturbations to the feature mapping. Finally we conclude with lower bounds on the differential privacy of any mechanism approximating the SVM.

How to Cite

Rubinstein, Benjamin I. P., Peter L. Bartlett, Ling Huang, and Nina Taft. 2012. “Learning in a Large Function Space: Privacy-Preserving Mechanisms for SVM Learning”. Journal of Privacy and Confidentiality 4 (1). https://doi.org/10.29012/jpc.v4i1.612.

Issue

Vol. 4 No. 1 (2012)

Section

Articles

Copyright is retained by the authors. By submitting to this journal, the author(s) license the article under the Creative Commons License – Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0), unless choosing a more lenient license (for instance, public domain). For situations not allowed under CC BY-NC-ND, short sections of text, not to exceed two paragraphs, may be quoted without explicit permission provided that full credit, including © notice, is given to the source.

Authors of articles published by the journal grant the journal the right to store the articles in its databases for an unlimited period of time and to distribute and reproduce the articles electronically.

Article Sidebar

Main Article Content

Abstract

Article Details

Similar Articles