TY - JOUR AU - Balle, Borja AU - Barthe, Gilles AU - Gaboardi, Marco PY - 2020/01/15 Y2 - 2024/03/28 TI - Privacy Profiles and Amplification by Subsampling JF - Journal of Privacy and Confidentiality JA - JPC VL - 10 IS - 1 SE - TPDP 2018 DO - 10.29012/jpc.726 UR - https://journalprivacyconfidentiality.org/index.php/jpc/article/view/726 SP - AB - <p>Differential privacy provides a robust quantifiable methodology to measure and control the privacy leakage of data analysis algorithms.<br>A fundamental insight is that by forcing algorithms to be randomized, their privacy leakage can be characterized by measuring the dissimilarity between output distributions produced by applying the algorithm to pairs datasets differing in one individual.<br>After the introduction of differential privacy, several variants of the original definition have been proposed by changing the measure of dissimilarity between distributions, including concentrated, zero-concentrated and R{\'e}nyi differential privacy.</p><p>The first contribution of this paper is to introduce the notion of privacy profile of a mechanism.<br>This profile captures all valid $(\varepsilon,\delta)$ differential privacy parameters satisfied by a given mechanism, and contrasts with the usual approach of providing guarantees in terms of a single point in this curve.<br>We show that knowledge of this curve is equivalent to knowledge of the privacy guarantees with respect to the alternative definitions listed above.<br>This sheds further light into the connections between different privacy definitions, and suggests that these should be considered alternative but otherwise equivalent points of view.</p><p>The second contribution of this paper is to apply the privacy profiles machinery to study the so-called ``privacy amplification by subsampling'' principle, which ensures that a differentially private mechanism run on a random subsample of a population provides higher privacy guarantees than when run on the entire population.<br>Several instances of this principle have been studied for different random subsampling methods, each with an ad-hoc analysis. In this paper we set out to study this phenomenon in detail with the aim to provide a general method capable of recovering prior analyses in a streamlined fashion.<br>Our method makes extensive use of coupling argument, and introduces a new tool to analyse differential privacy for mixture distributions.</p> ER -