Differentially private posterior summaries for linear regression coefficients

Gilad Amitai; Jerome Reiter

doi:10.29012/jpc.683


Published: Dec 12, 2018


Keywords:

differential privacy, Bayesian inference, posterior quantile, posterior probability

Gilad Amitai

Duke University

Jerome Reiter

Duke University

Abstract

In Bayesian regression modeling, analysts often summarize inferences using posterior probabilities and quantiles, such as the posterior probability that a coefficient exceeds zero or the posterior median of that coefficient. However, with potentially unbounded outcomes and explanatory variables, regression inferences based on typical prior distributions can be sensitive to the values of individual data points. Thus, releasing posterior summaries of regression coefficients can create disclosure risks. In this article, we propose some differentially private algorithms for reporting posterior probabilities and posterior quantiles of linear regression coefficients. The algorithms use the general strategy of subsample and aggregate, a technique that requires randomly partitioning the data into disjoint subsets, estimating the regression within each subset, and combining results in ways that satisfy differential privacy. We illustrate the performance of some of the algorithms using repeated sampling studies. The non-private versions can also be used for Bayesian inference with big data in non-private settings.
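
The subsample-and-aggregate strategy described in the abstract can be illustrated with a minimal Python sketch. This is a generic illustration, not the algorithms proposed in the article: it assumes a normal approximation to each subset's posterior centered at the least-squares estimate (roughly what a flat prior gives), averages the subset-level probabilities that a coefficient exceeds zero, and adds Laplace noise. The function names (`subset_posterior_prob`, `dp_posterior_prob`) and the parameters `n_subsets` and `epsilon` are hypothetical, and the averaged value is not claimed to match the full-data posterior probability.

```python
import math
import numpy as np


def subset_posterior_prob(X, y, j):
    """Approximate P(beta_j > 0 | subset) with a normal approximation
    centered at the least-squares estimate (an assumption of this sketch)."""
    n, p = X.shape
    xtx_inv = np.linalg.inv(X.T @ X)
    beta_hat = xtx_inv @ X.T @ y
    resid = y - X @ beta_hat
    sigma2 = resid @ resid / (n - p)          # residual variance estimate
    se = math.sqrt(sigma2 * xtx_inv[j, j])    # approximate posterior sd of beta_j
    z = beta_hat[j] / se
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))  # standard normal CDF at z


def dp_posterior_prob(X, y, j, n_subsets, epsilon, rng):
    """Subsample and aggregate: random disjoint partition, per-subset summary,
    then a Laplace-noised average."""
    n = X.shape[0]
    perm = rng.permutation(n)                  # partition does not depend on data values
    parts = np.array_split(perm, n_subsets)    # disjoint index sets
    probs = np.array([subset_posterior_prob(X[idx], y[idx], j) for idx in parts])
    # Each record sits in exactly one subset and each summary lies in [0, 1],
    # so changing one record moves the average by at most 1 / n_subsets.
    sensitivity = 1.0 / n_subsets
    noisy = probs.mean() + rng.laplace(loc=0.0, scale=sensitivity / epsilon)
    return float(np.clip(noisy, 0.0, 1.0))     # clipping is post-processing


# Simulated data, purely for illustration.
rng = np.random.default_rng(0)
n = 2000
X = np.column_stack([np.ones(n), rng.normal(size=n)])
y = X @ np.array([1.0, 0.5]) + rng.normal(size=n)
est = dp_posterior_prob(X, y, j=1, n_subsets=20, epsilon=1.0, rng=rng)
```

The design trade-off in this sketch is the number of subsets: more subsets shrink the sensitivity of the average, and hence the noise, but leave fewer observations for each within-subset regression.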

How to Cite

Amitai, Gilad, and Jerome Reiter. 2018. “Differentially Private Posterior Summaries for Linear Regression Coefficients”. Journal of Privacy and Confidentiality 8 (1). https://doi.org/10.29012/jpc.683.

Issue

Vol. 8 No. 1 (2018): Commemorating Stephen Fienberg

Section

Articles

Copyright is retained by the authors. By submitting to this journal, the author(s) license the article under the Creative Commons License – Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0), unless choosing a more lenient license (for instance, public domain). For situations not allowed under CC BY-NC-ND, short sections of text, not to exceed two paragraphs, may be quoted without explicit permission provided that full credit, including © notice, is given to the source.

Authors of articles published by the journal grant the journal the right to store the articles in its databases for an unlimited period of time and to distribute and reproduce the articles electronically.

Funding data

National Science Foundation
Grant numbers SES 1131897; ACI 1443014

