Bayesian Estimation of Disclosure Risks for Multiply Imputed, Synthetic Data

Jerome P. Reiter; Quanli Wang; Biyuan Zhang

doi:10.29012/jpc.v6i1.635

PDF

Published: Jun 1, 2014

DOI: https://doi.org/10.29012/jpc.v6i1.635

Keywords:

Bayesian, confidentiality, imputation, risk, synthetic

Jerome P. Reiter

Department of Statistical Science, Box 90251, Duke University, Durham, NC

https://orcid.org/0000-0002-8374-3832

Quanli Wang

Department of Statistical Science, Box 90251, Duke University, Durham, NC

Biyuan Zhang

Department of Economics, Duke University, Durham, NC

Abstract

Agencies seeking to disseminate public use microdata, i.e., data on individual records, can replace confidential values with multiple draws from statistical models estimated with the collected data. We present a famework for evaluating disclosure risks inherent in releasing multiply-imputed, synthetic data. The basic idea is to mimic an intruder who computes posterior distributions of confidential values given the released synthetic data and prior knowledge. We illustrate the methodology with artificial fully synthetic data and with partial synthesis of the Survey of Youth in Custody.

How to Cite

Reiter, Jerome P., Quanli Wang, and Biyuan Zhang. 2014. “Bayesian Estimation of Disclosure Risks for Multiply Imputed, Synthetic Data”. Journal of Privacy and Confidentiality 6 (1). https://doi.org/10.29012/jpc.v6i1.635.

Issue

Vol. 6 No. 1 (2014)

Section

Articles

Copyright is retained by the authors. By submitting to this journal, the author(s) license the article under the Creative Commons License – Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0), unless choosing a more lenient license (for instance, public domain). For situations not allowed under CC BY-NC-ND, short sections of text, not to exceed two paragraphs, may be quoted without explicit permission provided that full credit, including © notice, is given to the source.

Authors of articles published by the journal grant the journal the right to store the articles in its databases for an unlimited period of time and to distribute and reproduce the articles electronically.

Funding data

National Science Foundation
Grant numbers CNS-10-12141;SES-11-31897

Article Sidebar

Main Article Content

Abstract

Article Details

Funding data

Most read articles by the same author(s)