ADVERTISEMENT

If you are seeing this message, you may be experiencing temporary network problems. Please wait a few minutes and refresh the page. If the problem persists, you may wish to report it to your local Network Manager.

It is also possible that your web browser is not configured or not able to display style sheets. In this case, although the visual presentation will be degraded, the site should continue to be functional. We recommend using the latest version of Microsoft or Mozilla web browser to help minimise these problems.

Wiley InterScience

Journal of the Royal Statistical Society: Series B (Statistical Methodology)

Journal of the Royal Statistical Society: Series B (Statistical Methodology)

Volume 64 Issue 4, Pages 855 - 867

Published Online: 23 Oct 2002

© 2010 The Royal Statistical Society and Blackwell Publishing Ltd



< Previous Abstract  |  Next Abstract >

Save Article to My Profile      Download Citation      Request Permissions

Abstract |  References  |  Full Text: HTML, PDF (Size: 125K)  | Related Articles | Citation Tracking

A measure of disclosure risk for microdata
C. J. Skinnerand M. J. Elliot
 University of Southampton, UK  University of Manchester, UK
Address for correspondence : C. J. Skinner, Department of Social Statistics, University of Southampton, Highfield, Southampton, SO17 1BJ, UK.
E-mail: cjs@soton.ac.uk
Copyright 2002 Royal Statistical Society
KEYWORDS
Confidentiality protection • Finite population inference • Sample survey data • Statistical disclosure control

ABSTRACT

Summary. Protection against disclosure is important for statistical agencies releasing microdata files from sample surveys. Simple measures of disclosure risk can provide useful evidence to support decisions about release. We propose a new measure of disclosure risk: the probability that a unique match between a microdata record and a population unit is correct. We argue that this measure has at least two advantages. First, we suggest that it may be a more realistic measure of risk than two measures that are currently used with census data. Second, we show that consistent inference (in a specified sense) may be made about this measure from sample data without strong modelling assumptions. This is a surprising finding, in its contrast with the properties of the two 'similar' established measures. As a result, this measure has potentially useful applications to sample surveys. In addition to obtaining a simple consistent predictor of the measure, we propose a simple variance estimator and show that it is consistent. We also consider the extension of inference to allow for certain complex sampling schemes. We present a numerical study based on 1991 census data for about 450 000 enumerated individuals in one area of Great Britain. We show that the theoretical results on the properties of the point predictor of the measure of risk and its variance estimator hold to a good approximation for these data.


[Received July 2001. Revised June 2002]

DIGITAL OBJECT IDENTIFIER (DOI)
10.1111/1467-9868.00365 About DOI

Related Articles

  • Find other articles like this in Wiley InterScience
  • Find articles in Wiley InterScience written by any of the authors

Wiley InterScience is a member of CrossRef.

Cross Ref Member


Also of Interest

Statistics

Wiley-Blackwell is the largest publisher of society-based statistics journals and No. 1 in terms of quality and international scope.

Wiley-Blackwell publishes 19 statistics journals and is now the top publisher of Thomson Reuters ranked statistics journals.

Discover more about the statistics portfolio

Hot Papers
RSS

Journal of the Royal Statistical Society

See the Papers attracting early citation:

Series A: Statistics in Society
A re-evaluation of random-effects meta-analysis

Series B: Statistical Methodology
Testing for lack of fit in inverse regression—with applications to biophotonic imaging

Series C: Applied Statistics
A multifaceted sensitivity analysis of the Slovenian public opinion survey data

Announcing
SIGN

Significance

2010 Crystal Ball Competition

Try to forecast the results of 10 different events, some sporting, some cultural, some just odd, that will take place between May and July 2010.
Cash prizes and books for winners.

Take part

Check out the rules

Have Fun!