Evaluating Bias and Noise Induced by the U.S. Census Bureau's Privacy Protection Methods

Bibliographic Details
Title: Evaluating Bias and Noise Induced by the U.S. Census Bureau's Privacy Protection Methods
Authors: Kenny, Christopher T.; McCartan, Cory; Kuriwaki, Shiro; Simko, Tyler; Imai, Kosuke
Source: Science Advances, 10(18) (2024) eadl2524
Publication Year: 2023
Collection: Computer Science; Statistics
Subject Terms: Computer Science - Computers and Society, Statistics - Applications
Description: The United States Census Bureau faces a difficult trade-off between the accuracy of Census statistics and the protection of individual information. We conduct the first independent evaluation of bias and noise induced by the Bureau's two main disclosure avoidance systems: the TopDown algorithm employed for the 2020 Census and the swapping algorithm implemented for the three previous Censuses. Our evaluation leverages the Noisy Measure File (NMF) as well as two independent runs of the TopDown algorithm applied to the 2010 decennial Census. We find that the NMF contains too much noise to be directly useful, especially for Hispanic and multiracial populations. TopDown's post-processing dramatically reduces the NMF noise and produces data whose accuracy is similar to that of swapping. While the estimated errors for both TopDown and swapping algorithms are generally no greater than other sources of Census error, they can be relatively substantial for geographies with small total populations.
Comment: 25 pages, 6 figures, 2 tables, plus appendices
Document Type: Working Paper
DOI: 10.1126/sciadv.adl2524
Access URL: http://arxiv.org/abs/2306.07521
Accession Number: edsarx.2306.07521
Database: arXiv