Completeness of Race and Ethnicity Reporting in Person-Level COVID-19 Surveillance Data, 50 States, April 2020–December 2021

Abstract
Objectives: Black, Indigenous, and People of Color have borne a disproportionate incidence of COVID-19 cases in the United States. However, few studies have documented the completeness of race and ethnicity reporting in national COVID-19 surveillance data. The objective of this study was to describe the completeness of race and ethnicity ascertainment in person-level data received by the Centers for Disease Control and Prevention (CDC) through national COVID-19 case surveillance. Methods: We compared COVID-19 cases with "complete" (ie, per Office of Management and Budget 1997 revised criteria) data on race and ethnicity from CDC person-level surveillance data with CDC-reported aggregate counts of COVID-19 from April 5, 2020, through December 1, 2021, in aggregate and by state. Results: National person-level COVID-19 case surveillance data received by CDC during the study period included 18 881 379 COVID-19 cases with complete ascertainment of race and ethnicity, representing 39.4% of all cases reported to CDC in aggregate (N = 47 898 497). Five states (Georgia, Hawaii, Nebraska, New Jersey, and West Virginia) did not report any COVID-19 person-level cases with multiple racial identities to CDC. Conclusion: Our findings highlight a high degree of missing data on race and ethnicity in national COVID-19 case surveillance, enhancing our understanding of current challenges in using these data to understand the impact of COVID-19 on Black, Indigenous, and People of Color. Streamlining surveillance processes to decrease reporting incidence and align reporting requirements with an Office of Management and Budget-compliant collection of data on race and ethnicity would improve the completeness of data on race and ethnicity for national COVID-19 case surveillance.
Funding Information
  • Indian Health Service (U1B1IHS0006-21-00)