Evaluating the Quality and Usability of Open Data for Public Health Research: A Systematic Review of Data Offerings on 3 Open Data Platforms
- 1 July 2017
- journal article
- review article
- Published by Ovid Technologies (Wolters Kluwer Health) in Journal of Public Health Management and Practice
- Vol. 23 (4), e5-e13
- https://doi.org/10.1097/phh.0000000000000388
Abstract
Context: Government datasets are newly available on open data platforms that are publicly accessible, available in nonproprietary formats, free of charge, and with unlimited use and distribution rights. They provide opportunities for health research, but their quality and usability are unknown.
Objective: To describe available open health data, identify whether data are presented in a way that is aligned with best practices and usable for researchers, and examine differences across platforms.
Design: Two reviewers systematically reviewed a random sample of data offerings on NYC OpenData (New York City, all offerings, n = 37), Health Data NY (New York State, 25% sample, n = 71), and HealthData.gov (US Department of Health and Human Services, 5% sample, n = 75), using a standard coding guide.
Setting: Three open health data platforms at the federal, New York State, and New York City levels.
Main Outcome Measures: Data characteristics from the coding guide were aggregated into summary indices for intrinsic data quality, contextual data quality, adherence to the Dublin Core metadata standards, and the 5-star open data deployment scheme.
Results: One quarter of the offerings were structured datasets; other presentation styles included charts (14.7%), documents describing data (12.0%), maps (10.9%), and query tools (7.7%). Health Data NY had higher intrinsic data quality (P < .001), contextual data quality (P < .001), and Dublin Core metadata standards adherence (P < .001). All met basic “web availability” open data standards; fewer met higher standards of “hyperlinked to other data.”
Conclusions: Although all platforms need improvement, they already provide readily available data for health research. Sustained effort on improving open data websites and metadata is necessary for ensuring researchers use these data, thereby increasing their research value.