Toward fairness in artificial intelligence for medical image analysis: identification and mitigation of potential biases in the roadmap from data collection to model deployment

Open Access

26 April 2023

journal article
Published by SPIE-Intl Soc Optical Eng in Journal of Medical Imaging

Vol. 10 (06), 061104
https://doi.org/10.1117/1.jmi.10.6.061104

Abstract

PurposeTo recognize and address various sources of bias essential for algorithmic fairness and trustworthiness and to contribute to a just and equitable deployment of AI in medical imaging, there is an increasing interest in developing medical imaging-based machine learning methods, also known as medical imaging artificial intelligence (AI), for the detection, diagnosis, prognosis, and risk assessment of disease with the goal of clinical implementation. These tools are intended to help improve traditional human decision-making in medical imaging. However, biases introduced in the steps toward clinical deployment may impede their intended function, potentially exacerbating inequities. Specifically, medical imaging AI can propagate or amplify biases introduced in the many steps from model inception to deployment, resulting in a systematic difference in the treatment of different groups.ApproachOur multi-institutional team included medical physicists, medical imaging artificial intelligence/machine learning (AI/ML) researchers, experts in AI/ML bias, statisticians, physicians, and scientists from regulatory bodies. We identified sources of bias in AI/ML, mitigation strategies for these biases, and developed recommendations for best practices in medical imaging AI/ML development.ResultsFive main steps along the roadmap of medical imaging AI/ML were identified: (1) data collection, (2) data preparation and annotation, (3) model development, (4) model evaluation, and (5) model deployment. Within these steps, or bias categories, we identified 29 sources of potential bias, many of which can impact multiple steps, as well as mitigation strategies.ConclusionsOur findings provide a valuable resource to researchers, clinicians, and the public at large.

Keywords

This publication has 31 references indexed in Scilit:

STARD 2015: An Updated List of Essential Items for Reporting Diagnostic Accuracy Studies
Clinical Chemistry, 2015
Statistical issues in the comparison of quantitative imaging biomarker algorithms using pulmonary nodule volume as an example
Statistical Methods in Medical Research, 2014
Quantitative imaging biomarkers: A review of statistical methods for computer algorithm comparisons
Statistical Methods in Medical Research, 2014
Climatic Associations of British Species Distributions Show Good Transferability in Time but Low Predictive Accuracy for Range Change
PLOS ONE, 2012
Automation bias: a systematic review of frequency, effect mediators, and mitigators
Journal of the American Medical Informatics Association, 2012
Correcting an analysis of variance for clustering
British Journal of Mathematical and Statistical Psychology, 2011
Clinimetrics corner: the many faces of selection bias
Journal of Manual & Manipulative Therapy, 2010
Bias
Journal of Epidemiology and Community Health, 2004
Simultaneous Truth and Performance Level Estimation (STAPLE): An Algorithm for the Validation of Image Segmentation
IEEE Transactions on Medical Imaging, 2004
Data analysis for detection and localization of multiple abnormalities with application to mammography
Academic Radiology, 2000

Cited by 22 articles