Considerations for using predictive models that include race as an input variable: The case study of lung cancer screening

J Biomed Inform. 2023 Nov:147:104525. doi: 10.1016/j.jbi.2023.104525. Epub 2023 Oct 14.

Abstract

Indiscriminate use of predictive models incorporating race can reinforce biases present in source data and lead to an exacerbation of health disparities. In some countries, such as the United States, there is therefore a push to remove race from prediction models; however, there are still many prediction models that use race as an input. Biomedical informaticists who are given the responsibility of using these predictive models in healthcare environments are likely to be faced with questions like how to deal with race covariates in these models. Thus, there is a need for a pragmatic framework to help model users think through how to include race in their chosen model so as to avoid inadvertently exacerbating disparities. In this paper, we use the case study of lung cancer screening to propose a simple framework to guide how model users can approach the use (or non-use) of race inputs in the predictive models they are tasked with leveraging in electronic health records and clinical workflows.

Keywords: Decision framework; Health disparities; Prediction models; Race.

Publication types

  • Research Support, U.S. Gov't, P.H.S.
  • Research Support, N.I.H., Extramural

MeSH terms

  • Early Detection of Cancer*
  • Electronic Health Records
  • Humans
  • Lung Neoplasms* / diagnosis
  • United States