Trustworthy Machine Learning for Equitable Healthcare
Project Number1F30MD020264-01
Contact PI/Project LeaderYAO, MICHAEL STEVEN YU-SHUAN
Awardee OrganizationUNIVERSITY OF PENNSYLVANIA
Description
Abstract Text
Project Summary
Infectious diseases, such as pulmonary tuberculosis and sepsis, are associated with significant patient mor-
bidity and mortality. Because of their public health significance, recent research in artificial intelligence (AI) and
machine learning (ML) have explored how deep learning models may be used as clinical decision support tools
to work alongside clinicians in improving patient care. However, it is well-documented that AI algorithms—even
those approved by the Food and Drug Administration (FDA)—are inaccurate and perform poorly on real-world pa-
tients, especially those from minority backgrounds. As a result, such tools often inadvertently propagate existing
biases due to their poor performance on historically marginalized patient populations. Especially in high-stakes
applications such as healthcare where there is a small margin for error, it is important to train neural networks
that are clinically interpretable, trustworthy, and reliable even for patients from underrepresented backgrounds.
The reasons behind biased model performance are complex, but can be largely distilled into two major causes
of algorithmic bias: (1) black-box predictive models may not be well-calibrated with true clinical decision-making
used by physicians; and (2) minority patients are underrepresented in model training datasets. In this work, I
propose a series of algorithmic and practical innovations to address these two root causes of algorithmic
bias in the clinical diagnosis and management of pulmonary tuberculosis and sepsis. I will accomplish this
task by increasing the accessibility and reliability of AI models for traditionally marginalized patient populations.
Firstly, in order to better align predictive models with clinical reasoning for the diagnosis of pulmonary tuberculosis,
I will show how publicly available vision-language foundational models can be used to improve the diagnostic
accuracy of clinicians in resource-limited settings (Aim 1). Secondly, I will propose and validate a novel
computational algorithm that leverages historical aggregate patient data to better inform the clinical care of
minority patients diagnosed with sepsis (Aim 2). These experiments will collectively demonstrate how AI al-
gorithms can be better leveraged for various applications spanning multiple domains of patient care. In providing
solutions for these clinically relevant problems, I will further develop the technical skills and scientific reasoning
needed as a future radiologist and academic researcher in machine learning. I look forward to leveraging both
my graduate education and clinical training together to ultimately become a well-rounded physician-scientist and
independent investigator.
Public Health Relevance Statement
Project Narrative
Despite recent progress in artificial intelligence (AI) and machine learning (ML), deep learning models have been
shown to perform inaccurately on patients from minority racial and ethnic backgrounds. This limitation adversely
impacts underrepresented patients through propagating existing biases and inequities in healthcare. The pro-
posed research identifies and validates computational strategies to address this issue in the diagnosis and treat-
ment of pulmonary tuberculosis, sepsis, and other diseases of important significance to public health.
National Institute on Minority Health and Health Disparities
CFDA Code
307
DUNS Number
042250712
UEI
GM1XX56LEP58
Project Start Date
17-September-2024
Project End Date
31-August-2027
Budget Start Date
17-September-2024
Budget End Date
31-August-2025
Project Funding Information for 2024
Total Funding
$53,974
Direct Costs
$53,974
Indirect Costs
Year
Funding IC
FY Total Cost by IC
2024
National Institute on Minority Health and Health Disparities
$53,974
Year
Funding IC
FY Total Cost by IC
Sub Projects
No Sub Projects information available for 1F30MD020264-01
Publications
Publications are associated with projects, but cannot be identified with any particular year of the project or fiscal year of funding. This is due to the continuous and cumulative nature of knowledge generation across the life of a project and the sometimes long and variable publishing timeline. Similarly, for multi-component projects, publications are associated with the parent core project and not with individual sub-projects.
No Publications available for 1F30MD020264-01
Patents
No Patents information available for 1F30MD020264-01
Outcomes
The Project Outcomes shown here are displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed are those of the PI and do not necessarily reflect the views of the National Institutes of Health. NIH has not endorsed the content below.
No Outcomes available for 1F30MD020264-01
Clinical Studies
No Clinical Studies information available for 1F30MD020264-01
News and More
Related News Releases
No news release information available for 1F30MD020264-01
History
No Historical information available for 1F30MD020264-01
Similar Projects
No Similar Projects information available for 1F30MD020264-01