AUTHOR=Rogers Paul, McCall Thomas, Zhang Ying, Reese Jessica, Wang Dong, Tong Weida

TITLE=Leveraging AI to improve disease screening among American Indians: insights from the Strong Heart Study

JOURNAL=Experimental Biology and Medicine

VOLUME=Volume 249 - 2024

YEAR=2025

URL=https://www.ebm-journal.org/journals/experimental-biology-and-medicine/articles/10.3389/ebm.2024.10341

DOI=10.3389/ebm.2024.10341

ISSN=1535-3699

ABSTRACT=<p>The performance of a disease screening test is measured by its sensitivity and specificity, which indicate how well the test discriminates between those with and without the condition. High values of both are typically desired. Research projects on the health of American Indians frequently develop machine learning algorithms as predictors of conditions in this population. In essence, these models serve as <italic>in silico</italic> screening tests for disease. A screening test’s sensitivity and specificity, typically determined during the development of the test, describe its performance at the population level and are not affected by the prevalence of the disease. A screening test’s positive predictive value (PPV), by contrast, depends on the prevalence of the outcome. As the number of artificial intelligence and machine learning models for predicting disease outcomes grows, it is crucial to understand whether the PPVs of these <italic>in silico</italic> methods suffer, as those of traditional screening tests do, when the outcome has low prevalence. The Strong Heart Study (SHS) is an epidemiological study of American Indians, and its data have been used in predictive models of health outcomes. We used SHS data from the samples taken during Phases V and VI. Logistic regression, artificial neural network, and random forest models were used as <italic>in silico</italic> screening tests within the SHS group. Their sensitivity, specificity, and PPV were assessed for health outcomes of varying prevalence among the SHS subjects. Although sensitivity and specificity remained high in these <italic>in silico</italic> screening tests, the PPV declined as the outcome became rarer.
Machine learning models used as <italic>in silico</italic> screening tests are subject to the same drawbacks as traditional screening tests when the outcome to be predicted is of low prevalence.</p>
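The dependence of PPV on prevalence described in the abstract follows from Bayes' rule: PPV = (sensitivity × prevalence) / (sensitivity × prevalence + (1 − specificity) × (1 − prevalence)). The sketch below is only an illustration of that standard relationship, not the authors' analysis code. The function name `positive_predictive_value` and the sensitivity, specificity, and prevalence values are hypothetical and are not taken from the Strong Heart Study.

```python
def positive_predictive_value(sensitivity, specificity, prevalence):
    """Return PPV from sensitivity, specificity, and prevalence via Bayes' rule.

    All three arguments are proportions in [0, 1]. The name and signature are
    illustrative assumptions, not part of the published study.
    """
    for name, value in (("sensitivity", sensitivity),
                        ("specificity", specificity),
                        ("prevalence", prevalence)):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be between 0 and 1, got {value}")

    true_positive_rate = sensitivity * prevalence
    false_positive_rate = (1.0 - specificity) * (1.0 - prevalence)
    denominator = true_positive_rate + false_positive_rate
    if denominator == 0.0:
        # No positive test results are possible, so PPV is undefined.
        return float("nan")
    return true_positive_rate / denominator


if __name__ == "__main__":
    # Hypothetical test characteristics held fixed while prevalence varies.
    sensitivity, specificity = 0.90, 0.90
    for prevalence in (0.50, 0.20, 0.05, 0.01):
        ppv = positive_predictive_value(sensitivity, specificity, prevalence)
        print(f"prevalence={prevalence:.2f}  PPV={ppv:.3f}")
```

With sensitivity and specificity held constant, the printed PPV falls as prevalence falls, which is the pattern the abstract reports for the <italic>in silico</italic> screening tests.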