Jeremiah Wilson(1), Natasha Martin(2), Annie Son(3), Susan Little(1), Thomas Martin(1), Niamh Mcdonogh(4), Ashley Pitcher(4) and Orla Doyle(4); (1)University of California San Diego, (2)Medicine, University of California San Diego, (3)Gilead Sciences, (4)IQVIA
Background: Despite new guidelines from the U.S. Centers for Disease Control and Prevention and the U.S. Preventive Services Task Force recommending universal hepatitis C virus (HCV) screening for all adults aged 18 and older, many people with HCV remain undiagnosed, a gap that the COVID-19 pandemic has widened. Machine learning (ML) algorithms could improve the HCV care cascade. We evaluate the potential cost-effectiveness of an ML algorithm for identifying undiagnosed HCV patients in care, using performance data from an algorithm developed on U.S. ambulatory electronic medical records (EMR).
Methods: The algorithm was trained on data from 16 million patients in U.S. ambulatory EMR spanning primary and specialty care from 2015 to 2020. It was developed to identify undiagnosed HCV patients within a 12-month prediction window, using medical history from a 24-month lookback period with a 1-month offset. Algorithm sensitivity at various levels of positive predictive value (PPV) was assessed on an independent cross-section of the data. An HCV natural history Markov model was used to evaluate the cost-effectiveness of the ML algorithm compared to the status quo screening used to identify patients over the training data period (risk-based and birth-cohort screening, PPV ~2%). We compared the status quo to scenarios using the ML algorithm at different sensitivity levels (5-100%). We identified the optimal algorithm sensitivity, defined as the level that maximized health (measured in quality-adjusted life years, QALYs) while staying under a willingness-to-pay threshold of $100,000 (USD) per QALY gained. Based on the algorithm's performance on EMR data, we assumed patients were diagnosed 6.5 months sooner than under the status quo.
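The selection rule described above (maximize QALYs subject to the willingness-to-pay threshold) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the study's implementation: the `Scenario` class, the `pick_optimal` function, and every number in `placeholder_scenarios` are hypothetical and are not study results. It also simplifies by comparing each scenario directly against the status quo rather than stepping along an efficiency frontier.

```python
from dataclasses import dataclass
from typing import Optional, Sequence

WTP_THRESHOLD = 100_000.0  # USD per QALY gained, as stated in the Methods


@dataclass(frozen=True)
class Scenario:
    """One algorithm operating point compared against status quo (hypothetical type)."""
    sensitivity: float        # algorithm sensitivity, between 0 and 1
    incremental_cost: float   # USD per person versus status quo
    incremental_qalys: float  # QALYs per person versus status quo


def icer(s: Scenario) -> float:
    """Incremental cost-effectiveness ratio: extra cost per extra QALY."""
    if s.incremental_qalys <= 0:
        raise ValueError("ICER is undefined without a positive QALY gain")
    return s.incremental_cost / s.incremental_qalys


def pick_optimal(scenarios: Sequence[Scenario],
                 wtp: float = WTP_THRESHOLD) -> Optional[Scenario]:
    """Return the scenario with the largest QALY gain whose ICER is within wtp."""
    eligible = [s for s in scenarios
                if s.incremental_qalys > 0 and icer(s) <= wtp]
    return max(eligible, key=lambda s: s.incremental_qalys, default=None)


# Placeholder inputs for illustration only; these are NOT values from the study.
placeholder_scenarios = [
    Scenario(sensitivity=0.10, incremental_cost=20.0, incremental_qalys=0.0004),
    Scenario(sensitivity=0.20, incremental_cost=45.0, incremental_qalys=0.0006),
    Scenario(sensitivity=0.40, incremental_cost=90.0, incremental_qalys=0.0010),
    Scenario(sensitivity=0.60, incremental_cost=180.0, incremental_qalys=0.0012),
]

best = pick_optimal(placeholder_scenarios)
if best is not None:
    print(f"Chosen sensitivity: {best.sensitivity:.0%}, ICER: {icer(best):,.0f} USD/QALY")
```

With these placeholder inputs, the highest-sensitivity scenario gains the most QALYs but exceeds the threshold, so the rule falls back to the best scenario that remains under it.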
Results: The ML algorithm was cost-effective (incremental cost-effectiveness ratio, ICER, below $100,000/QALY gained) in identifying undiagnosed HCV patients at sensitivity levels of up to 40% (Figure 1). The optimal sensitivity level of 40% (PPV 0.17%) resulted in incremental costs of $96.90 [95% CI $77-$118] and incremental QALYs of 0.0011 [95% CI 0.0008-0.0014], and produced a mean ICER of $92,245/QALY gained.
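As a rough arithmetic check, the ICER and the net monetary benefit can be recomputed from the rounded point estimates reported above. This is a sketch, not the authors' calculation, and the variable names are illustrative. The ratio of the rounded means comes out somewhat below the reported mean ICER of $92,245; that gap would be consistent with rounding of the incremental QALYs or with the ICER being averaged across model simulations, though the abstract confirms neither.

```python
# Point estimates as reported in the Results (rounded values).
incremental_cost = 96.90      # USD per person versus status quo
incremental_qalys = 0.0011    # QALYs per person versus status quo
wtp_threshold = 100_000.0     # USD per QALY gained

# ICER: additional cost per additional QALY.
icer_from_means = incremental_cost / incremental_qalys

# Net monetary benefit: value of the health gain at the threshold, minus cost.
# A positive value means the strategy is cost-effective at that threshold.
net_monetary_benefit = wtp_threshold * incremental_qalys - incremental_cost

print(f"ICER from rounded means: {icer_from_means:,.0f} USD/QALY")
print(f"Net monetary benefit:    {net_monetary_benefit:,.2f} USD per person")
```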
Conclusion: ML algorithms to identify undiagnosed HCV patients could be cost-effective in the U.S.; evaluation of their real-world effectiveness is warranted. Because algorithms can be tuned to a desired tradeoff between PPV and sensitivity, economic modeling can inform where to set that tradeoff.
