Detection of acute dengue virus infection, with and without concurrent malaria infection, in a cohort of febrile children in Kenya, 2014-2019, by clinicians or machine learning algorithms

PLOS Glob Public Health. 2023 Jul 26;3(7):e0001950. doi: 10.1371/journal.pgph.0001950. eCollection 2023.

Abstract

Poor access to diagnostic testing in resource limited settings restricts surveillance for emerging infections, such as dengue virus (DENV), to clinician suspicion, based on history and exam observations alone. We investigated the ability of machine learning to detect DENV based solely on data available at the clinic visit. We extracted symptom and physical exam data from 6,208 pediatric febrile illness visits to Kenyan public health clinics from 2014-2019 and created a dataset with 113 clinical features. Malaria testing was available at the clinic site. DENV testing was performed afterwards. We randomly sampled 70% of the dataset to develop DENV and malaria prediction models using boosted logistic regression, decision trees and random forests, support vector machines, naïve Bayes, and neural networks with 10-fold cross validation, tuned to maximize accuracy. 30% of the dataset was reserved to validate the models. 485 subjects (7.8%) had DENV, and 3,145 subjects (50.7%) had malaria. 220 (3.5%) subjects had co-infection with both DENV and malaria. In the validation dataset, clinician accuracy for diagnosis of malaria was high (82% accuracy, 85% sensitivity, 80% specificity). Accuracy of the models for predicting malaria diagnosis ranged from 53-69% (35-94% sensitivity, 11-80% specificity). In contrast, clinicians detected only 21 of 145 cases of DENV (80% accuracy, 14% sensitivity, 85% specificity). Of the six models, only logistic regression identified any DENV case (8 cases, 91% accuracy, 5.5% sensitivity, 98% specificity). Without diagnostic testing, interpretation of clinical findings by humans or machines cannot detect DENV at 8% prevalence. Access to point-of-care diagnostic tests must be prioritized to address global inequities in emerging infections surveillance.

Grants and funding

This work was supported by the National Institute of Allergy and Infectious Diseases at the National Institutes of Health [NIH K23 AI127909] to D.M.V. and [NIH R01 AI102918] to A.D.L.; the Stanford Maternal & Child Health Research Institute, Lucille Packard Foundation for Children’s Health, and Spectrum (the Stanford Center for Clinical & Translational Research & Education) [KL2 TR 001083, UL1 TR 001085] to D.M.V. The funders of the study had no role in study design, data collection, data analysis, data interpretation, or writing of the report, or in the decision to submit the paper for publication.