Efficacy of machine learning models for the prediction of death occurrence and counts associated with foodborne illnesses and hospitalizations in the United States

Foodborne outbreak data released through national surveillance systems provide essential information about the results of outbreak investigations. This study evaluates the efficacy of machine learning (ML) models for predicting death occurrence and death counts associated with foodborne illnesses and hospitalizations in the United States. Confirmed foodborne outbreaks were obtained from the Centers for Disease Control and Prevention's National Outbreak Reporting System (NORS). Foodborne pathogens causing at least 10 deaths in total were selected for analysis. Models were evaluated by binary classification performance (accuracy, %) and by prediction efficacy for death counts (mean absolute error, MAE). A total of 10,069 foodborne outbreaks with a confirmed single etiology resulted in 275,827 illnesses, 18,579 hospitalizations, and 458 deaths. Salmonella was the leading causative agent (54.23 %) of bacterial foodborne outbreaks, followed by pathogenic Escherichia coli (12.13 %). Norovirus (96.69 %) and Cyclospora cayetanensis (60.76 %) were the major causes of viral and protozoan/parasite foodborne outbreaks, respectively. The classification accuracy of ML models ranged from 88.9 to 94.5 % for the overall prediction of death occurrence associated with foodborne illnesses and hospitalizations. The MAE of ML models for death counts remained below 0.9, except for Listeria monocytogenes, with an average MAE of 134.1 ± 11.1. These results indicate that ML algorithms can potentially predict death occurrence or death counts caused by foodborne etiological agents from the numbers of illnesses and hospitalizations, which could help improve public health safety. © 2025 Elsevier B.V., All rights reserved.
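
The two evaluation measures named in the abstract, classification accuracy (%) and mean absolute error (MAE), can be illustrated with a minimal Python sketch. The function names and the toy values below are hypothetical and are not taken from the study or from NORS; the record does not state which models or software the authors used, so this only shows how the metrics themselves are computed.

```python
from typing import Sequence


def accuracy_percent(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Percentage of outbreaks whose death-occurrence label (0 or 1) is predicted correctly."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return 100.0 * correct / len(y_true)


def mean_absolute_error(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Average absolute difference between observed and predicted death counts."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical toy data (NOT from the study): five outbreaks.
occurred_true = [0, 0, 1, 0, 1]   # 1 = at least one death was reported
occurred_pred = [0, 0, 1, 1, 1]   # labels from some binary classifier
counts_true = [0, 0, 2, 0, 5]     # observed deaths per outbreak
counts_pred = [0, 1, 2, 0, 3]     # death counts from some regression model

acc = accuracy_percent(occurred_true, occurred_pred)
mae = mean_absolute_error(counts_true, counts_pred)
print(f"accuracy = {acc:.1f} %, MAE = {mae:.2f}")
```

Because MAE is expressed in deaths per outbreak, a value below 0.9 means that predicted counts differ from observed counts by less than one death on average.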

Keywords

Bacteria, Classification, Data Mining, Protozoan/parasite, Public Health Informatics, Regression, Virus, Algorithm, Article, Bacterium, Binary Classification, Cyclospora Cayetanensis, Cyclosporiasis, Death, Diagnostic Accuracy, Disease Association, Disease Surveillance, Efficacy Parameters, Epidemic, Escherichia Coli Infection, Food Poisoning, Food Safety, Foodborne Pathogen, Hospitalization, Human, Listeria Monocytogenes, Listeriosis, Machine Learning, Mean Absolute Error, Nonhuman, Norovirus, Norovirus Infection, Parasite, Pathogenic Escherichia Coli, Prediction, Protozoal Infection, Protozoon, Public Health Service, Salmonella, Salmonella Food Poisoning, United States, Virus Infection

Source

Microbial Risk Analysis

Scopus Quartile

Q2

Volume

30

Link

https://doi.org/10.1016/j.mran.2025.100351
https://hdl.handle.net/20.500.12639/7346

Collection

Scopus Indexed Publications Collection
