Comparing regression models with count data to artificial neural network and ensemble models for prediction of generic escherichia coli population in agricultural ponds based on weather station measurements Measurements

dc.contributor.authorBuyrukoğlu, Gonca
dc.contributor.authorBuyrukoğlu, Selim
dc.contributor.authorTopalcengiz, Zeynal
dc.date.accessioned2021-04-21T11:05:21Z
dc.date.available2021-04-21T11:05:21Z
dc.date.issued2021en_US
dc.departmentFakülteler, Mühendislik-Mimarlık Fakültesi, Gıda Mühendisliği Bölümüen_US
dc.description.abstractAbstract View references (58) Indicator microorganisms are monitored in agricultural waters to foster produce safety. Various prediction models are used to estimate the population of indicator microorganisms and pathogens when no observation is available. The purpose of this study was to compare the performance of regression models with count data (zero-inflated Poisson and hurdle negative binomial) to artificial neural network and ensemble models (random forest and AdaBoost) for the prediction of generic Escherichia coli population in agricultural surface waters in relation with weather station measurements. Two-part count data models were built on E. coli population count frequencies (0, [1,10), [10,100), [100,1000), [1000, 10000), (>=10000)) based on the data structure. The use of artificial neural network, AdaBoost, and random forest were determined based on the mean absolute error (MAE) value over pre-tested six models. The MAE was also used to compare the performance of two-part count data models with artificial neural network and ensemble models. Over-dispersed E. coli population count frequencies was calculated between 2.2 and 52.2% for all ponds. Observed and predicted zero E. coli population counts for all ponds were matched from 82 to 100% for zero-inflated Poisson and 100% for hurdle negative binomial regression models. Overdispersion reduced the performance of tested models. AdaBoost-Twelve Estimators had the best performance with the lowest MAE values for all ponds (from 0.87 to 46.60). The ensemble models used in this study provided more promising performance when compared to tested regression models with count data. © 2021en_US
dc.identifier.doi10.1016/j.mran.2021.100171
dc.identifier.issn23523522
dc.identifier.orcid0000-0002-2113-7319
dc.identifier.scopus2-s2.0-85103991376
dc.identifier.scopusqualityQ2
dc.identifier.urihttps://doi.org/10.1016/j.mran.2021.100171
dc.identifier.urihttps://hdl.handle.net/20.500.12639/2745
dc.identifier.wosWOS:000834988400003
dc.identifier.wosqualityQ4
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.institutionauthorTopalcengiz, Zeynal
dc.language.isoen
dc.publisherElsevier B.V.en_US
dc.relation.ispartofMicrobial Risk Analysisen_US
dc.relation.publicationcategoryMakale - Ulusal Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectAdaBoosten_US
dc.subjectHurdle modelen_US
dc.subjectNegative binomial modelen_US
dc.subjectProduce safetyen_US
dc.subjectRandom foresten_US
dc.subjectZero inflated Poisson modelen_US
dc.titleComparing regression models with count data to artificial neural network and ensemble models for prediction of generic escherichia coli population in agricultural ponds based on weather station measurements Measurementsen_US
dc.typeArticle

Dosyalar

Lisans paketi

Listeleniyor 1 - 1 / 1
Yükleniyor...
Küçük Resim
İsim:
license.txt
Boyut:
1.44 KB
Biçim:
Item-specific license agreed upon to submission
Açıklama: