Treffer: Design a model to predict incomplete immunization among Ethiopian children using ensemble machine learning algorithms.
BMC Public Health. 2020 Sep 5;20(1):1362. (PMID: 32891120)
BMC Public Health. 2023 May 2;23(1):802. (PMID: 37131146)
BMC Public Health. 2020 Nov 26;20(1):1803. (PMID: 33243208)
BMC Pediatr. 2022 May 12;22(1):268. (PMID: 35550040)
JMIR Public Health Surveill. 2018 Sep 04;4(3):e63. (PMID: 30181112)
PLOS Glob Public Health. 2023 Aug 16;3(8):e0002088. (PMID: 37585408)
Healthcare (Basel). 2023 Jun 20;11(12):. (PMID: 37372925)
BMC Infect Dis. 2023 Jan 23;23(1):49. (PMID: 36690950)
Glob Pediatr Health. 2020 Oct 30;7:2333794X20968681. (PMID: 33241080)
PLoS One. 2024 Mar 7;19(3):e0299384. (PMID: 38451961)
PLoS One. 2023 Sep 13;18(9):e0291499. (PMID: 37703252)
Lancet. 2024 May 25;403(10441):2307-2316. (PMID: 38705159)
Ethiop J Health Sci. 2021 Jan;31(1):63-72. (PMID: 34158753)
BMC Res Notes. 2019 Apr 29;12(1):241. (PMID: 31036071)
Health Res Policy Syst. 2021 Aug 11;19(Suppl 2):68. (PMID: 34380520)
J Med Internet Res. 2021 Nov 25;23(11):e33231. (PMID: 34751650)
PLoS One. 2022 Oct 20;17(10):e0274501. (PMID: 36264780)
J Infect Dis. 2021 Sep 30;224(12 Suppl 2):S452-S467. (PMID: 34590130)
Hum Vaccin Immunother. 2021 Jan 2;17(1):247-254. (PMID: 32703070)
Front Public Health. 2023 Feb 28;11:1085279. (PMID: 36926180)
Front Public Health. 2024 Apr 12;12:1291495. (PMID: 38716249)
Int J Equity Health. 2020 May 12;19(1):65. (PMID: 32398089)
J Multidiscip Healthc. 2021 Sep 21;14:2657-2667. (PMID: 34584421)
BMJ Open. 2023 Mar 8;13(3):e069278. (PMID: 36889833)
PLoS One. 2021 Dec 7;16(12):e0260258. (PMID: 34874949)
BMC Public Health. 2016 Jan 12;16:27. (PMID: 26757893)
Sci Rep. 2024 May 21;14(1):11529. (PMID: 38773175)
PLoS One. 2023 Oct 18;18(10):e0288867. (PMID: 37851705)
Weitere Informationen
Immunization is a cost-effective public health intervention globally, including in Ethiopia. However, the study focused on children aged 0-59 months and analyzed factors influencing incomplete immunization using ensemble machine learning techniques. A total of 16,394 EDHS datasets were used, with 80% for training and 20% for testing sets. Accordingly, the training set consisted of 13,115 samples, while the testing set contained 3,279 samples. Ensemble learning algorithms were employed, including Bagging methods (Bagging meta-estimator, Random Forest), Boosting methods (Gradient Boosting, XGBoost, LightGBM, AdaBoost, and CatBoost), and Voting ensembles combining both bagging and boosting models. Additionally, Stacking was performed using XGBoost and CatBoost as base models, with other machine learning algorithms such as Random Forest, K-Nearest Neighbors (KNN), Artificial Neural Networks (ANN), Support Vector Machines (SVM), and Logistic Regression as meta-models. All models were implemented using the Python programming language. On the tested data, bagging meta-estimator + XGBoost voting model executed the highest performance result of accuracy (95.94%), f1-score (95.89%), recall (94.81%), and precision (97.07%), for visualizing using a confusion matrix and AUC-ROC value of 96%, and the cross-validation score of 95.75% for its reliability. Also, the most influential factors for incomplete immunization include marital status, residence, and others. This study aims to identify key factors influencing immunization coverage among Ethiopian children under the age of five and improve with ensemble machine learning algorithms. The findings provide valuable insights for targeted interventions, supporting improved immunization practices and contributing to better child health outcomes.
(© 2025. The Author(s).)
Declarations. Competing interest: The authors assert that there are no conflicts of interest. Consent for publication: The publishing consent was not applicable. Ethical consideration: The EDHS data, which is accessible to the public on the DHS website, was utilized from ( http://www.dhsprogram.com ). A consent letter was obtained to download the dataset for this study after the International Review Board of Demographic and Health Surveys (DHS) program data archivists waived informed consent after the consent paper was submitted to the DHS program. We carefully followed all DHS approval requirements. After registering and submitting a valid request, the study’s methodology, which conforms to relevant standards, is available. There are no names of individuals or household addresses in the datasets.