An Ensemble Model of Machine Learning Algorithms for the Severity of Sickle Cell Disease (Scd) Among Paediatrics Patients

  • Balogun Jeremiah Ademola Obafemi Awolowo University
  • Aderounmu Temilade Obafemi Awolowo University
  • Egejuru Ngozi Chidozie
  • Idowu Peter Adebayo Obafemi Awolowo University
Keywords: Sickle Cell Disease (SCD), Disease severity, Stack-Ensemble Model, Naïve Bayes, Decision Trees, Multi-Layer Perceptron


This study was motivated at developing an ensemble of 3 supervised machine learning algorithms for the assessment of the severity of sickle cell disease (SCD) among paediatric patients. The study collected data from a tertiary hospital in south-western Nigeria following the identification of variables required for assessing the severity of SCD. The study also adopted the use of 3 supervised machine learning algorithms namely: naïve Bayes (NB), C4.5 decision trees (DT) and support vector machines (SVM) for creating the ensemble model using a 10-fold cross validation technique. The models were created by adopting the algorithms in isolation and in combination of 2 and 3 which were compared. The developed models were evaluated in order to present the model with the best performance. The results of the study showed that using an ensemble of DT and NB alone provided the best performance. The study has implications in presenting a model for improving the assessment of the severity of SCD among paediatric patients in Nigeria.

Author Biographies

Balogun Jeremiah Ademola, Obafemi Awolowo University

Department of Computer Science and Engineering,

Obafemi Awolowo University, Ile-Ife, Nigeria

Aderounmu Temilade, Obafemi Awolowo University

Department of Paediatrics and Child Health,

Obafemi Awolowo University Teaching Hospital Complex (OAUTHC),

Ile-Ife, Nigeria.

Egejuru Ngozi Chidozie

Department of Computer Science and Engineering, Obafemi Awolowo University, Ile-Ife, Nigeria.

Idowu Peter Adebayo, Obafemi Awolowo University

Department of Computer Science and Engineering,

Obafemi Awolowo University, Ile-Ife, Nigeria.


Agasa, B., Bosunga, K., Opara, A., Tshilumba, K., Dupont, E., & Vertongen, F. (2010). Prevalence of sickle cell disease in a northeastern region of the Democratic Republic of Congo: What impact on transfusion policy? Transfusion Medicine 20(1): 62 – 65.

Aliyu, Z. Y., Kato, G. J., Taylor, Jt., Babadoko, A., Mamman, A. I. & Gordeuk, V. R. (2008). Sickle cell disease and pulmonary hypertension in Africa: A global perspective and review of epidemiology, pathophysiology, and management. American Journal of Hematology 83(1): 63–70.

Aygun, B. & Odame, I. (2012). A global perspective on sickle cell disease. Pediatric Blood & Cancer 59(2): 386 – 390.

Chakravorty, S. & Williams, T. N. (2015). Sickle cell disease: A neglected chronic disease of increasing global health importance. Archives of Disease in Childhood 100(1): 48 – 53.

Goyal, A. & Kaur, R. (2016). A Survey on Ensemble Model for Loan Prediction. International Journal of Advanced research and Innovative Ideas in Education (IJARIIE), 2(1): 623 – 628.

Ikefuna, A. N. & Emodi, I. J. (2007). Hospital admission of patients with sickle cell anaemia pattern and outcome in Enugu area of Nigeria. Nigerian Journal of Clinical Practice 10(1): 24–29.

Jaing, Y., Qiu, B., Xu, C. & Li, C. (2017). The Research of Clinical Decision Support System Based on Three-Layer Knowledge base Model. Journal of Healthcare Engineering, 7: 12 – 32.

Joshi, N. & Srivastava, S. (2014). Improving Classification Accuracy using Ensemble Learning Technique (using different Decision Trees). International Journal of Computer Science and Mobile Computing, 3(5): 727 – 732.

King, M.A. (2015). Ensemble learning techniques for Structured and Unstructured Data. Unpublished PhD Thesis of the Department of Business Information Technology.

Makani, J., Cox, S. E., Soka, D., Komba, A. N., Oruo, J. & Mwamtemi, H. (2011). Mortality in sickle cell anemia in Africa: A prospective cohort study in Tanzania. PLoS ONE 6(2): 1 – 12.

Milton, J.N., Gordeuk, V.R., Taylor, J.G., Gladwin, M.T., Steinberg, M.H. & Sebastiani, P. (2014). Prediction of Fetal Hemoglobin in Sickle Cell Anemia using an Ensemble of Genetic Risk Prediction Models. Circulatory Cardiovascular Genetics, 792): 110 – 115.

Mitchell T. (1997). Machine Learning. New York: McGraw Hill

Simidjievski, N., Todorovski, L. & Dzeroski, S. (2016). Modeling Dynamic Systems with efficient Ensembles of Process-Based Models. PLoS ONE Computational Biology, 11(4): 1 – 27.

Xiao, Y., Wu, J., Lin, Z & Zhao, X. (2018). A Deep Learning-Based Multi-Model Ensemble Method for Cancer Prediction. Journal of computational Methods Programs and Biomedicine, 153: 1 - 9.

Xu, M., papageorgiou, D.P., Abidi, S.Z., Dao, M., Zhao, H. & Karniadakis, G. (2017). A Deep Convolutional Neural Network for the Classification of Red Blood Cells in Sickle Cell Anemia. PLoS ONE Computational Biology, 13(10): 1 – 12.

Research Articles