Generic placeholder image

Current Medical Imaging

Editor-in-Chief

ISSN (Print): 1573-4056
ISSN (Online): 1875-6603

Research Article

An Efficient Ensemble-based Machine Learning approach for Predicting Chronic Kidney Disease

Author(s): Divyanshi Chhabra, Mamta Juneja* and Gautam Chutani

Volume 20, 2024

Published on: 13 July, 2023

Article ID: e080523216634 Pages: 12

DOI: 10.2174/1573405620666230508104538

open_access

conference banner
Abstract

Background: Chronic kidney disease (CKD) is a long-term risk to one’s health that can result in kidney failure. CKD is one of today's most serious diseases, and early detection can aid in proper treatment. Machine learning techniques have proven to be reliable in the early medical diagnosis.

Objective: The paper aims to perform CKD prediction using machine learning classification approaches. The dataset used for the present study for detecting CKD was obtained from the machine learning repository at the University of California, Irvine (UCI).

Methods: In this study, twelve machine learning-based classification algorithms with full features were used. Since the CKD dataset had a class imbalance issue, the Synthetic Minority Over-Sampling technique (SMOTE) was used to alleviate the problem of class imbalance and review the performance based on machine learning classification models using the K fold cross-validation technique. The proposed work compares the results of twelve classifiers with and without the SMOTE technique, and then the top three classifiers with the highest accuracy, Support Vector Machine, Random Forest, and Adaptive Boosting classification algorithms were selected to use the ensemble technique to improve performance.

Results: The accuracy achieved using a stacking classifier as an ensemble technique with cross-validation is 99.5%.

Conclusion: The study provides an ensemble learning approach in which the top three best-performing classifiers in terms of cross-validation results are stacked in an ensemble model after balancing the dataset using SMOTE. This proposed technique could be applied to other diseases in the future, making disease detection less intrusive and cost-effective.

Keywords: Chronic kidney disease, Machine learning, Feature engineering, SMOTE, Ensemble technique, Kidney failure.

[1]
Battineni Gopi. Applications of machine learning predictive models in the chronic disease diagnosis. J Pers Med 2020; 10(2): 21.
[http://dx.doi.org/10.3390/jpm10020021]
[2]
Shamrat , Javed FMM. Implementation of machine learning algorithms to detect the prognosis rate of kidney disease. 2020 IEEE International Conference for Innovation in Technology (INOCON).
[3]
McClellan WM, Warnock DG, Judd S, et al. Albuminuria and racial disparities in the risk for ESRD. J Am Soc Nephrol 2011; 22(9): 1721-8.
[http://dx.doi.org/10.1681/ASN.2010101085] [PMID: 21868498]
[4]
Haroun MK, Jaar BG, Hoffman SC, Comstock GW, Klag MJ, Coresh J. Risk factors for chronic kidney disease: a prospective study of 23,534 men and women in Washington County, Maryland. J Am Soc Nephrol 2003; 14(11): 2934-41.
[http://dx.doi.org/10.1097/01.ASN.0000095249.99803.85] [PMID: 14569104]
[5]
Mula-Abed WAS, Rasadi KA, Al-Riyami D. Estimated glomerular filtration rate (eGFR): A serum creatinine-based test for the detection of chronic kidney disease and its impact on clinical practice. Oman Med J 2012; 27(4): 339-40.
[http://dx.doi.org/10.5001/omj.2012.87] [PMID: 23071895]
[6]
Levey AS, Cattran D, Friedman A, et al. Proteinuria as a surrogate outcome in CKD: report of a scientific workshop sponsored by the National Kidney Foundation and the US Food and Drug Administration. Am J Kidney Dis 2009; 54(2): 205-26.
[http://dx.doi.org/10.1053/j.ajkd.2009.04.029] [PMID: 19577347]
[7]
Gerogianni S. Concerns of patients on dialysis: A research study. Health Sci J 2014; 8(4): 423-37.
[8]
Bala S, Krishan K. A literature review on kidney disease prediction using data mining classification technique. International Journal of Computer Science and Mobile Computing 2014; 3(7): 960-7.
[9]
Chetty N, Vaisla KS, Sudarsan SD. Role of attributes selection in classification of Chronic Kidney Disease patients. 2015 International Conference on Computing, Communication and Security (ICCCS).
[http://dx.doi.org/10.1109/CCCS.2015.7374193]
[10]
Charleonnan A. Predictive analytics for chronic kidney disease using machine learning techniques. 2016 Management and Innovation Technology International Conference (MITicon). MIT-80-3.
[http://dx.doi.org/10.1109/MITICON.2016.8025242]
[11]
Polat H, Danaei Mehr H, Cetin A. Diagnosis of chronic kidney disease based on a support vector machine by feature selection methods. J Med Syst 2017; 41(4): 55.
[http://dx.doi.org/10.1007/s10916-017-0703-x] [PMID: 28243816]
[12]
Gunarathne WH, Perera KD, Kahandawaarachchi KA. Performance evaluation on machine learning classification techniques for disease classification and forecasting through data analytics for chronic kidney disease (CKD). 2017 IEEE 17th International Conference on Bioinformatics and Bioengineering (BIBE). 291-6.
[13]
Xiao J, Ding R, Xu X, et al. Comparison and development of machine learning tools in the prediction of chronic kidney disease progression. J Transl Med 2019; 17(1): 119.
[http://dx.doi.org/10.1186/s12967-019-1860-0] [PMID: 30971285]
[14]
Qin J, Chen L, Liu Y, Liu C, Feng C, Chen B. A Machine Learning Methodology for Diagnosing Chronic Kidney Disease. IEEE Access 2020; 8: 20991-1002.
[http://dx.doi.org/10.1109/ACCESS.2019.2963053]
[15]
Deepika B, Rao VKR, Rampure DN, Prajwal P, Gowda DG, et al. Early Prediction of Chronic Kidney Disease by using Machine Learning Techniques. Am J Comput Sci Eng Surv 2020; 8(2): 7.
[16]
Chittora P, Chaurasia S, Chakrabarti P, et al. Prediction of Chronic Kidney Disease - A Machine Learning Perspective. IEEE Access 2021; 9: 17312-34.
[http://dx.doi.org/10.1109/ACCESS.2021.3053763]
[17]
Alanazi R. “Identification and prediction of chronic diseases using machine learning approach,” Journal of Healthcare Engineering, vol. 2022. Article ID 2022; 2826127: 1-9.
[18]
Dua D, Graff C. UCI Machine Learning Repository. Irvine, CA: University of California, School of Information and Computer Science 2019.
[19]
James B, Yoshua B. Random search for hyper-parameter optimization. J Mach Learn Res 2012; 13: 281-305.
[20]
Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 2002; 16: 321-57.
[http://dx.doi.org/10.1613/jair.953]
[21]
Roy MS, Ghosh R, Goswami D, Karthik R. Comparative Analysis of Machine Learning Methods to Detect Chronic Kidney Disease. J Phys Conf Ser 2021; 1911(1): 012005.
[http://dx.doi.org/10.1088/1742-6596/1911/1/012005]
[22]
Gupta R, Koli N, Mahor N, Tejashri N. Performance analysis of machine learning classifier for predicting chronic kidney disease. 2020 International Conference for Emerging Technology (INCET). 1-4.
[http://dx.doi.org/10.1109/INCET49848.2020.9154147]
[23]
Rabby AKMSR. Machine learning applied to kidney disease prediction: Comparison study. 2019 10th international conference on computing, communication and networking technologies (ICCCNT) IEEE.
[24]
Emon MU. Performance analysis of chronic kidney disease through machine learning approaches. 2021 6th International Conference on Inventive Computation Technologies (ICICT) IEEE.
[http://dx.doi.org/10.1109/ICICT50816.2021.9358491]
[25]
Wibawa MS, Dendi MIM, Agus WPIM. Boosted classifier and features selection for enhancing chronic kidney disease diagnosis. 2017 5th international conference on cyber and IT service management (CITSM) IEEE.
[26]
Yildirim P. Chronic kidney disease prediction on imbalanced data by multilayer perceptron: Chronic kidney disease prediction. 2017 IEEE 41st annual computer software and applications conference (COMPSAC).
[http://dx.doi.org/10.1109/COMPSAC.2017.84]
[27]
Hossain MM. Analysis of the performance of feature optimization techniques for the diagnosis of machine learning-based chronic kidney disease. Mach Learn 2022; 9: 100330.
[28]
Chowdhury NH. Performance analysis of Conventional machine learning algorithms for identification of chronic kidney disease in type 1 diabetes mellitus patients. Diagnostics 2021; 11(12): 2267.
[http://dx.doi.org/10.3390/diagnostics11122267]
[29]
Abdar M, Acharya UR, Sarrafzadegan N, Makarenkov V. NE-nu-SVC: a new nested ensemble clinical decision support system for effective diagnosis of coronary artery disease. IEEE Access 2019; 7: 167605-20.
[http://dx.doi.org/10.1109/ACCESS.2019.2953920]
[30]
Ogunleye A, Wang QG. XGBoost model for chronic kidney disease diagnosis. IEEE/ACM Trans Comput Biol Bioinformatics 2020; 17(6): 2131-40.
[http://dx.doi.org/10.1109/TCBB.2019.2911071] [PMID: 30998478]

© 2024 Bentham Science Publishers | Privacy Policy