"Harnessing Sociodemographic and Anthropometric Insights to Predict Typ" by Rico Kurniawan, Rezki Yunanda et al. 2775-0574">
  •  
  •  
 

Abstract

The complexity of diabetes makes early diagnosis and effective management challenging for healthcare settings. This study developed and evaluated models using machine learning algorithms to predict the risk of diabetes. The data were primarily sourced from the Indonesia Family Life Survey (IFLS). Diabetes status was identified using A1c whole blood sample values (A1c WBS ³5.7%). Sociodemographic and anthropometric factors were set as predictors, most of which were significantly correlated with diabetes status. Discrete machine learning algorithms such as decision tree, k-nearest neighbors (KNN), random forest (RF), naïve Bayer, support vector machine (SVM), and neural network (MLP) were applied to construct and compare prediction models for classification. MLP model exhibited the highest performance with AUC 67%, Accuracy 87.1%, F1 Score 87.1%, Recall 87.1% and Precision 82.4%. Overall, machine learning models were found highly viable in predicting disease outcomes with increasing accuracy. Their usage would allow healthcare professionals to make more informed patient care decisions. Future research initiatives should attempt to further enhance these models.

References

1. Joshi RD, Dhakal CK. Predicting Type 2 Diabetes Using Logistic Regression and Machine Learning Approaches. Int J Environ Res Public Health. 2021;18(14):7346.

2. The Emerging Risk Factors Collaboration, Sarwar N, Gao P, Kondapally Seshasai SR, Gobin R, Kaptoge S, et al. Diabetes mellitus, fasting blood glucose concentration, and risk of vascular disease: a collaborative meta-analysis of 102 prospective studies. The Lancet. 2010;375(9733):2215–22.

3. WHO. Global Report on Diabetes [Internet]. Geneva; 2016 [cited 2021 Mar 19]. Available from: %5C

4. Ministry of Health of Republic Indonesia. Diabetes Mellitus in Indonesia, 2020. Jakarta; 2020.

5. Mboi N, Murty Surbakti I, Trihandini I, Elyazar I, Houston Smith K, Bahjuri Ali P, et al. On the road to universal health care in Indonesia, 1990–2016: a systematic analysis for the Global Burden of Disease Study 2016. The Lancet. 2018;392(10147):581–91.

6. Almutairi N, Hosseinzadeh H, Gopaldasani V. The effectiveness of patient activation intervention on type 2 diabetes mellitus glycemic control and self-management behaviors: A systematic review of RCTs. Prim Care Diabetes. 2020;14(1):12–20.

7. Fregoso-Aparicio L, Noguez J, Montesinos L, García-García JA. Machine learning and deep learning predictive models for type 2 diabetes: a systematic review. Diabetol Metab Syndr. 2021;13:148.

8. Arifin B, Idrus LR, van Asselt ADI, Purba FD, Perwitasari DA, Thobari JA, et al. Health-related quality of life in Indonesian type 2 diabetes mellitus outpatients measured with the Bahasa version of EQ-5D. Quality of Life Research. 2019;28(5):1179–90.

9. Vigersky RA. An Overview of Management Issues in Adult Patients with Type 2 Diabetes Mellitus. J Diabetes Sci Technol. 2011;5(2):245–50.

10. Valsalan P, Hasan NU, Farooq U, Zghaibeh M, Baig I. IoT Based Expert System for Diabetes Diagnosis and Insulin Dosage Calculation. Healthcare. 2022;11(1):12.

11. Suresh K, Obulesu O, Ramudu BV. Diabetes Prediction using Machine Learning Techniques. Helix - The Scientific Explorer | Peer Reviewed Bimonthly International Journal. 2020;10(02):136–42.

12. Al-Badri M, Hamdy O. Diabetes clinic reinvented: will technology change the future of diabetes care? Ther Adv Endocrinol Metab. 2021;12:2042018821995368.

13. Davenport T, Kalakota R. The potential for artificial intelligence in healthcare. Future Healthc J. 2019;6(2):94–8.

14. Ahamed BS, Arya MS, Nancy V AO. Prediction of Type-2 Diabetes Mellitus Disease Using Machine Learning Classifiers and Techniques. Front Comput Sci. 2022;4:835242.

15. Zhao M, Song L, Sun L, Wang M, Wang C, Yao S, et al. Associations of Type 2 Diabetes Onset Age With Cardiovascular Disease and Mortality: The Kailuan Study. Diabetes Care. 2021;44(6):1426–32.

16. Harreiter J, Kautzky-Willer A. Sex and Gender Differences in Prevention of Type 2 Diabetes. Front Endocrinol (Lausanne). 2018;9:220.

17. Kautzky-Willer A, Harreiter J, Pacini G. Sex and Gender Differences in Risk, Pathophysiology and Complications of Type 2 Diabetes Mellitus. Endocr Rev. 2016;37(3):278–316.

18. Liao LZ, Chen ZC, Li WD, Zhuang XD, Liao XX. Causal effect of education on type 2 diabetes: A network Mendelian randomization study. World J Diabetes. 2021;12(3):261–77.

19. Ranasinghe P, Cooray DN, Jayawardena R, Katulanda P. The influence of family history of Hypertension on disease prevalence and associated metabolic risk factors among Sri Lankan adults. BMC Public Health. 2015;15:576.

20. Le Boudec J, Marques-Vidal P, Cornuz J, Clair C. Smoking cessation and the incidence of pre-diabetes and type 2 diabetes: a cohort study. J Diabetes Complications. 2016;30(1):43–8.

21. Yuan S, Larsson SC. An atlas on risk factors for type 2 diabetes: a wide-angled Mendelian randomisation study. Diabetologia. 2020;63(11):2359–71.

22. van Oort S, Beulens JWJ, van Ballegooijen AJ, Grobbee DE, Larsson SC. Association of Cardiovascular Risk Factors and Lifestyle Behaviors With Hypertension. Hypertension. 2020;76(6):1971–9.

23. Ismail L, Materwala H, Al Kaabi J. Association of risk factors with type 2 diabetes: A systematic review. Comput Struct Biotechnol J. 2021;19:1759–85.

24. Zhang Z. Introduction to machine learning: k-nearest neighbors. Ann Transl Med. 2016;4(11):218.

25. Mansouri S, Boulares S, Chabchoub S. Machine Learning for Early Diabetes Detection and Diagnosis. J Wirel Mob Netw Ubiquitous Comput Dependable Appl. 2024;15(1):216–30.

26. Patel HH, Prajapati P. Study and Analysis of Decision Tree Based Classification Algorithms. International Journal of Computer Sciences and Engineering. 2018;6(10):74–8.

27. Olusanya MO, Ogunsakin RE, Ghai M, Adeleke MA. Accuracy of Machine Learning Classification Models for the Prediction of Type 2 Diabetes Mellitus: A Systematic Survey and Meta-Analysis Approach. Int J Environ Res Public Health. 2022;19(21):14280.

28. Kulkarni VY, Sinha PK. Random Forest Classifiers :A Survey and Future Research Directions. International Journal of Advanced Computing. 2013;36(1):1144–53.

29. Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20:273–97.

30. Popescu MC, Perescu-Popescu L, Balas VE, Mastorakis NE. Multilayer Perceptron and Neural Networks. WSEAS TRANSACTIONS ON CIRCUITS AND SYSTEMS. 2009;8(7):579–88.

31. Butt UM, Letchmunan S, Ali M, Hassan FH, Baqir A, Sherazi HHR. Machine Learning Based Diabetes Classification and Prediction for Healthcare Applications. J Healthc Eng. 2021;2021:9930985.

32. Saito T, Rehmsmeier M. The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets. PLoS One. 2015;10(3):e0118432.

33. Powers DMW. Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation. International Journal of Machine Learning Technology . 2011;2(1):37–63.

34. Sokolova M, Japkowicz N, Szpakowicz S. Beyond Accuracy, F-Score and ROC: A Family of Discriminant Measures for Performance Evaluation. In: Australasian Joint Conference on Artificial Intelligence. Melbourne: Wiley; 2006. p. 1015–21.

35. Darmawan ES, Permanasari VY, Nisrina LV, Kusuma D, Hasibuan SR, Widyasanti N. Behind the Hospital Ward: In-Hospital Mortality of Type 2 Diabetes Mellitus Patients in Indonesia (Analysis of National Health Insurance Claim Sample Data). Int J Environ Res Public Health. 2024;21(5):581.

36. Deberneh HM, Kim I. Prediction of Type 2 Diabetes Based on Machine Learning Algorithm. Int J Environ Res Public Health. 2021;18(6):3317.

37. Agliata A, Giordano D, Bardozzo F, Bottiglieri S, Facchiano A, Tagliaferri R. Machine Learning as a Support for the Diagnosis of Type 2 Diabetes. Int J Mol Sci. 2023;24(7):6775.

38. Zhang L, Shang X, Sreedharan S, Yan X, Liu J, Keel S, et al. Predicting the Development of Type 2 Diabetes in a Large Australian Cohort Using Machine-Learning Techniques: Longitudinal Survey Study. JMIR Med Inform. 2020;8(7):e16850.

39. Dritsas E, Trigka M. Data-Driven Machine-Learning Methods for Diabetes Risk Prediction. Sensors. 2022;22(14):5304.

40. Krishnamoorthi R, Joshi S, Almarzouki HZ, Shukla PK, Rizwan A, Kalpana C, et al. A Novel Diabetes Healthcare Disease Prediction Framework Using Machine Learning Techniques. J Healthc Eng. 2022;2022:1684017.

41. Poly TN, Islam MM, Li YCJ. Early Diabetes Prediction: A Comparative Study Using Machine Learning Techniques. Stud Health Technol Inform. 2022;295:409–13.

42. Maniruzzaman Md, Rahman MdJ, Ahammed B, Abedin MdM. Classification and prediction of diabetes disease using machine learning paradigm. Health Inf Sci Syst. 2020;8:7.

43. Fazakis N, Kocsis O, Dritsas E, Alexiou S, Fakotakis N, Moustakas K. Machine Learning Tools for Long-Term Type 2 Diabetes Risk Prediction. IEEE Access. 2021;9:103737–57.

Share

COinS
 
 

To view the content in your browser, please download Adobe Reader or, alternately,
you may Download the file to your hard drive.

NOTE: The latest versions of Adobe Reader do not support viewing PDF files within Firefox on Mac OS and if you are using a modern (Intel) Mac, there is no official plugin for viewing PDF files within the browser window.