A Systematic Literature Review of Machine Learning and Explainable AI Approaches for Diabetes Prediction: Taxonomy, Gaps, and Future Directions (2020–2025)
  • Author(s): Smarika Singh; Prof. (Dr.) Ritu Sindhu; Dr. Shivani Sharma
  • Paper ID: 1718378
  • Page: 4378-4387
  • Published Date: 28-05-2026
  • Published In: Iconic Research And Engineering Journals
  • Publisher: IRE Journals
  • e-ISSN: 2456-8880
  • Volume/Issue: Volume 9 Issue 11 May-2026
Abstract

Diabetes mellitus is a rapidly escalating global public health crisis affecting over 537 million adults worldwide, with projections suggesting 783 million by 2045. The exponential growth of health data repositories and machine learning research has produced a vast body of literature on automated diabetes risk prediction. However, this literature suffers from recurring methodological inconsistencies — particularly over-reliance on a single benchmark dataset, data leakage from improper SMOTE application, absence of threshold optimisation for real-world imbalanced distributions, and shallow explainability that hinders clinical adoption. This paper presents a systematic literature review of 34 studies published between 2020 and 2025, covering classical machine learning, ensemble methods, deep learning architectures, and explainable AI frameworks applied to diabetes prediction. We propose a six-category taxonomy of approaches, analyse comparative performance across datasets, and identify eight specific research gaps that remain unaddressed in the existing literature. We also present our own empirical work — a SMOTE-enhanced hybrid RF+MLP stacking framework validated across Pima Indian and BRFSS 2015 datasets achieving consistent AUC of 0.808 and 0.816 respectively with 90.7% clinical recall — as a representative study addressing several identified gaps. Based on the review, we outline seven concrete future research directions essential for bridging the gap between research prototypes and clinically deployable diabetes screening tools.

Keywords

Diabetes Prediction, Machine Learning, Ensemble Methods, Explainable AI, SHAP, LIME, SMOTE, Cross-Dataset Validation, Systematic Review, Random Forest, Xgboost, Deep Learning

Citations

IRE Journals:
Smarika Singh, Prof. (Dr.) Ritu Sindhu, Dr. Shivani Sharma "A Systematic Literature Review of Machine Learning and Explainable AI Approaches for Diabetes Prediction: Taxonomy, Gaps, and Future Directions (2020–2025)" Iconic Research And Engineering Journals Volume 9 Issue 11 2026 Page 4378-4387 https://doi.org/10.64388/IREV9I11-1718378

IEEE:
Smarika Singh, Prof. (Dr.) Ritu Sindhu, Dr. Shivani Sharma "A Systematic Literature Review of Machine Learning and Explainable AI Approaches for Diabetes Prediction: Taxonomy, Gaps, and Future Directions (2020–2025)" Iconic Research And Engineering Journals, 9(11) https://doi.org/10.64388/IREV9I11-1718378