An Interpretable Clinical Transformer Framework (ICTF) for Disease Prediction using EHR

Tabitha Susan Philip; Dr. Balamurugan S

doi:10.64388/IREV9I11-1717948

Paper 1717948 · Published · Vol 9 · Issue 11

Subject area: Science, Engineering and Technology · Area of research: Computer Science

DOI: https://doi.org/10.64388/IREV9I11-1717948

Abstract

Electronic health records (EHRs) contain large amounts of unstructured clinical text whose analysis requires more than traditional approaches. Transformer models such as BioBERT (Bidirectional Encoder Representations from Transformers for Biomedical Text Mining) have greatly improved prediction accuracy, but their black-box nature reduces trust in clinical applications. This paper reviews recent research on clinical NLP tasks and finds a lack of both interpretability and data standardization. It then presents ICTF (Interpretable Clinical Transformer Framework), which uses a dual-pipeline approach to compare machine learning methods with BioBERT. The framework also incorporates SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations) to provide post-prediction explanations that help clinicians interpret the model's predictions. ICTF will predict disease labels and provide visualization maps, using the MTSamples dataset available on Kaggle.
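
As a rough illustration of the post-prediction explanation idea mentioned in the abstract, the sketch below computes exact Shapley values for the tokens of one short note. It is not the authors' implementation and does not use the `shap` or `lime` libraries or BioBERT: `toy_score`, `TOY_WEIGHTS`, `TOY_BONUS`, and the example note are hypothetical stand-ins for a trained classifier and a real MTSamples record. Exact enumeration is only feasible for a handful of tokens; SHAP-style tools approximate the same quantity for real models.

```python
from itertools import combinations
from math import factorial

# Hypothetical per-token weights standing in for a trained classifier.
TOY_WEIGHTS = {"chest": 1.5, "pain": 1.0, "denies": -0.5}
# Hypothetical interaction: extra evidence when both tokens are present.
TOY_BONUS = {("chest", "pain"): 2.0}


def toy_score(tokens):
    """Toy score for one disease label, given the tokens that are present."""
    present = set(tokens)
    score = sum(TOY_WEIGHTS.get(t, 0.0) for t in present)
    for pair, bonus in TOY_BONUS.items():
        if present.issuperset(pair):
            score += bonus
    return score


def shapley_values(tokens, score_fn):
    """Exact Shapley value of each unique token by enumerating all subsets."""
    players = list(dict.fromkeys(tokens))  # unique tokens, order preserved
    n = len(players)
    values = {}
    for p in players:
        others = [t for t in players if t != p]
        total = 0.0
        for k in range(n):
            # Probability weight of a coalition of size k not containing p.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                # Marginal contribution of p when added to this coalition.
                total += weight * (score_fn(subset + (p,)) - score_fn(subset))
        values[p] = total
    return values


note = ["patient", "denies", "chest", "pain"]
attributions = shapley_values(note, toy_score)
for token, value in sorted(attributions.items(), key=lambda kv: -abs(kv[1])):
    print(f"{token:>8}: {value:+.2f}")
```

In this toy setup the attributions sum to the full score of the note (the score of an empty note is zero), and the interaction bonus is split evenly between "chest" and "pain". A token-level visualization map of the kind the abstract describes could be drawn from such per-token values.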

Keywords

Natural Language Processing (NLP), BioBERT, Interpretability, Electronic Health Records (EHR), SHAP, LIME, Disease Prediction.

How to cite this paper

Tabitha Susan Philip, Dr. Balamurugan S "An Interpretable Clinical Transformer Framework (ICTF) for Disease Prediction using EHR" Iconic Research And Engineering Journals Volume 9 Issue 11 2026 Page 2568-2572 https://doi.org/10.64388/IREV9I11-1717948

Tabitha Susan Philip, Dr. Balamurugan S, "An Interpretable Clinical Transformer Framework (ICTF) for Disease Prediction using EHR," Iconic Research And Engineering Journals, vol. 9, no. 11, May 2026, doi: https://doi.org/10.64388/IREV9I11-1717948

Tabitha Susan Philip, Dr. Balamurugan S (2026). An Interpretable Clinical Transformer Framework (ICTF) for Disease Prediction using EHR. Iconic Research And Engineering Journals, 9(11). doi: https://doi.org/10.64388/IREV9I11-1717948

Tabitha Susan Philip, Dr. Balamurugan S. "An Interpretable Clinical Transformer Framework (ICTF) for Disease Prediction using EHR." Iconic Research And Engineering Journals, vol. 9, no. 11, May 2026. Crossref, https://doi.org/10.64388/IREV9I11-1717948

@article{1717948,
      author = {Tabitha Susan Philip and Dr. Balamurugan S},
      title = {An Interpretable Clinical Transformer Framework (ICTF) for Disease Prediction using EHR},
      journal = {Iconic Research And Engineering Journals},
      year = {2026},
      volume = {9},
      number = {11},
      pages = {2568--2572},
      issn = {2456-8880},
      url = {https://www.irejournals.com/formatedpaper/1717948.pdf},
      abstract = {Electronic health records (EHRs) contain large amounts of unstructured clinical texts that require analysis beyond traditional approaches. While transformer models like BioBERT (bidirectional encoder representations from transformers for biomedical text mining) have greatly improved prediction accuracy, their black-box nature reduces trust in clinical applications. This paper provides an overview of recent research papers on clinical NLP tasks and reveals the lack of interpretability in conjunction with data standardization. This study presents ICTF (interpretable clinical transformer framework), which uses a dual pipeline approach to compare machine learning methods with BioBERT. This method also incorporates SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations) to provide explanation post-prediction, assisting clinicians in interpreting predictions. The ICTF framework will predict disease labels while providing visualization maps using the MTSamples dataset available on Kaggle.},
      keywords = {Natural Language Processing (NLP), BioBERT, Interpretability, Electronic Health Records (EHR), SHAP, LIME, Disease Prediction},
      month = {May},
      doi = {10.64388/IREV9I11-1717948}
  }