EduRAG: A Multi-Format Retrieval-Augmented Generation Assistant for Academic Question Answering
  • Author(s): M. Syam Prakash; G. V. Charan Kumar; D. Gayathri
  • Paper ID: 1716796
  • Page: 2391-2392
  • Published Date: 23-04-2026
  • Published In: Iconic Research And Engineering Journals
  • Publisher: IRE Journals
  • e-ISSN: 2456-8880
  • Volume/Issue: Volume 9 Issue 10 April-2026
Abstract

Large Language Models (LLMs) can produce fluent responses, but they often provide unsupported information when answers rely on specific documents. This paper introduces EduRAG, a Retrieval-Augmented Generation (RAG) assistant designed for answering student questions based on uploaded academic materials. The proposed system combines multiple file formats, semantic chunking, dense embedding generation, vector similarity retrieval, and context-based response synthesis. EduRAG supports various educational formats, including PDF, DOCX, PPTX, TXT/MD, CSV/XLSX, and image-based text through OCR. This allows practical use across different classroom resources. The backend is built using Flask and includes APIs for health monitoring, document upload, indexing, querying, file listing, and deletion. Sentence-Transformers are used for creating semantic embeddings, and FAISS provides efficient nearest-neighbor retrieval. For generation, the architecture supports both cloud-hosted and local LLM options through configurable providers. Index persistence and metadata storage allow for reusable sessions and quicker follow-up queries. Experimental results on a mix of academic documents show that EduRAG enhances answer relevance and grounding compared to direct LLM prompting without retrieval. It maintains acceptable latency for interactive educational use. The system shows that retrieval-based prompting significantly reduces the risk of generating false information and increases trustworthiness in academic assistants. EduRAG offers a low-cost, flexible foundation for institutional learning support and can be further improved with reranking, citation tracing, and hybrid retrieval methods.

Keywords

Retrieval-Augmented Generation, Educational Chatbot, FAISS, Semantic Search, Document Question Answering, Large Language Models.

Citations

IRE Journals:
M. Syam Prakash, G. V. Charan Kumar, D. Gayathri "EduRAG: A Multi-Format Retrieval-Augmented Generation Assistant for Academic Question Answering" Iconic Research And Engineering Journals Volume 9 Issue 10 2026 Page 2391-2392 https://doi.org/10.64388/IREV9I10-1716796

IEEE:
M. Syam Prakash, G. V. Charan Kumar, D. Gayathri "EduRAG: A Multi-Format Retrieval-Augmented Generation Assistant for Academic Question Answering" Iconic Research And Engineering Journals, 9(10) https://doi.org/10.64388/IREV9I10-1716796