Almost everything is mechanized in the digital age, and data is shared and kept digitally. Nonetheless, there are a number of circumstances in which the data is not digital, and it may become necessary to extract text from them in order to preserve it digitally. Optical character recognition (OCR) text extraction has been totally transformed by state-of-the-art technologies like text recognition software. As a result, this paper provides an overview of the idea, describes the extraction procedure, and showcases the most recent methods, tools, and research in the field. An overview of the technologies provided by this review will be helpful to other researchers in the field.
Optical Character Recognition (OCR), Digital Image Processing (DIP), Text recognition, Pre-processing, Feature extraction
IRE Journals:
Samender Singh, Mukesh Singla "OCR-Based Text Extraction: A Comprehensive Review" Iconic Research And Engineering Journals Volume 9 Issue 7 2026 Page 1797-1804 https://doi.org/10.64388/IREV9I7-1713624
IEEE:
Samender Singh, Mukesh Singla
"OCR-Based Text Extraction: A Comprehensive Review" Iconic Research And Engineering Journals, 9(7) https://doi.org/10.64388/IREV9I7-1713624