Application of Python Programming Language in PDF-TEXT Based Information Extraction
  • Author(s): Benita A. Chinemerem; Donatus. O. Njoku; Taiwo Ahmed O.
  • Paper ID: 1703529
  • Page: 160-167
  • Published Date: 26-09-2022
  • Published In: Iconic Research And Engineering Journals
  • Publisher: IRE Journals
  • e-ISSN: 2456-8880
  • Volume/Issue: Volume 6 Issue 3 September-2022
Abstract

Over the decades, information extraction has become a vital aspect of research, allowing millions of researchers to access only what is important to them amid the enormous volume of available information. The motivation for this work was to reduce the time pressure every researcher faces in meeting stipulated deadlines. Following a careful review of existing studies, a system framework was developed to match keywords during word extraction. The research followed the Structured Systems Analysis and Design Method (SSADM) and used Python libraries to mine text data, known as keywords, from a structured file and to parse the underlying binary data. The extracted data are encrypted via parallel encryption. The system was tested and sample results were obtained.
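
The keyword-matching step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, which this page does not show. It assumes the text has already been extracted from the PDF by a library of the reader's choice, and it uses only the Python standard library. The function name match_keywords and the sample text are hypothetical, chosen for illustration.

```python
import re
from collections import Counter


def match_keywords(text, keywords):
    """Count whole-word, case-insensitive occurrences of each keyword.

    Hypothetical helper: `text` is assumed to be plain text that was
    already extracted from a PDF by some other tool.
    """
    # Lowercase the text and split it into alphanumeric tokens.
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    counts = Counter(tokens)
    # A Counter returns 0 for tokens it has never seen.
    return {kw: counts[kw.lower()] for kw in keywords}


# Illustrative sample text, not taken from the paper.
sample_text = (
    "Information extraction lets researchers reach the information "
    "they need. Extraction of keywords saves time."
)
result = match_keywords(sample_text, ["information", "extraction", "encryption"])
print(result)
```

Counting tokens once with a Counter keeps the lookup cost per keyword constant, which matters when the same document is checked against a long keyword list. Multi-word keywords would need a different matching strategy and are outside this sketch.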

Keywords

Information, Extraction, Text data, Binary data, Encryption, Python

Citations

IRE Journals:
Benita A. Chinemerem, Donatus. O. Njoku, Taiwo Ahmed O. "Application of Python Programming Language in PDF-TEXT Based Information Extraction" Iconic Research And Engineering Journals Volume 6 Issue 3 2022 Page 160-167

IEEE:
Benita A. Chinemerem, Donatus. O. Njoku, Taiwo Ahmed O. "Application of Python Programming Language in PDF-TEXT Based Information Extraction" Iconic Research And Engineering Journals, 6(3)