Authors: Ms. Usha Dhankar, Ms. Preeti Kalra, Ms. Agrima Samanotra, Mr. Aaditya Shriv Astava
Abstract: This research explores the application of Retrieval-Augmented Generation (RAG) to information extraction and question answering over scanned PDF documents processed with Optical Character Recognition (OCR). By integrating a retrieval mechanism with a generative language model, we present a novel framework that interprets noisy, unstructured OCR output and supports contextual interaction through natural language queries [1][2]. The approach bridges the gap between image-based document archives and intelligent systems, improving document accessibility in fields such as legal, academic, and archival research.
DOI: https://doi.org/10.5281/zenodo.16445198