Bridging Linguistic And Structural Gaps In Marathi Government Document Translation: A Survey Of Modern Approaches

Uncategorized

Authors: Manasi Waghe, Danish Chandargi, Mohammad Aamir Rayyan, Raviraj Joshi, Dr. A.R. Deshpande

Abstract: The translation of government and legal documents from Marathi to English poses unique challenges due to linguis- tic complexity, domain-specific terminology, structural richness, and low-resource constraints. General-purpose machine translation systems often fail to maintain semantic fidelity, formatting, and terminological consistency required for administrative and legal texts. This survey explores recent advances in multilingual machine translation, domain adaptation techniques, OCR-driven document understanding, Marathi-specific NLP resources, and terminology- constrained translation methods. We examine the state-of-the-art in robust Marathi-to-English translation systems and highlight critical gaps, focusing on integrating layout-aware models and domain- specific constraints to improve translation quality and reliability for official government documentation.

× How can I help you?