Normalizing Sanskrit Texts: An Approach Towards Enhanced Accessibility and Precision

Sabnam Kumari

doi:10.52783/jisem.v10i33s.5742


Published: Apr 8, 2025


Keywords:

Text Normalisation; Sanskrit Text; Accuracy; Language; Tokenization; NSW.

Sabnam Kumari, Amita Malik

Abstract

Sanskrit text normalisation resolves inconsistencies in spelling, morphology, and syntax to improve computational text processing and the online availability of ancient texts. This research develops a normalisation pipeline to increase the accuracy of NLP applications and Text-to-Speech (TTS) systems. In our approach, reducing non-standard words (NSW) improves searchability and comprehension. The project supports the digital humanities by making Sanskrit texts more available for linguistic research and historical studies. The results indicate that careful standardisation of the text considerably increases the efficiency and accuracy of computational text processing, and that it improves the ability to search, analyse, and understand ancient Sanskrit works digitally. The study shows that eliminating NSW is a necessary step to ensure that the input text follows a standard language form, which in turn improves performance in NLP tasks and speech synthesis. The model achieves an accuracy of 93%, precision of 92%, recall of 91%, an F1 score of 91%, and specificity of 94%.
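
For readers less familiar with the five reported measures, the following minimal Python sketch shows how they are conventionally derived from a binary confusion matrix. It assumes the standard textbook definitions and that no denominator is zero; the abstract does not state how the authors computed their figures. The `Confusion` class, the `metrics` function, and the counts in the usage line are hypothetical and are not taken from the study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Confusion:
    """Binary confusion counts (hypothetical container, not from the paper)."""
    tp: int  # true positives
    fp: int  # false positives
    tn: int  # true negatives
    fn: int  # false negatives


def metrics(c: Confusion) -> dict[str, float]:
    """Return the standard evaluation measures as fractions in [0, 1].

    Assumes every denominator is non-zero.
    """
    precision = c.tp / (c.tp + c.fp)
    recall = c.tp / (c.tp + c.fn)
    return {
        "accuracy": (c.tp + c.tn) / (c.tp + c.fp + c.tn + c.fn),
        "precision": precision,
        "recall": recall,
        # F1 is the harmonic mean of precision and recall.
        "f1": 2 * precision * recall / (precision + recall),
        "specificity": c.tn / (c.tn + c.fp),
    }


# Illustrative counts only; these are not the study's data.
example = metrics(Confusion(tp=8, fp=2, tn=6, fn=4))
```

Because F1 is the harmonic mean of precision and recall, it always lies between the two, which is consistent with the reported 92% precision, 91% recall, and 91% F1 score.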

Issue

Vol. 10 No. 33s (2025)

Section

Articles

Journal of Information Systems Engineering and Management


