Transformer-Based Image-to-LaTeX Conversion: Improving Mathematical Equation Recognition with Self-Attention


Neeta P. Patil, Yogita D. Mane, Akshay Agrawal, Anil Vasoya, Sanketi Raut

Abstract

Automating the conversion of mathematical equations from images to LaTeX code is difficult due to handwriting variability, formatting inconsistencies, and structural complexity. Traditional CNN and RNN models struggle with long-range dependencies and input variability. To overcome these challenges, we present a transformer-based encoder-decoder architecture that uses self-attention to improve contextual understanding and sequence alignment. The model is trained on the im2latex dataset using token-level cross-entropy loss and sequence-level BLEU-based reinforcement learning, optimized with Adam, and decoded with beam search at inference. Compared to existing models, the proposed model achieves the highest BLEU score, competitive minimum edit distance (MED) performance, and greater robustness to noisy and handwritten inputs, although the Exact Match (EM) score leaves room for improvement. This study demonstrates the efficacy of transformer-based architectures for improving LaTeX conversion accuracy and mathematical document processing.
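To make the described pipeline concrete, the following is a minimal sketch of a transformer-based image-to-LaTeX model with a token-level cross-entropy training step, the first training stage the abstract names. It is not the authors' implementation: the CNN front end, all hyperparameters, and the vocabulary size are illustrative assumptions, and the 2-D positional encoding for image features as well as the BLEU-based reinforcement stage and beam-search decoding are omitted for brevity.

```python
import torch
import torch.nn as nn

class Im2LatexTransformer(nn.Module):
    """Minimal encoder-decoder sketch: CNN image features -> transformer -> LaTeX tokens."""
    def __init__(self, vocab_size, d_model=256, nhead=8, num_layers=4, max_len=512):
        super().__init__()
        # CNN front end turns the grayscale equation image into a grid of d_model-dim features.
        self.backbone = nn.Sequential(
            nn.Conv2d(1, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, d_model, 3, stride=2, padding=1), nn.ReLU(),
        )
        self.tok_emb = nn.Embedding(vocab_size, d_model)
        self.pos_emb = nn.Embedding(max_len, d_model)
        self.transformer = nn.Transformer(
            d_model=d_model, nhead=nhead,
            num_encoder_layers=num_layers, num_decoder_layers=num_layers,
            batch_first=True,
        )
        self.out = nn.Linear(d_model, vocab_size)

    def forward(self, images, tgt_tokens):
        # images: (B, 1, H, W); tgt_tokens: (B, T) shifted-right LaTeX token ids.
        feats = self.backbone(images)                    # (B, d_model, H', W')
        src = feats.flatten(2).transpose(1, 2)           # (B, H'*W', d_model) encoder input
        pos = torch.arange(tgt_tokens.size(1), device=tgt_tokens.device)
        tgt = self.tok_emb(tgt_tokens) + self.pos_emb(pos)
        # Causal mask: each decoder position attends only to earlier LaTeX tokens.
        mask = nn.Transformer.generate_square_subsequent_mask(
            tgt_tokens.size(1)).to(images.device)
        hid = self.transformer(src, tgt, tgt_mask=mask)
        return self.out(hid)                             # (B, T, vocab_size) logits

# One token-level cross-entropy training step with Adam (dummy data; sizes are illustrative).
model = Im2LatexTransformer(vocab_size=500)
optimizer = torch.optim.Adam(model.parameters(), lr=3e-4)
images = torch.randn(2, 1, 64, 256)
tokens = torch.randint(0, 500, (2, 32))
logits = model(images, tokens[:, :-1])                   # predict each next token
loss = nn.functional.cross_entropy(
    logits.reshape(-1, logits.size(-1)), tokens[:, 1:].reshape(-1))
loss.backward()
optimizer.step()
```

In the two-stage scheme the abstract outlines, a model pretrained this way would then be fine-tuned with a sequence-level reward (BLEU against the reference LaTeX) via a policy-gradient method, and decoded with beam search rather than greedy sampling at inference.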
