eLDA: Augmenting Topic Modeling with Word Embeddings for Enhanced Coherence and Interpretability

Gobind Kumar Das, Panthadeep Bhattacharjee

doi:10.52783/jisem.v10i21s.3372

PDF

Published: Mar 1, 2025

DOI: https://doi.org/10.52783/jisem.v10i21s.3372

Keywords:

Latent Dirichlet Allocation, Corpora, Word2Vec Embedding, Topic Modeling

Gobind Kumar Das, Panthadeep Bhattacharjee

Abstract

Traditional topic modeling methods like Latent Dirichlet Allocation suffer from several challenges, especially concerning appropriate topic coherence, logical and consistent word groups that follow some semantic relationship, and interpretability. In this work, we propose an enhanced version of LDA, called eLDA, which incorporates Word2Vec embeddings (W2Ve) into LDA. This approach is adopted in order to improve the coherence of individual topics and improve the general topic interpretability by using established metrics such as the coherence score. Traditional LDA and eLDA coherence scores are compared to validate the results. In contrast to the former, we observe that eLDA provides much better interpretability with higher coherence scores, stronger semantic relationships, and improved visualization of topics.

Issue

Vol. 10 No. 21s (2025)

Section

Articles

Journal of Information Systems Engineering and Management

eLDA: Augmenting Topic Modeling with Word Embeddings for Enhanced Coherence and Interpretability

Abstract

Volume 10 (2025)

Volume 9 (2024)

Volume 8 (2023)

Volume 7 (2022)

Volume 6 (2021)

Volume 5 (2020)

Volume 4 (2019)

Volume 3 (2018)

Volume 2 (2017)

Volume 1 (2016)

Journal of Information Systems Engineering and Management

Article Sidebar

Main Article Content

Abstract

Article Details