A Comprehensive Survey of Streaming Large Language Models: Architectures, Applications, and Future Directions
Abstract
This article examines the current state of streaming Large Language Models (LLMs), synthesizing research across technical implementations, application domains, and performance optimization techniques. It systematically reviews the transition from batch to streaming architectures, analyzes enabling technologies, and identifies emerging research directions in this rapidly evolving field. The use of LLMs in streaming systems marks a paradigm shift in how organizations process and derive value from continuously produced information. The article discusses the move from classical batch processing to real-time, streaming-based deployment, covering the technical foundations of such implementations along with their strengths and challenges. It examines how these systems allow organizations to analyze logs, conversations, and transactional events on the fly, providing immediately actionable insights in areas such as customer support, financial compliance, cybersecurity, e-commerce, and content moderation. By analyzing enabling technologies such as model distillation, hybrid deployment architectures, and specialized infrastructure components, the article sheds light on how organizations address the challenges inherent in latency management, resource optimization, and scalability. Finally, it discusses possible future directions, including adaptive learning capabilities, multi-modal integration, and improved explainability mechanisms, offering a research roadmap for advancing streaming LLM technologies and their organizational impact on real-time intelligence and decision support.