Exploration of Big Data Pipeline Solutions for Business Analysis: A Comprehensive Survey

Pallavi G B

doi:10.52783/jisem.v10i51s.10439

PDF

Published: May 30, 2025

DOI: https://doi.org/10.52783/jisem.v10i51s.10439

Keywords:

big data, cloud computing, machine learning, data analytics.

Pallavi G B, Latha N R, Shyamala G, Kalyana Kiran B. S. Goli, D Revanth, Gamana Yeluri R, Harika N, Keerthi P Reddy

Abstract

The sudden burst of data has resulted in the emergence of many big data frameworks such as Hadoop, Flink, and cloud-native platforms including Azure, AWS, and Google Cloud. Although these technologies facilitate efficient processing, storage, and analytics for business analysis, organizations are faced with the dilemma of selecting the appropriate framework because of differences in scalability, automation, and performance. Managed cloud platforms focus on smooth integration and operational efficiency, but companies receive no direct guidance on how to select the optimal pipeline for a given workload, especially when working with real-world, heterogeneous datasets such as Yelp. This research delves into the challenges of big data processing, examining primary inefficiencies and architectural trade-offs to offer insights into workflow optimization for data, business analysis, and decision-making. Furthermore, this work not only compared the platforms but also offers some guidance on how to choose the best processing pipeline specific to a complex business dataset like Yelp.

Issue

Vol. 10 No. 51s (2025)

Section

Articles

Journal of Information Systems Engineering and Management

Exploration of Big Data Pipeline Solutions for Business Analysis: A Comprehensive Survey

Abstract

Volume 11 (2026)

Volume 10 (2025)

Volume 9 (2024)

Volume 8 (2023)

Volume 7 (2022)

Volume 6 (2021)

Volume 5 (2020)

Volume 4 (2019)

Volume 3 (2018)

Volume 2 (2017)

Volume 1 (2016)

Journal of Information Systems Engineering and Management

Article Sidebar

Main Article Content

Abstract

Article Details