SLO-First Autoscaling for Multi-Tenant Microservices: A Control-Theoretic Approach to P95/P99 Latency
Abstract
Maintaining strict service-level objectives (SLOs) at the tail of latency distributions (P95/P99) remains challenging in large-scale, multi-tenant cloud platforms. Conventional autoscalers scale workloads on resource-utilization metrics such as CPU or memory, so they react only after latency violations occur and cannot anticipate bursty demand or cross-tenant interference. This work presents a control-theoretic resource-management framework that proactively enforces latency SLOs across microservices by modeling each service as a queueing system. The framework continuously infers service-time distributions and backlog states to predict future tail latency under varying load, then applies model predictive control to allocate resources before violations manifest. The design incorporates multi-tenant fairness mechanisms that isolate noisy tenants while preserving global cost efficiency through constrained optimization over the prediction horizon. Evaluation on synthetic burst traces and production-like workloads demonstrates a 73% reduction in P95 violations, a 68% reduction in P99 violations, 2.3× faster scaling convergence than reactive methods, and 41% resource cost savings relative to static over-provisioning. The framework establishes a principled foundation for latency-driven, cost-aware resource management in multi-tenant cloud environments.
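To make the core idea concrete before the full treatment, the following is a minimal sketch of a queueing-model-based, SLO-first scaling plan. It is not the paper's implementation: it assumes a deliberately simple M/M/1-per-replica approximation (each replica serves an equal share of arrivals with exponential service times), under which the P95 response time has a closed form, and it searches over replica counts at each step of a hypothetical load forecast. The function names, parameters, and workload numbers are illustrative assumptions.

```python
import math

def p95_latency(arrival_rate, service_rate, replicas):
    """Predicted P95 response time under an M/M/1-per-replica
    approximation: each replica sees arrival_rate / replicas."""
    per_replica = arrival_rate / replicas
    if per_replica >= service_rate:
        return math.inf  # unstable: the queue grows without bound
    # For M/M/1, response time is exponential with rate (mu - lambda),
    # so the p-quantile is -ln(1 - p) / (mu - lambda).
    return -math.log(1 - 0.95) / (service_rate - per_replica)

def plan_replicas(forecast, service_rate, slo, max_replicas=64):
    """For each step of a load forecast (req/s), choose the fewest
    replicas whose predicted P95 stays within the latency SLO."""
    plan = []
    for lam in forecast:
        for c in range(1, max_replicas + 1):
            if p95_latency(lam, service_rate, c) <= slo:
                plan.append(c)
                break
        else:
            plan.append(max_replicas)  # capped; SLO may be violated
    return plan

# Illustrative numbers: 50 req/s per replica, 200 ms P95 SLO,
# and a bursty three-step load forecast (requests/sec).
print(plan_replicas([80.0, 300.0, 120.0], service_rate=50.0, slo=0.2))
# → [3, 9, 4]: capacity is raised ahead of the burst, then released.
```

The contrast with a utilization-based autoscaler is the point of the sketch: the decision variable is predicted tail latency over a forecast horizon, not observed CPU, so scaling happens before the burst arrives rather than after queues have already formed. The full framework replaces the closed-form quantile with inferred service-time distributions and backlog states, and the per-step greedy search with a constrained MPC optimization.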