Umeå University's logo

umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Taming cold starts: proactive serverless scheduling with model predictive control
Umeå University, Faculty of Science and Technology, Department of Computing Science. (Autonomous Distributed Systems Lab)ORCID iD: 0000-0002-9156-3364
Umeå University, Faculty of Science and Technology, Department of Computing Science.ORCID iD: 0000-0002-9842-7840
Umeå University, Faculty of Science and Technology, Department of Computing Science. Elastisys AB, Umeå, Sweden. (Autonomous Distributed Systems Lab)ORCID iD: 0000-0002-2633-6798
2025 (English)In: Proceedings: 2025 IEEE 33rd International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems. MASCOTS: Paris, France 21-23 October 2025, IEEE, 2025, p. 1-8Conference paper, Oral presentation only (Refereed)
Abstract [en]

Serverless computing has transformed cloud application deployment by introducing a fine-grained, event-driven execution model that abstracts away infrastructure management. Its on-demand nature makes it especially appealing for latency-sensitive and bursty workloads. However, the cold start problem, i.e., where the platform incurs significant delay when provisioning new containers, remains the Achilles' heel of such platforms. 

This paper presents a predictive serverless scheduling framework based on Model Predictive Control to proactively mitigate cold starts, thereby improving end-to-end response time. By forecasting future invocations, the controller jointly optimizes container prewarming and request dispatching, improving latency while minimizing resource overhead.

We implement our approach on Apache OpenWhisk, deployed on a Kubernetes-based testbed. Experimental results using real-world function traces and synthetic workloads demonstrate that our method significantly outperforms state-of-the-art baselines, achieving up to 85% lower tail latency and a 34% reduction in resource usage.

Place, publisher, year, edition, pages
IEEE, 2025. p. 1-8
Series
IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems, ISSN 1526-7539, E-ISSN 2375-0227
Keywords [en]
Serverless, Cloud Computing, Orchestration, Cold Start, Function-as-a-service, Model Predictive Control, Prediction, Request Shaping
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
URN: urn:nbn:se:umu:diva-243593DOI: 10.1109/MASCOTS67699.2025.11283271Scopus ID: 2-s2.0-105031750750ISBN: 979-8-3315-5761-4 (print)ISBN: 979-8-3315-5760-7 (electronic)OAI: oai:DiVA.org:umu-243593DiVA, id: diva2:1994897
Conference
MASCOTS 2025: 33rd International Symposium on the Modeling, Analysis, and Simulation of Computer and Telecommunication System, Paris, France, October 21-23, 2025
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)EU, Horizon Europe, 101092711Available from: 2025-09-03 Created: 2025-09-03 Last updated: 2026-04-02Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Nguyen, ChanhBhuyan, MonowarElmroth, Erik

Search in DiVA

By author/editor
Nguyen, ChanhBhuyan, MonowarElmroth, Erik
By organisation
Department of Computing Science
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 335 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf