A2L2: An Application Aware Flexible HPC Scheduling Model for Low-Latency Allocation
2015 (English)In: VTDC '15: Proceedings of the 8th International Workshop on Virtualization Technologies in Distributed Computing, ACM Digital Library, 2015, , 11-19 p.11-19 p.Conference paper (Refereed)
High-performance computing (HPC) is focused on providing large-scale compute capacity to scientific applications. HPC schedulers tend to be optimized for large parallel batch jobs and, as such, often overlook the requirements of other scientific applications. In this work, we propose a cloud-inspired HPC scheduling model that aims to capture application performance and requirement models (Application Aware - A2) and dynamically resize malleable application resource allocations to be able to support applications with critical performance or deadline requirements. (Low Latency allocation - L2). The proposed model incorporates measures to improve data-intensive applications performance on HPC systems and is derived from a set of cloud scheduling techniques that are identified as applicable in HPC environments. The model places special focus on dynamically malleable applications; data-intensive applications that support dynamic resource allocation without incurring severe performance penalties; which are proposed for fine-grained back-filling and dynamic resource allocation control without job preemption.
Place, publisher, year, edition, pages
ACM Digital Library, 2015. , 11-19 p.11-19 p.
Scheduling, job, HPC, malleable, applications, low-latency
Research subject Computing Science
IdentifiersURN: urn:nbn:se:umu:diva-110526DOI: 10.1145/2755979.2755983ISBN: 978-1-4503-3573-7OAI: oai:DiVA.org:umu-110526DiVA: diva2:862490
Proceedings of the 8th International Workshop on Virtualization Technologies in Distributed Computing (VTDC), Portland, Oregon on June 15-19, 2015
FundereSSENCE - An eScience CollaborationEU, FP7, Seventh Framework Programme, 610711Swedish Research Council, C0590801