Managing cloud resource scarcity
Larsson, Lars
Umeå University, Faculty of Science and Technology, Department of Computing Science. ORCID iD: 0000-0001-5860-6695
2020 (English). Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

According to the Infrastructure-as-a-Service conceptualization of cloud computing, Infrastructure Providers offer utility-like pay-as-you-go access to computing resources (e.g., data processing, networks, and storage) to Service Providers, who use those resources to host applications for the benefit of end users. The quantity of resources available to Infrastructure Providers at any given moment is limited, as is the quantity of resources allocated to the applications of each Service Provider.

This thesis examines the management of cloud resource scarcity from the perspectives of both Infrastructure and Service Providers, with the aim of finding ways to ensure that the end user experience is minimally affected.

We consider three main strategies for managing cloud resource scarcity. First, we explore ways to efficiently construct collaborative federations of autonomous and independent Infrastructure Providers, so that local resource scarcity can be masked by drawing on capacity from remote sites. Second, we consider how scheduling both within a cloud site and across a federation can be made aware of restrictions imposed by Service Providers for, e.g., performance or legal reasons. Third, we suggest ways of making applications conscious of resource availability so that they can apply quality elasticity under resource constraints.

The thesis is the culmination of 11 years of work within academia and industry. Based on the unique perspective granted by this long experience, the introductory chapters present a historical view of each subtopic mentioned above. Specifically, they discuss how cloud computing has evolved in conjunction with ways of developing applications to the symbiotic benefit of both, leading to the emergence of cloud-native software that allows Infrastructure Providers to use their infrastructure more efficiently and offer it more affordably while simultaneously granting Service Providers improved availability and performance in cloud-based environments.

Abstract [sv, translated]

In the cloud (cloud computing), from the Infrastructure-as-a-Service perspective, infrastructure providers give service providers access to computing resources (such as data processing, network, and storage capacity) under a pay-as-you-go model, so that the service providers can offer applications for the benefit of end users. For infrastructure providers, the quantity of computing resources available at any given moment is limited. For service providers, the number of such resources allocated to their applications is limited.

The overall goal of this doctoral thesis is to study different ways of managing resource scarcity in the cloud from the perspectives of both infrastructure providers and service providers, so that the end user experience is minimally affected.

Our approaches to managing resource scarcity in the cloud come from three main areas. First, we explore federations of autonomous and independent infrastructure providers, which allow local resource scarcity to be masked by extending onto and using resources from other providers. Next, we investigate how scheduling both within and between cloud sites in a federation can be made aware of restrictions that service providers require for, e.g., performance or legal reasons. Finally, we propose various ways in which applications can be made aware of current resource availability and made quality-elastic.

The thesis is the culmination of 11 years of work in academia and industry. Based on the unique opportunities such experience provides, the introductory chapters also present a historical perspective on these areas. In those chapters, we discuss how the cloud has evolved together with the way applications are delivered to end customers, and how a symbiotic relationship has arisen between the two. The result is cloud-native software, which enables infrastructure providers to use their infrastructure more efficiently and offer access to it more cost-effectively, and gives service providers increased availability and performance in cloud-based environments.

Place, publisher, year, edition, pages
Umeå: Umeå universitet, 2020. p. 109
Series
Report / UMINF, ISSN 0348-0542 ; 20.07
Keywords [en]
cloud computing, edge computing, kubernetes, federation, scheduling, quality elasticity
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:umu:diva-174509
ISBN: 978-91-7855-328-0 (print)
ISBN: 978-91-7855-329-7 (electronic)
OAI: oai:DiVA.org:umu-174509
DiVA, id: diva2:1461167
Public defence
2020-10-16, Aula Biologica, Biologihuset, Umeå, 10:00 (English)
Available from: 2020-09-25 Created: 2020-08-26 Last updated: 2020-10-20 Bibliographically approved
List of papers
1. Interfaces for Placement, Migration, and Monitoring of Virtual Machines in Federated Clouds
2009 (English). In: Proceedings of the Eighth International Conference on Grid and Cooperative Computing (GCC 2009) / [ed] Juan E. Guerrero, IEEE Computer Society Press, 2009, p. 253-260. Conference paper, Published paper (Refereed)
Abstract [en]

Current cloud computing infrastructure offerings are lacking in interoperability, which is a hindrance to the advancement and adoption of the cloud computing paradigm. As clouds are made interoperable, federations of clouds may be formed. From the user's point of view, such federations are not burdened by vendor lock-in, and they open up business possibilities in which a marketplace for cloud computing infrastructure can be formed. Federated clouds require unified management interfaces regarding the virtual machines (VMs) that comprise the services running in the cloud federation. Standardization efforts for the required management interfaces have so far focused on the definition of description formats for VMs and the control of already deployed VMs. We propose technology-neutral interfaces and architectural additions for handling placement, migration, and monitoring of VMs in federated cloud environments, the latter as an extension of current monitoring architectures used in grid computing. The interfaces presented adhere to the general requirements of scalability, efficiency, and security in addition to specific requirements related to the particular issues of interoperability and business relationships between competing cloud computing infrastructure providers. In addition, they may be used equally well locally and remotely, creating a layer of abstraction that simplifies management of virtualized service components.

Place, publisher, year, edition, pages
IEEE Computer Society Press, 2009
National Category
Computer Sciences
Identifiers
urn:nbn:se:umu:diva-29903 (URN)
10.1109/GCC.2009.36 (DOI)
2-s2.0-72349093866 (Scopus ID)
978-0-7695-3766-5 (ISBN)
Conference
Eighth International Conference on Grid and Cooperative Computing (GCC 09), 27-29 Aug 2009, Lanzhou, Gansu
Available from: 2009-11-26 Created: 2009-11-26 Last updated: 2023-03-24 Bibliographically approved
2. Scheduling and Monitoring of Internally Structured Services in Cloud Federations
2011 (English). In: 2011 IEEE Symposium on Computers and Communications (ISCC), IEEE, 2011, p. 173-178. Conference paper, Published paper (Refereed)
Abstract [en]

Cloud infrastructure providers may form Cloud federations to cope with peaks in resource demand and to make large-scale service management simpler for service providers. To realize Cloud federations, a number of technical and managerial difficulties need to be solved. We present ongoing work addressing three related key management topics, namely, specification, scheduling, and monitoring of services. Service providers need to be able to influence how their resources are placed in Cloud federations, as federations may cross national borders or include companies in direct competition with the service provider. Based on related work in the RESERVOIR project, we propose a way to define service structure and placement restrictions using hierarchical directed acyclic graphs. We define a model for scheduling in Cloud federations that abides by the specified placement constraints and minimizes the risk of violating Service-Level Agreements. We present a heuristic that helps the model determine which virtual machines (VMs) are suitable candidates for migration. To aid the scheduler, and to provide unified data to service providers, we also propose a monitoring data distribution architecture that introduces cross-site compatibility by means of semantic metadata annotations.

Place, publisher, year, edition, pages
IEEE, 2011
Series
IEEE Symposium on Computers & Communications (ISCC), ISSN 1530-1346
Keywords
cloud computing, scheduling, monitoring, service structure
National Category
Computer Sciences
Research subject
business data processing
Identifiers
urn:nbn:se:umu:diva-40268 (URN)
10.1109/ISCC.2011.5984012 (DOI)
000298614900034 ()
2-s2.0-80052742050 (Scopus ID)
978-1-4577-0678-3 (ISBN)
978-1-4577-0680-6 (ISBN)
Conference
16th IEEE Symposium on Computers and Communications (ISCC) and First International Workshop on Management of Cloud Systems (MoCS 2011), Corfu, Greece, June 28-July 1, 2011
Available from: 2011-02-21 Created: 2011-02-21 Last updated: 2023-03-24 Bibliographically approved
3. Modeling and Placement of Cloud Services with Internal Structure
2016 (English). In: IEEE Transactions on Cloud Computing, ISSN 2168-7161, Vol. 4, no 4, p. 429-439. Article in journal (Refereed) Published
Abstract [en]

Virtual machine placement is the process of mapping virtual machines to available physical hosts within a datacenter or on a remote datacenter in a cloud federation. Normally, service owners cannot influence the placement of service components beyond choosing datacenter provider and deployment zone at that provider. For some services, however, this lack of influence is a hindrance to cloud adoption. Examples include services that require specific geographical deployment (e.g., due to legislation) or that require redundancy by avoiding co-location of critical components. We present an approach for service owners to influence placement of their service components by explicitly specifying service structure, component relationships, and placement constraints between components. We show how the structure and constraints can be expressed and subsequently formulated as constraints that can be used in placement of virtual machines in the cloud. We use an integer linear programming scheduling approach to illustrate the approach, show the corresponding mathematical formulation of the model, and evaluate it using a large set of simulated input. Our experimental evaluation confirms the feasibility of the model and shows how varying amounts of placement constraints and datacenter background load affect the possibility for a solver to find a solution satisfying all constraints within a certain time frame. Our experiments indicate that the number of constraints affects the ability to find a solution to a higher degree than background load, and that for a high number of hosts with low capacity, component affinity is the dominating factor affecting the possibility to find a solution.
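
To make the idea of placement constraints concrete, the following is a minimal sketch and not the paper's model. It replaces the integer linear programming formulation with a plain exhaustive search, and the function name `place`, its parameters, and the component and host names are all hypothetical. It shows how capacity, affinity (co-location), and anti-affinity constraints jointly restrict which mappings of components to hosts are acceptable.

```python
from itertools import product


def place(components, hosts, demand, capacity, colocate=(), separate=()):
    """Return the first component -> host mapping that satisfies every
    constraint, or None if no such mapping exists.

    Exhaustive search stands in for an ILP solver here (an assumption
    made for brevity); it is only practical for very small inputs.
    """
    for assignment in product(hosts, repeat=len(components)):
        mapping = dict(zip(components, assignment))

        # Capacity constraint: total demand on a host must not exceed it.
        load = {h: 0 for h in hosts}
        for comp, host in mapping.items():
            load[host] += demand[comp]
        if any(load[h] > capacity[h] for h in hosts):
            continue

        # Affinity constraint: each listed pair must share a host.
        if any(mapping[a] != mapping[b] for a, b in colocate):
            continue

        # Anti-affinity constraint: each listed pair must use different hosts.
        if any(mapping[a] == mapping[b] for a, b in separate):
            continue

        return mapping
    return None


if __name__ == "__main__":
    result = place(
        components=["web", "db1", "db2"],
        hosts=["h1", "h2"],
        demand={"web": 1, "db1": 2, "db2": 2},
        capacity={"h1": 3, "h2": 3},
        colocate=[("web", "db1")],   # keep web next to its database
        separate=[("db1", "db2")],   # replicas must not share a host
    )
    print(result)
```

Adding a constraint can only shrink the set of acceptable mappings, which is in line with the abstract's observation that the number of constraints strongly affects whether a solution can be found.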

Place, publisher, year, edition, pages
IEEE Computer Society, 2016
Keywords
service management, service structure, placement, affinity, collocation, scheduling, integer linear programming, cloud computing
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:umu:diva-80125 (URN)
10.1109/TCC.2014.2362120 (DOI)
000390560200005 ()
2-s2.0-85027575276 (Scopus ID)
Funder
eSSENCE - An eScience Collaboration
Available from: 2013-09-10 Created: 2013-09-10 Last updated: 2023-03-24 Bibliographically approved
4. Quality-Elasticity: Improved Resource Utilization, Throughput, and Response Times Via Adjusting Output Quality to Current Operating Conditions
2019 (English). In: 2019 IEEE International Conference on Autonomic Computing (ICAC), IEEE, 2019, p. 52-62. Conference paper, Published paper (Refereed)
Abstract [en]

This work addresses two related problems for on-line services, namely poor resource utilization during regular operating conditions, and low throughput, long response times, or poor performance during periods of high system load. To address these problems, we introduce our notion of quality-elasticity as a manner of dynamically adapting response qualities from software services along a fine-grained spectrum. When resources are abundant, response quality can be increased, and when resources are scarce, responses are delivered at a lower quality to prioritize throughput and response times. We present an example of how a complex online shopping site can be made quality-elastic. Experiments show that, compared to the state of the art, improvements in throughput (57% more served queries), lowered response times (an 8-fold reduction for 95th percentile responses), and an estimated 40% profitability increase can be made using our quality-elastic approach. When resources are abundant, our approach may achieve upwards of twice as high resource utilization as prior work in this field.
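
As a rough illustration of the quality-elasticity idea, and not the mechanism evaluated in the paper, the sketch below picks a response quality from the resource headroom currently available. The discrete levels, their relative costs, and the names `QUALITY_LEVELS` and `choose_quality` are assumptions made for the example; the paper describes a finer-grained spectrum.

```python
# Hypothetical quality levels, ordered from highest to lowest quality.
# Each cost is an assumed relative amount of resources needed per response.
QUALITY_LEVELS = [
    ("full", 1.0),      # e.g. personalised recommendations included
    ("reduced", 0.5),   # e.g. cached or generic recommendations
    ("minimal", 0.1),   # e.g. core content only
]


def choose_quality(headroom, levels=QUALITY_LEVELS):
    """Return the highest-quality level whose cost fits in the headroom.

    headroom is the spare capacity available per response, in the same
    (assumed) unit as the level costs. If nothing fits, the lowest
    quality is served so that the request is still answered.
    """
    for name, cost in levels:
        if cost <= headroom:
            return name
    return levels[-1][0]


if __name__ == "__main__":
    for headroom in (1.2, 0.6, 0.05):
        print(headroom, choose_quality(headroom))
```

When resources are abundant the function returns the most expensive level, and as headroom shrinks it degrades the response rather than rejecting the request, which is the trade-off the abstract describes.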

Place, publisher, year, edition, pages
IEEE, 2019
Series
Proceedings of the International Conference on Autonomic Computing, ISSN 2474-0764, E-ISSN 2474-0756
Keywords
cloud computing, service delivery, adaptive software, brownout
National Category
Computer Sciences
Identifiers
urn:nbn:se:umu:diva-174392 (URN)
10.1109/ICAC.2019.00017 (DOI)
000589508500007 ()
2-s2.0-85073231355 (Scopus ID)
978-1-7281-2411-7 (ISBN)
Conference
IEEE International Conference on Autonomic Computing (ICAC), Umeå, Sweden, June 16-20, 2019
Funder
eSSENCE - An eScience Collaboration
Swedish Research Council, C0590801
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Knut and Alice Wallenberg Foundation
Available from: 2020-08-21 Created: 2020-08-21 Last updated: 2020-12-29 Bibliographically approved
5. Impact of etcd deployment on Kubernetes, Istio, and application performance
2020 (English). In: Software, practice & experience, ISSN 0038-0644, E-ISSN 1097-024X, Vol. 50, no 10, p. 1986-2007. Article in journal (Refereed) Published
Abstract [en]

This experience article describes lessons learned as we conducted experiments in a Kubernetes-based environment, the most notable of which was that the performance of both the Kubernetes control plane and the deployed application depends strongly and in unexpected ways on the performance of the etcd database. The article contains (a) detailed descriptions of how networking with and without Istio works in Kubernetes, based on the Flannel Container Networking Interface (CNI) provider in VXLAN mode with IP Virtual Server (IPVS)-backed Kubernetes Services, (b) a comprehensive discussion about how to conduct load and performance testing using a closed-loop workload generator, and (c) an open source experiment framework useful for executing experiments in a shared cloud environment and exploring the resulting data. It also shows that statistical analysis may reveal the data resulting from such experiments to be misleading even when careful preparations are made, and that nondeterministic behavior stemming from etcd can affect both the platform as a whole and the deployed application. Finally, it is demonstrated that using high-performance backing storage for etcd can reduce the occurrence of such nondeterministic behaviors by a statistically significant (P < .05) margin. The implication of this experience article is that systems researchers studying the performance of applications deployed on Kubernetes cannot simply consider their specific application to be under test. Instead, the particularities of the underlying Kubernetes and cloud platform must be taken into account, in particular because their performance can impact that of etcd.

Place, publisher, year, edition, pages
John Wiley & Sons, 2020
Keywords
cloud computing, distributed systems, etcd, Kubernetes, performance
National Category
Computer Sciences
Identifiers
urn:nbn:se:umu:diva-174336 (URN)
10.1002/spe.2885 (DOI)
000556599500001 ()
2-s2.0-85089109897 (Scopus ID)
Available from: 2020-08-20 Created: 2020-08-20 Last updated: 2023-03-23 Bibliographically approved
6. Decentralized Kubernetes Federation Control Plane
(English). Manuscript (preprint) (Other academic)
Abstract [en]

This position paper presents our vision for a distributed, decentralized Kubernetes federation control plane. The goal is to support federations consisting of thousands of Kubernetes clusters, in order to support next-generation edge cloud use-cases. Our review of the literature and our experience with current centralized, state-of-the-art Kubernetes federation controllers show that they are unable to scale to a sufficient size and that centralization constitutes an unacceptable single point of failure. Our proposed system maintains cluster autonomy, allows clusters to collaboratively handle error conditions, and scales to support edge cloud use-cases. Our approach is based on a shared database of conflict-free replicated data types (CRDTs), shared among all clusters in the federation, and algorithms that make use of the data.
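
For readers unfamiliar with CRDTs, the sketch below shows a grow-only counter (G-Counter), a standard textbook CRDT. The abstract does not say which CRDT types the proposed system uses, so this is only a generic illustration, and the cluster names are hypothetical. It shows how replicas can be updated independently and still converge once they exchange state, without a central coordinator.

```python
class GCounter:
    """State-based grow-only counter, one slot per replica."""

    def __init__(self, node_id):
        self.node_id = node_id
        self.counts = {}  # replica id -> increments seen from that replica

    def increment(self, amount=1):
        # A replica only ever modifies its own slot.
        self.counts[self.node_id] = self.counts.get(self.node_id, 0) + amount

    def value(self):
        return sum(self.counts.values())

    def merge(self, other):
        # Element-wise maximum is commutative, associative, and idempotent,
        # so replicas converge regardless of merge order or repetition.
        for node, count in other.counts.items():
            self.counts[node] = max(self.counts.get(node, 0), count)


if __name__ == "__main__":
    a = GCounter("cluster-a")  # hypothetical cluster names
    b = GCounter("cluster-b")
    a.increment(2)
    b.increment(3)
    a.merge(b)
    b.merge(a)
    print(a.value(), b.value())
```

Because merging needs no coordination, each cluster keeps accepting local updates while disconnected, which matches the abstract's emphasis on cluster autonomy.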

National Category
Computer Sciences
Identifiers
urn:nbn:se:umu:diva-174412 (URN)
Available from: 2020-08-24 Created: 2020-08-24 Last updated: 2020-08-26
7. Towards Soft Circuit Breaking in Service Meshes via Application-agnostic Caching
(English). Manuscript (preprint) (Other academic)
National Category
Computer Sciences
Identifiers
urn:nbn:se:umu:diva-174446 (URN)
Available from: 2020-08-25 Created: 2020-08-25 Last updated: 2020-08-26

Open Access in DiVA

fulltext (719 kB), 1159 downloads
File information
File name: FULLTEXT01.pdf
File size: 719 kB
Checksum (SHA-512): 13f2a5332d76bd846ebe8e2c22e1679761a916b76a1df697e1b8d704faed6af301d879d8547d9e624298d2af72892bb2ad7d85245bf3f4e54d5565e3d1901207
Type: fulltext
Mimetype: application/pdf
spikblad (161 kB), 140 downloads
File information
File name: FULLTEXT02.pdf
File size: 161 kB
Checksum (SHA-512): 0ab7fbd9bdda6c6607e91f19cd0c7c24594e45ef83e2dde6ee0c8392c63e52a63766730c4de2c9a7a58bf6ca5edf909f9f9f26d0e35809fcbc261f3300fc3938
Type: spikblad
Mimetype: application/pdf

Authority records

Larsson, Lars

Total: 1302 downloads
The number of downloads is the sum of all downloads of full texts. It may include, e.g., previous versions that are no longer available.
Total: 2572 hits