Autonomous cloud resource provisioning: accounting, allocation, and performance control
Umeå University, Faculty of Science and Technology, Department of Computing Science.
2015 (English) Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The emergence of large-scale Internet services coupled with the evolution of computing technologies such as distributed systems, parallel computing, utility computing, grid, and virtualization has fueled a movement toward a new resource provisioning paradigm called cloud computing. The main appeal of cloud computing lies in its ability to provide a shared pool of infinitely scalable computing resources for cloud services, which can be quickly provisioned and released on-demand with minimal effort. The rapidly growing interest in cloud computing from both the public and industry together with the rapid expansion in scale and complexity of cloud computing resources and the services hosted on them have made monitoring, controlling, and provisioning cloud computing resources at runtime into a very challenging and complex task. This thesis investigates algorithms, models and techniques for autonomously monitoring, controlling, and provisioning the various resources required to meet services’ performance requirements and account for their resource usage.

Quota management mechanisms are essential for controlling distributed shared resources so that services do not exceed their allocated or paid-for budget. Appropriate cloud-wide monitoring and controlling of quotas must be exercised to avoid over- or under-provisioning of resources. To this end, this thesis presents new distributed algorithms that efficiently manage quotas for services running across distributed nodes.

Determining the optimal amount of resources to meet services’ performance requirements is a key task in cloud computing. However, this task is extremely challenging due to multi-faceted issues such as the dynamic nature of cloud environments, the need for supporting heterogeneous services with different performance requirements, the unpredictable nature of services’ workloads, the non-triviality of mapping performance measurements into resources, and resource shortages. Models and techniques that can predict the optimal amount of resources needed to meet service performance requirements at runtime irrespective of variations in workloads are proposed. Moreover, different service differentiation schemes are proposed for managing temporary resource shortages due to, e.g., flash crowds or hardware failures.

In addition, the resources used by services must be accounted for in order to properly bill customers. Thus, monitoring data for running services should be collected and aggregated to maintain a single global state of the system that can be used to generate a single bill for each customer. However, collecting and aggregating such data across geographically distributed locations is challenging because the management task itself may consume significant computing and network resources unless done with care. A consistency and synchronization mechanism that can alleviate this task is proposed.

Place, publisher, year, edition, pages
Umeå: Umeå University, 2015. p. 39
Series
Report / UMINF, ISSN 0348-0542 ; 15.10
Keywords [en]
cloud computing, distributed infrastructure, monitoring, accounting, performance modeling, service differentiation
National Category
Computer Systems
Research subject
administrative data processing
Identifiers
URN: urn:nbn:se:umu:diva-107955
ISBN: 978-91-7601-334-2 (print)
OAI: oai:DiVA.org:umu-107955
DiVA, id: diva2:849876
Public defence
2015-09-28, MA121 (MIT building), Umeå University, Umeå, 10:15 (English)
Opponent
Supervisors
Available from: 2015-09-07 Created: 2015-08-31 Last updated: 2018-06-07 Bibliographically approved
List of papers
1. Management of distributed resource allocations in multi-cluster environments
2012 (English) In: Performance Computing and Communications Conference (IPCCC) 2012, 31st International, IEEE, New York, USA: IEEE, 2012, p. 275-284. Conference paper, Published paper (Other academic)
Abstract [en]

We present a fully distributed solution for managing resource allocation for services running across multiple clusters in a large-scale cloud computing environment. Our solution allows individual services running across clusters to compete dynamically for allocations based on their rate of consumption while maintaining the global cloud-level allocation limits. The solution monitors resource consumption by services that are spread over a number of clusters. Global polls are triggered only when the allocated balance in a cluster falls below a threshold, and allocations are then reassigned in a manner that avoids further immediate global polls. Our solution achieves scalability by minimizing global message exchanges, increases performance by distributing requests, and improves availability by avoiding a single point of failure. We perform a range of simulations to verify the accuracy of our approach, to validate our theoretical results, and to evaluate it against competing approaches.
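
The threshold-triggered behaviour described in this abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the paper's algorithm: the `Cluster` type, the function names, and in particular the choice to split the pooled balance in proportion to each cluster's consumption rate (so that all clusters would run out at roughly the same time).

```python
from dataclasses import dataclass


@dataclass
class Cluster:
    name: str
    balance: float  # allocation still unspent in this cluster
    rate: float     # observed consumption rate (units per second)


def needs_global_poll(cluster: Cluster, threshold: float) -> bool:
    """A cluster triggers a global poll only when its local balance runs low."""
    return cluster.balance < threshold


def reassign(clusters: list[Cluster]) -> None:
    """Pool the remaining balance and split it in proportion to consumption rate.

    The sum of balances is unchanged, so the global limit is preserved.
    (Hypothetical policy; the abstract does not spell out the actual rule.)
    """
    total_balance = sum(c.balance for c in clusters)
    total_rate = sum(c.rate for c in clusters)
    for c in clusters:
        share = c.rate / total_rate if total_rate > 0 else 1 / len(clusters)
        c.balance = total_balance * share


def consume(clusters: list[Cluster], index: int, amount: float, threshold: float) -> bool:
    """Charge `amount` locally; return True if a global poll was triggered."""
    cluster = clusters[index]
    cluster.balance -= amount
    if needs_global_poll(cluster, threshold):
        reassign(clusters)
        return True
    return False
```

With a rate-proportional split, every cluster has the same expected time to exhaustion (balance divided by rate), which is one simple way to avoid an immediate follow-up poll.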

Place, publisher, year, edition, pages
New York, USA: IEEE, 2012
Series
2012 IEEE 31st International Performance Computing and Communications Conference (IPCCC), ISSN 1097-2641
Keywords
Distributed monitoring, distributed resource allocations management, distributed quota management
National Category
Computer Sciences
Identifiers
urn:nbn:se:umu:diva-66430 (URN)
10.1109/PCCC.2012.6407768 (DOI)
000313524400040 ()
978-1-4673-4881-2 (ISBN)
Conference
IEEE 31st International Performance Computing and Communications Conference (IPCCC), 2012-12-01 to 2012-12-03, Austin, Texas, USA
Available from: 2013-02-22 Created: 2013-02-19 Last updated: 2018-06-08 Bibliographically approved
2. A Tree-based Protocol for Enforcing Quotas in Clouds
2014 (English) In: the IEEE 10th 2014 World Congress on Services (SERVICES 2014), IEEE Computer Society, 2014. Conference paper, Published paper (Refereed)
Abstract [en]

Services are increasingly hosted on cloud nodes to enhance their performance and increase their availability. The virtually unlimited availability of resources enables service owners to consume resources without quantitative restrictions, paying only for what they consume. To avoid cost overruns, resource consumption must be controlled and capped when necessary. We present a distributed tree-based protocol to manage quotas in clouds that minimizes communication overhead and reduces the time required to check whether a quota has been exhausted. Experimental evaluation shows that our protocol provides 42% more communication savings and is up to 15 times faster compared to a distributed baseline solution.
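
As a rough illustration of why a tree can save messages, the sketch below lets each node accumulate usage locally and report it to its parent only in batches. This is a hypothetical design, not the protocol from the paper: the `QuotaNode` class, the `batch` parameter, and the batching rule are all assumptions, and the root's view may lag behind true usage by up to one unreported batch per descendant.

```python
class QuotaNode:
    """One node in a hypothetical quota-aggregation tree."""

    def __init__(self, parent=None, batch=10.0):
        self.parent = parent
        self.batch = batch     # assumed reporting granularity
        self.reported = 0.0    # usage already pushed up to the parent
        self.pending = 0.0     # usage seen here but not yet pushed up
        self.messages = 0      # upward messages sent by this node

    def record(self, amount):
        """Add usage; forward it upward only once a full batch has accumulated."""
        self.pending += amount
        if self.parent is not None and self.pending >= self.batch:
            self.messages += 1
            amount_up, self.pending = self.pending, 0.0
            self.reported += amount_up
            self.parent.record(amount_up)

    def known_usage(self):
        """Total usage this node has seen for its subtree."""
        return self.reported + self.pending


def quota_exhausted(root, quota):
    """Check the quota at the root without contacting any other node."""
    return root.known_usage() >= quota
```

In this sketch the quota check is a single local read at the root, and the number of upward messages grows with usage divided by the batch size rather than with the number of individual consumption events.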

Place, publisher, year, edition, pages
IEEE Computer Society, 2014
Keywords
Distributed Quota Monitoring, Distributed Quota Enforcement and Management, Distributed Credit Management, Clouds
National Category
Computer Systems
Identifiers
urn:nbn:se:umu:diva-93382 (URN)
Conference
the IEEE 10th 2014 World Congress on Services (SERVICES 2014)
Available from: 2014-09-18 Created: 2014-09-18 Last updated: 2018-06-07 Bibliographically approved
3. A synchronization mechanism for cloud accounting systems
2014 (English) In: 2014 International Conference on Cloud and Autonomic Computing (ICCAC 2014), 2014, p. 111-120. Conference paper, Published paper (Refereed)
Abstract [en]

In current cloud systems, services run across multiple geographically distributed clusters and continuously generate resource usage data due to constant resource consumption. In the context of accounting, resource usage data generated from each cluster during service runtime must be collected and aggregated into a single cloud-wide record so that a single bill can be created. This paper presents a mechanism to synchronize accounting records among distributed accounting system peers. Runtime resource usage data generated in different clusters is synchronized to maintain a single cloud-wide view of the data so that a single bill can be created. We provide a set of accounting system requirements and an evaluation which verifies that the solution fulfills these requirements. Experimental results show that our solution produces less overhead in terms of data exchange and scales near-linearly with the size of clusters with no single point of failure.
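
One simple way to picture how peers could converge on a single cloud-wide view is a merge of cumulative usage counters. The sketch below is an assumption for illustration only: it uses a grow-only counter merge (take the larger cumulative value per key), which is a standard technique swapped in here and not necessarily the mechanism the paper proposes. The `Usage` alias, `merge`, and `bill` are hypothetical names.

```python
# (service, cluster) -> cumulative usage reported by that cluster
Usage = dict[tuple[str, str], float]


def merge(local: Usage, remote: Usage) -> Usage:
    """Combine two peers' views; cumulative counters only grow, so keep the max."""
    merged = dict(local)
    for key, value in remote.items():
        merged[key] = max(merged.get(key, 0.0), value)
    return merged


def bill(view: Usage, service: str, price_per_unit: float) -> float:
    """Produce one bill for a service from the merged cloud-wide view."""
    total = sum(v for (svc, _cluster), v in view.items() if svc == service)
    return total * price_per_unit
```

Because this merge is commutative and idempotent, peers can exchange views in any order and any number of times and still arrive at the same record, so no central coordinator is needed in the sketch.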

Keywords
Distributed Accounting, Postpaid, Cloud, Synchronization
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:umu:diva-87753 (URN)
10.1109/ICCAC.2014.11 (DOI)
000370731000016 ()
978-1-4799-5841-2 (ISBN)
Conference
2014 IEEE International Conference on Cloud and Autonomic Computing (ICCAC), Imperial College, London, England, September 8-12, 2014
Available from: 2014-04-08 Created: 2014-04-08 Last updated: 2018-06-08 Bibliographically approved
4. Towards faster response time models for vertical elasticity
2014 (English) In: 2014 IEEE/ACM 7th International Conference on Utility and Cloud Computing (UCC), 2014, p. 560-565. Conference paper, Published paper (Refereed)
Abstract [en]

Resource provisioning in cloud computing is typically coarse-grained. For example, entire CPU cores may be allocated for periods of up to an hour. The Resource-as-a-Service cloud concept has been introduced to improve the efficiency of resource utilization in clouds. In this concept, resources are allocated in terms of CPU core fractions, with granularities of seconds. Such infrastructures could be created using existing technologies such as lightweight virtualization using LXC or by exploiting the Xen hypervisor’s capacity for vertical elasticity. However, performance models for determining how much capacity to allocate to each application are currently lacking. To address this deficit, we evaluate two performance models for predicting mean response times: the previously proposed queue length model and the novel inverse model. The models are evaluated using 3 applications under both open and closed system models. The inverse model reacted rapidly and remained stable even with targets as low as 0.5 seconds.
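
The abstract does not give the formula behind the inverse model. One plausible reading, assumed here purely for illustration, is that mean response time is inversely proportional to allocated capacity, t ≈ β / c. Under that assumption, β can be estimated from the latest measurement and the capacity needed for a target follows directly. The function names and the capacity bounds are hypothetical.

```python
def estimate_beta(measured_rt: float, capacity: float) -> float:
    """Estimate beta from one observation, assuming rt = beta / capacity."""
    return measured_rt * capacity


def required_capacity(beta: float, target_rt: float,
                      min_cap: float = 0.1, max_cap: float = 8.0) -> float:
    """Capacity (in CPU cores) predicted to hit target_rt, clamped to assumed bounds."""
    return min(max(beta / target_rt, min_cap), max_cap)


# Example: 1.0 s mean response time observed with 2 cores, target of 0.5 s.
beta = estimate_beta(measured_rt=1.0, capacity=2.0)
cores = required_capacity(beta, target_rt=0.5)
```

Re-estimating β on every control interval lets such a controller follow workload changes without an explicit workload model, which is one reason a model of this shape could react quickly.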

National Category
Computer Systems
Research subject
administrative data processing
Identifiers
urn:nbn:se:umu:diva-93798 (URN)
10.1109/UCC.2014.86 (DOI)
000380558700079 ()
978-1-4799-7881-6 (ISBN)
Conference
IEEE/ACM 7th International Conference on Utility and Cloud Computing (UCC), December 8-11, 2014, London, United Kingdom
Available from: 2014-10-01 Created: 2014-10-01 Last updated: 2018-06-07 Bibliographically approved
5. Tail Response Time Modeling and Control for Interactive Cloud Services
(English) Manuscript (preprint) (Other academic)
National Category
Computer Systems
Identifiers
urn:nbn:se:umu:diva-108028 (URN)
Available from: 2015-09-01 Created: 2015-09-01 Last updated: 2018-06-07 Bibliographically approved
6. Coordinating CPU and Memory Elasticity Controllers to Meet Service Response Time Constraints
2015 (English) In: 2015 International Conference on Cloud and Autonomic Computing (ICCAC), 2015, p. 69-80. Conference paper, Published paper (Refereed)
Abstract [en]

Vertical elasticity is recognized as a key enabler for efficient resource utilization of cloud infrastructure through fine-grained resource provisioning, e.g., allowing CPU cycles to be leased for as short as a few seconds. However, little research has been done on vertical elasticity, and existing work mostly focuses on a single resource, either CPU or memory, while an application may need arbitrary combinations of these resources at different stages of its execution. Moreover, the existing techniques cannot be readily used as-is without proper orchestration, since they may lead to either under- or over-provisioning of resources and consequently to undesirable behaviors such as performance disparity. The contribution of this paper is the design of an autonomic resource controller using a fuzzy control approach as a coordination technique. The novel controller dynamically adjusts the amount of CPU and memory to what is required to meet the performance objective of an application, namely its response time. We perform a thorough experimental evaluation using three different interactive benchmark applications, RUBiS, RUBBoS, and Olio, under workload traces generated based on open and closed system models. The results show that coordinating the memory and CPU elasticity controllers using the proposed fuzzy control provisions the right amount of resources to meet the response time target without over-committing any of the resource types. In contrast, with no coordination between controllers, the behaviour of the system is unpredictable, e.g., the application's performance target may be met but at the expense of over-provisioning one of the resources, or the application may crash due to a severe resource shortage resulting from conflicting decisions.

Keywords
cloud-based application, fuzzy control, cloud computing, vertical elasticity, performance, cpu utilization, memory utilization
National Category
Computer Systems
Identifiers
urn:nbn:se:umu:diva-108032 (URN)
10.1109/ICCAC.2015.20 (DOI)
000380476500007 ()
978-1-4673-9566-3 (ISBN)
Conference
2015 IEEE International Conference on Cloud and Autonomic Computing (ICCAC), Cambridge, MA, USA, September 21-24, 2015
Note

Originally included in thesis in accepted form.

Available from: 2015-09-01 Created: 2015-09-01 Last updated: 2018-06-07 Bibliographically approved
7. Performance-Based Service Differentiation in Clouds
2015 (English) In: 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), IEEE conference proceedings, 2015, p. 505-514. Conference paper, Published paper (Refereed)
Abstract [en]

Due to fierce competition, cloud providers need to run their data-centers efficiently. One of the issues is to increase data-center utilization while maintaining applications' performance targets. Achieving high data-center utilization while meeting applications' performance targets is difficult, as data-center overload may lead to poor performance of hosted services. Service differentiation has been proposed to control which services get degraded. However, current approaches are capacity-based: they are oblivious to the observed performance of each service and cannot divide the available capacity among hosted services so as to minimize overall performance degradation. In this paper we propose performance-based service differentiation. When enough capacity is available, each service is automatically allocated the amount of capacity that meets its target performance, expressed either as response time or throughput. In case of overload, we propose two service differentiation schemes that dynamically decide which services to degrade and to what extent. We carried out an extensive set of experiments using different services, both interactive and non-interactive, varying the workload mix of each service over time. The results demonstrate that our solution precisely provides guaranteed performance or service differentiation depending on available capacity.
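
The two regimes described above (enough capacity versus overload) can be sketched as a single allocation function. The overload rule used here, scaling every service's demand by the same factor, is a hypothetical placeholder and not necessarily either of the two schemes the paper proposes. The name `allocate` and the notion of a per-service capacity "demand" derived from its performance target are assumptions.

```python
def allocate(demands: dict[str, float], capacity: float) -> dict[str, float]:
    """Split `capacity` among services given the capacity each needs to meet its target.

    - Enough capacity: every service receives exactly its demand.
    - Overload: every demand is scaled by the same factor (assumed policy),
      so all services are degraded by the same relative amount.
    """
    total = sum(demands.values())
    if total <= capacity:
        return dict(demands)
    scale = capacity / total
    return {name: demand * scale for name, demand in demands.items()}
```

For instance, with 4 cores available and demands of 6 and 2 cores, this sketch would hand out 3 and 1 cores; a priority-aware scheme would instead skew the scale factor per service.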

Place, publisher, year, edition, pages
IEEE conference proceedings, 2015
Keywords
cloud computing, computer centres, interactive systems, cloud providers, data centers, performance-based service differentiation, Computational modeling, Degradation, Noise, Resource management, Throughput, Time factors, Time measurement, cloud, elasticity, guaranteed service, performance models, service differentiation
National Category
Computer Systems
Identifiers
urn:nbn:se:umu:diva-108026 (URN)
10.1109/CCGrid.2015.145 (DOI)
000380493100051 ()
978-1-4799-8006-2 (ISBN)
External cooperation:
Conference
2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2015), Shenzhen, People's Republic of China, May 4-7, 2015
Available from: 2015-09-01 Created: 2015-09-01 Last updated: 2018-06-07 Bibliographically approved
8. Towards Optimized Self-Management of Distributed Object Storage Systems
2015 (English) Report (Other academic)
Abstract [en]

Cloud storage is increasingly adopted by users due to simplified storage systems compared to on-premise storage. These systems are mostly presented as Object Storage Systems (OSSs), hiding issues, such as redundancy, from users. As new industries are considering adopting clouds for storage, OSSs have to evolve to support new needs. Among the most challenging is assuring guaranteed performance.

In this paper, we present Controllable Trade-offs (CTO), an OSS-agnostic solution to add performance guarantees. CTO presents itself as a thin layer that mediates requests between the user and the OSS. For generic support, performance is controlled by tuning the rejection probability, and implemented as a user-side queue. Results show that CTO may reduce penalties 3.23 times on average and up to 68 times when the load is high.
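
To make the idea of controlling performance through a rejection probability concrete, here is a minimal sketch of a client-side admission controller. It is not CTO itself: the `AdmissionController` class, the fixed-step adjustment rule, and the latency-based feedback signal are all assumptions chosen for illustration.

```python
import random


class AdmissionController:
    """Hypothetical user-side gate that rejects requests with a tunable probability."""

    def __init__(self, target_latency: float, step: float = 0.05, seed=None):
        self.target_latency = target_latency
        self.step = step            # assumed fixed adjustment per update
        self.reject_prob = 0.0      # start by admitting everything
        self._rng = random.Random(seed)

    def admit(self) -> bool:
        """Admit the request unless a random draw falls under reject_prob."""
        return self._rng.random() >= self.reject_prob

    def update(self, observed_latency: float) -> None:
        """Raise the rejection probability when too slow, lower it otherwise."""
        if observed_latency > self.target_latency:
            self.reject_prob = min(1.0, self.reject_prob + self.step)
        else:
            self.reject_prob = max(0.0, self.reject_prob - self.step)
```

Because the gate sits in front of the storage system and only needs observed latencies, a controller of this shape does not depend on any particular OSS, which matches the OSS-agnostic framing above.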

Publisher
p. 15
Series
Report / UMINF, ISSN 0348-0542 ; 15.11
National Category
Computer Systems
Identifiers
urn:nbn:se:umu:diva-108033 (URN)
Available from: 2015-09-01 Created: 2015-09-01 Last updated: 2018-06-07 Bibliographically approved

Open Access in DiVA

fulltext (757 kB), 841 downloads
File information
File name: FULLTEXT03.pdf, file size: 757 kB, checksum (SHA-512):
58526d3d9599b03e453234a539b0bc9c6f927d6535bc202a59851b66f989b97d23c4ba1476c24186420e405d5e6e2040ada9c6ad2f4f931e20c46b054a559a0b
Type: fulltext, MIME type: application/pdf
spikblad (119 kB), 35 downloads
File information
File name: FULLTEXT02.pdf, file size: 119 kB, checksum (SHA-512):
df3b2eda535d7d750806a222a8b3b59dca4436113612f4707dc1285c04f721a65d9a712db8184674c3e563ac25bde9a48cb15e46e6cfc02a45d181d3eb56ddf3
Type: spikblad, MIME type: application/pdf

Authority records

Lakew, Ewnetu Bayuh
