umu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Capacity Scaling for Elastic Compute Clouds
Umeå University, Faculty of Science and Technology, Department of Computing Science. (Cloud and Grid Computing)
2013 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

AbstractCloud computing is a computing model that allows better management, higher utiliza-tion and reduced operating costs for datacenters while providing on demand resourceprovisioning for different customers. Data centers are often enormous in size andcomplexity. In order to fully realize the cloud computing model, efficient cloud man-agement software systems that can deal with the datacenter size and complexity needto be designed and built.This thesis studies automated cloud elasticity management, one of the main andcrucial datacenter management capabilities. Elasticity can be defined as the abilityof cloud infrastructures to rapidly change the amount of resources allocated to anapplication in the cloud according to its demand. This work introduces algorithms,techniques and tools that a cloud provider can use to automate dynamic resource pro-visioning allowing the provider to better manage the datacenter resources. We designtwo automated elasticity algorithms for cloud infrastructures that predict the futureload for an application running on the cloud. It is assumed that a request is either ser-viced or dropped after one time unit, that all requests are homogeneous and that it takesone time unit to add or remove resources. We discuss the different design approachesfor elasticity controllers and evaluate our algorithms using real workload traces. Wecompare the performance of our algorithms with a state-of-the-art controller. We ex-tend on the design of the best performing controller out of our two controllers anddrop the assumptions made during the first design. The controller is evaluated with aset of different real workloads.All controllers are designed using certain assumptions on the underlying systemmodel and operating conditions. This limits a controller’s performance if the modelor operating conditions change. With this as a starting point, we design a workloadanalysis and classification tool that assigns a workload to its most suitable elasticitycontroller out of a set of implemented controllers. The tool has two main components,an analyzer and a classifier. The analyzer analyzes a workload and feeds the analysisresults to the classifier. The classifier assigns a workload to the most suitable elasticitycontroller based on the workload characteristics and a set of predefined business levelobjectives. The tool is evaluated with a set of collected real workloads and a set ofgenerated synthetic workloads. Our evaluation results shows that the tool can help acloud provider to improve the QoS provided to the customers.

Place, publisher, year, edition, pages
Umeå: Umeå universitet , 2013. , 22 p.
Series
Report / UMINF, ISSN 0348-0542 ; 2013:14
National Category
Computer Systems
Research subject
Computer Science; Computer Systems
Identifiers
URN: urn:nbn:se:umu:diva-87238Libris ID: 15409138ISBN: 978-91-7459-688-5 (print)OAI: oai:DiVA.org:umu-87238DiVA: diva2:707751
Presentation
2013-06-10, Umeå universitet, Umeå, 11:00
Opponent
Supervisors
Funder
EU, FP7, Seventh Framework ProgrammeSwedish Research CouncileSSENCE - An eScience Collaboration
Note

Enligt Libris är författarnamnet: Ahmed Aleyeldin (Ali-Eldin) Hassan.

Available from: 2014-04-03 Created: 2014-03-25 Last updated: 2014-04-03Bibliographically approved
List of papers
1. An adaptive hybrid elasticity controller for cloud infrastructures
Open this publication in new window or tab >>An adaptive hybrid elasticity controller for cloud infrastructures
2012 (English)In: 2012 IEEE Network operations and managent symposium (NOMS), IEEE Communications Society, 2012, 204-212 p.Conference paper, Published paper (Refereed)
Abstract [en]

Cloud elasticity is the ability of the cloud infrastructure to rapidly change the amount of resources allocated to a service in order to meet the actual varying demands on the service while enforcing SLAs. In this paper, we focus on horizontal elasticity, the ability of the infrastructure to add or remove virtual machines allocated to a service deployed in the cloud. We model a cloud service using queuing theory. Using that model we build two adaptive proactive controllers that estimate the future load on a service. We explore the different possible scenarios for deploying a proactive elasticity controller coupled with a reactive elasticity controller in the cloud. Using simulation with workload traces from the FIFA world-cup web servers, we show that a hybrid controller that incorporates a reactive controller for scale up coupled with our proactive controllers for scale down decisions reduces SLA violations by a factor of 2 to 10 compared to a regression based controller or a completely reactive controller.

Place, publisher, year, edition, pages
IEEE Communications Society, 2012
Series
IEEE IFIP Network Operations and Management Symposium, ISSN 1542-1201
National Category
Computer and Information Science
Identifiers
urn:nbn:se:umu:diva-51044 (URN)10.1109/NOMS.2012.6211900 (DOI)000309517000025 ()978-1-4673-0268-5 (ISBN)
Conference
13th IEEE/IFIP Network Operations and Management Symposium, 16-20 April 2012, Maui, Hawaii, USA
Available from: 2012-01-09 Created: 2012-01-09 Last updated: 2015-09-11Bibliographically approved
2. Efficient provisioning of bursty scientific workloads on the cloud using adaptive elasticity control
Open this publication in new window or tab >>Efficient provisioning of bursty scientific workloads on the cloud using adaptive elasticity control
2012 (English)In: Proceedings of the 3rd workshop on Scientific Cloud Computing Date, Association for Computing Machinery (ACM), 2012, 31-40 p.Conference paper, Published paper (Refereed)
Abstract [en]

Elasticity is the ability of a cloud infrastructure to dynamically change theamount of resources allocated to a running service as load changes. We build anautonomous elasticity controller that changes the number of virtual machinesallocated to a service based on both monitored load changes and predictions offuture load. The cloud infrastructure is modeled as a G/G/N queue. This modelis used to construct a hybrid reactive-adaptive controller that quickly reactsto sudden load changes, prevents premature release of resources, takes intoaccount the heterogeneity of the workload, and avoids oscillations. Using simulations with Web and cluster workload traces, we show that our proposed controller lowers the number of delayed requests by a factor of 70 for the Web traces and 3 for the cluster traces when compared to a reactive controller. Ourcontroller also decreases the average number of queued requests by a factor of 3 for both traces, and reduces oscillations by a factor of 7 for the Web traces and 3 for the cluster traces. This comes at the expense of between 20% and 30% over-provisioning, as compared to a few percent for the reactive controller.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2012
Keyword
Cloud Computing, Elasticity, Proportional Control
National Category
Computer Science
Research subject
Computer Science; Signal Processing
Identifiers
urn:nbn:se:umu:diva-54015 (URN)10.1145/2287036.2287044 (DOI)978-1-4503-1340-7 E-ISBN (ISBN)145031340X Print (ISBN)
Conference
Third workshop on scientific cloud computing, ScienceCloud 2012, June 18th 2012, Delft, The Netherlands
Projects
OPTIMIS
Available from: 2012-04-12 Created: 2012-04-11 Last updated: 2015-09-11Bibliographically approved
3. Workload Classification for Efficient Auto-Scaling of Cloud Resources
Open this publication in new window or tab >>Workload Classification for Efficient Auto-Scaling of Cloud Resources
2013 (English)Manuscript (preprint) (Other academic)
Abstract [en]

Elasticity algorithms for cloud infrastructures dynamically change the amount of resources allocated to a running service according to the current and predicted future load. Since there is no perfect predictor, and since different applications’ workloads have different characteristics, no single elasticity algorithm is suitable for future predictions for all workloads. In this work, we introduceWAC, aWorkload Analysis and Classification tool that analyzes workloads and assigns them to the most suitable elasticity controllers based on the workloads’ characteristics and a set of business level objectives.

WAC has two main components, the analyzer and the classifier. The analyzer analyzes workloads to extract some of the features used by the classifier, namely, workloads’ autocorrelations and sample entropies which measure the periodicity and the burstiness of the workloads respectively. These two features are used with the business level objectives by the clas-sifier as the features used to assign workloads to elasticity controllers. We start by analyzing 14 real workloads available from different applications. In addition, a set of 55 workloads is generated to test WAC on more workload configurations. We implement four state of the art elasticity algorithms. The controllers are the classes to which the classifier assigns workloads. We use a K nearest neighbors classifier and experiment with different workload combinations as training and test sets. Our experi-ments show that, when the classifier is tuned carefully, WAC correctly classifies between 92% and 98.3% of the workloads to the most suitable elasticity controller.

Publisher
36 p.
National Category
Computer Systems
Research subject
Computing Science
Identifiers
urn:nbn:se:umu:diva-87231 (URN)
Projects
Cloud Control
Funder
Swedish Research Council
Note

May 21, 2013.

Available from: 2014-03-25 Created: 2014-03-25 Last updated: 2014-04-03Bibliographically approved

Open Access in DiVA

Capacity Scaling for Elastic Compute Clouds(578 kB)1050 downloads
File information
File name FULLTEXT02.pdfFile size 578 kBChecksum SHA-512
8c9361858ac6c379aeae13203434708d6cd01830ab0341a70528af2a0fb122894f087650f663834f278302bc59081233328193f37c3f61b201d33891659489bf
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Ali-Eldin, Ahmed
By organisation
Department of Computing Science
Computer Systems

Search outside of DiVA

GoogleGoogle Scholar
Total: 1050 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 699 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf