Umeå universitets logga

umu.sePublikationer
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
How will your workload look like in 6 years?: Analyzing Wikimedia's workload
Umeå universitet, Teknisk-naturvetenskapliga fakulteten, Institutionen för datavetenskap. (Cloud and Grid Computing Cloud and grid computing)
Umeå universitet, Teknisk-naturvetenskapliga fakulteten, Institutionen för matematik och matematisk statistik.
Umeå universitet, Teknisk-naturvetenskapliga fakulteten, Institutionen för datavetenskap.
Umeå universitet, Teknisk-naturvetenskapliga fakulteten, Institutionen för matematik och matematisk statistik.
Visa övriga samt affilieringar
2014 (Engelska)Ingår i: Proceedings of the 2014 IEEE International Conference on Cloud Engineering (IC2E 2014) / [ed] Lisa O’Conner, IEEE Computer Society, 2014, s. 349-354Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Accurate understanding of workloads is key to efficient cloud resource management as well as to the design of large-scale applications. We analyze and model the workload of Wikipedia, one of the world's largest web sites. With descriptive statistics, time-series analysis, and polynomial splines, we study the trend and seasonality of the workload, its evolution over the years, and also investigate patterns in page popularity. Our results indicate that the workload is highly predictable with a strong seasonality. Our short term prediction algorithm is able to predict the workload with a Mean Absolute Percentage Error of around 2%.

Ort, förlag, år, upplaga, sidor
IEEE Computer Society, 2014. s. 349-354
Serie
IEEE, ISSN 2373-3845
Nationell ämneskategori
Annan elektroteknik och elektronik Datorsystem
Forskningsämne
administrativ databehandling
Identifikatorer
URN: urn:nbn:se:umu:diva-87235DOI: 10.1109/IC2E.2014.50ISI: 000361018600043Scopus ID: 2-s2.0-84908587591ISBN: 978-1-4799-3766-0 (tryckt)OAI: oai:DiVA.org:umu-87235DiVA, id: diva2:707725
Konferens
IC2E 2014, IEEE International Conference on Cloud Engineering, Boston, Massachusetts, 11-14 March 2014
Forskningsfinansiär
Vetenskapsrådet, C0590801eSSENCE - An eScience CollaborationTillgänglig från: 2014-03-25 Skapad: 2014-03-25 Senast uppdaterad: 2023-03-24Bibliografiskt granskad
Ingår i avhandling
1. Workload characterization, controller design and performance evaluation for cloud capacity autoscaling
Öppna denna publikation i ny flik eller fönster >>Workload characterization, controller design and performance evaluation for cloud capacity autoscaling
2015 (Engelska)Doktorsavhandling, sammanläggning (Övrigt vetenskapligt)
Abstract [en]

This thesis studies cloud capacity auto-scaling, or how to provision and release re-sources to a service running in the cloud based on its actual demand using an auto-matic controller. As the performance of server systems depends on the system design,the system implementation, and the workloads the system is subjected to, we focuson these aspects with respect to designing auto-scaling algorithms. Towards this goal,we design and implement two auto-scaling algorithms for cloud infrastructures. Thealgorithms predict the future load for an application running in the cloud. We discussthe different approaches to designing an auto-scaler combining reactive and proactivecontrol methods, and to be able to handle long running requests, e.g., tasks runningfor longer than the actuation interval, in a cloud. We compare the performance ofour algorithms with state-of-the-art auto-scalers and evaluate the controllers’ perfor-mance with a set of workloads. As any controller is designed with an assumptionon the operating conditions and system dynamics, the performance of an auto-scalervaries with different workloads.In order to better understand the workload dynamics and evolution, we analyze a6-years long workload trace of the sixth most popular Internet website. In addition,we analyze a workload from one of the largest Video-on-Demand streaming servicesin Sweden. We discuss the popularity of objects served by the two services, the spikesin the two workloads, and the invariants in the workloads. We also introduce, a mea-sure for the disorder in a workload, i.e., the amount of burstiness. The measure isbased on Sample Entropy, an empirical statistic used in biomedical signal processingto characterize biomedical signals. The introduced measure can be used to charac-terize the workloads based on their burstiness profiles. We compare our introducedmeasure with the literature on quantifying burstiness in a server workload, and showthe advantages of our introduced measure.To better understand the tradeoffs between using different auto-scalers with differ-ent workloads, we design a framework to compare auto-scalers and give probabilisticguarantees on the performance in worst-case scenarios. Using different evaluation cri-teria and more than 700 workload traces, we compare six state-of-the-art auto-scalersthat we believe represent the development of the field in the past 8 years. Knowingthat the auto-scalers’ performance depends on the workloads, we design a workloadanalysis and classification tool that assigns a workload to its most suitable elasticitycontroller out of a set of implemented controllers. The tool has two main components;an analyzer, and a classifier. The analyzer analyzes a workload and feeds the analysisresults to the classifier. The classifier assigns a workload to the most suitable elasticitycontroller based on the workload characteristics and a set of predefined business levelobjectives. The tool is evaluated with a set of collected real workloads, and a set ofgenerated synthetic workloads. Our evaluation results shows that the tool can help acloud provider to improve the QoS provided to the customers.

Ort, förlag, år, upplaga, sidor
Umeå: Umeå University, 2015. s. 16
Serie
Report / UMINF, ISSN 0348-0542 ; 15.09
Nyckelord
cloud computing, autoscaling, workloads, performance modeling, controller design
Nationell ämneskategori
Datorsystem
Identifikatorer
urn:nbn:se:umu:diva-108398 (URN)978-91-7601-330-4 (ISBN)
Disputation
2015-10-02, N360, Naturveterhuset Building, Umeå University, Umeå, 14:00 (Engelska)
Opponent
Handledare
Forskningsfinansiär
EU, Europeiska forskningsrådetVetenskapsrådet
Tillgänglig från: 2015-09-11 Skapad: 2015-09-10 Senast uppdaterad: 2021-03-18Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltextScopus

Person

Ali-Eldin, AhmedRezaie, AliMehta, AmardeepSjöstedt-de Luna, SaraSeleznjev, OlegTordsson, JohanElmroth, Erik

Sök vidare i DiVA

Av författaren/redaktören
Ali-Eldin, AhmedRezaie, AliMehta, AmardeepSjöstedt-de Luna, SaraSeleznjev, OlegTordsson, JohanElmroth, Erik
Av organisationen
Institutionen för datavetenskapInstitutionen för matematik och matematisk statistik
Annan elektroteknik och elektronikDatorsystem

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetricpoäng

doi
isbn
urn-nbn
Totalt: 2382 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf