Home > Research > Publications & Outputs > An approach for characterizing workloads in Goo...

Links

Text available via DOI:

View graph of relations

An approach for characterizing workloads in Google Cloud to derive realistic resource utilization models

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Published

Standard

An approach for characterizing workloads in Google Cloud to derive realistic resource utilization models. / Moreno, Ismael Solis; Garraghan, Peter; Townend, Paul.
2013 IEEE 7th International Symposium on Service Oriented System Engineering (SOSE). IEEE, 2013. p. 49-60.

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Harvard

Moreno, IS, Garraghan, P & Townend, P 2013, An approach for characterizing workloads in Google Cloud to derive realistic resource utilization models. in 2013 IEEE 7th International Symposium on Service Oriented System Engineering (SOSE). IEEE, pp. 49-60. https://doi.org/10.1109/SOSE.2013.24

APA

Moreno, I. S., Garraghan, P., & Townend, P. (2013). An approach for characterizing workloads in Google Cloud to derive realistic resource utilization models. In 2013 IEEE 7th International Symposium on Service Oriented System Engineering (SOSE) (pp. 49-60). IEEE. https://doi.org/10.1109/SOSE.2013.24

Vancouver

Moreno IS, Garraghan P, Townend P. An approach for characterizing workloads in Google Cloud to derive realistic resource utilization models. In 2013 IEEE 7th International Symposium on Service Oriented System Engineering (SOSE). IEEE. 2013. p. 49-60 doi: 10.1109/SOSE.2013.24

Author

Moreno, Ismael Solis ; Garraghan, Peter ; Townend, Paul. / An approach for characterizing workloads in Google Cloud to derive realistic resource utilization models. 2013 IEEE 7th International Symposium on Service Oriented System Engineering (SOSE). IEEE, 2013. pp. 49-60

Bibtex

@inproceedings{862bef3ea71143eea548338b3d959e79,
title = "An approach for characterizing workloads in Google Cloud to derive realistic resource utilization models",
abstract = "Analyzing behavioral patterns of workloads is critical to understanding Cloud computing environments. However, until now only a limited number of real-world Cloud data center trace logs have been available for analysis. This has led to a lack of methodologies to capture the diversity of patterns that exist in such datasets. This paper presents the first large-scale analysis of real-world Cloud data, using a recently released dataset that features traces from over 12,000 servers over the period of a month. Based on this analysis, we develop a novel approach for characterizing workloads that for the first time considers Cloud workload in the context of both user and task in order to derive a model to capture resource estimation and utilization patterns. The derived model assists in understanding the relationship between users and tasks within workload, and enables further work such as resource optimization, energy-efficiency improvements, and failure correlation. Additionally, it provides a mechanism to create patterns that randomly fluctuate based on realistic parameters. This is critical to emulating dynamic environments instead of statically replaying records in the trace log. Our approach is evaluated by contrasting the logged data against simulation experiments, and our results show that the derived model parameters correctly describe the operational environment within a 5% of error margin, confirming the great variability of patterns that exist in Cloud computing.",
keywords = "workload characterization, Cloud computing workload patterns, MapReduce analysis, resource usage patterns",
author = "Moreno, {Ismael Solis} and Peter Garraghan and Paul Townend",
year = "2013",
month = jun,
day = "10",
doi = "10.1109/SOSE.2013.24",
language = "English",
pages = "49--60",
booktitle = "2013 IEEE 7th International Symposium on Service Oriented System Engineering (SOSE)",
publisher = "IEEE",

}

RIS

TY - GEN

T1 - An approach for characterizing workloads in Google Cloud to derive realistic resource utilization models

AU - Moreno, Ismael Solis

AU - Garraghan, Peter

AU - Townend, Paul

PY - 2013/6/10

Y1 - 2013/6/10

N2 - Analyzing behavioral patterns of workloads is critical to understanding Cloud computing environments. However, until now only a limited number of real-world Cloud data center trace logs have been available for analysis. This has led to a lack of methodologies to capture the diversity of patterns that exist in such datasets. This paper presents the first large-scale analysis of real-world Cloud data, using a recently released dataset that features traces from over 12,000 servers over the period of a month. Based on this analysis, we develop a novel approach for characterizing workloads that for the first time considers Cloud workload in the context of both user and task in order to derive a model to capture resource estimation and utilization patterns. The derived model assists in understanding the relationship between users and tasks within workload, and enables further work such as resource optimization, energy-efficiency improvements, and failure correlation. Additionally, it provides a mechanism to create patterns that randomly fluctuate based on realistic parameters. This is critical to emulating dynamic environments instead of statically replaying records in the trace log. Our approach is evaluated by contrasting the logged data against simulation experiments, and our results show that the derived model parameters correctly describe the operational environment within a 5% of error margin, confirming the great variability of patterns that exist in Cloud computing.

AB - Analyzing behavioral patterns of workloads is critical to understanding Cloud computing environments. However, until now only a limited number of real-world Cloud data center trace logs have been available for analysis. This has led to a lack of methodologies to capture the diversity of patterns that exist in such datasets. This paper presents the first large-scale analysis of real-world Cloud data, using a recently released dataset that features traces from over 12,000 servers over the period of a month. Based on this analysis, we develop a novel approach for characterizing workloads that for the first time considers Cloud workload in the context of both user and task in order to derive a model to capture resource estimation and utilization patterns. The derived model assists in understanding the relationship between users and tasks within workload, and enables further work such as resource optimization, energy-efficiency improvements, and failure correlation. Additionally, it provides a mechanism to create patterns that randomly fluctuate based on realistic parameters. This is critical to emulating dynamic environments instead of statically replaying records in the trace log. Our approach is evaluated by contrasting the logged data against simulation experiments, and our results show that the derived model parameters correctly describe the operational environment within a 5% of error margin, confirming the great variability of patterns that exist in Cloud computing.

KW - workload characterization

KW - Cloud computing workload patterns

KW - MapReduce analysis

KW - resource usage patterns

U2 - 10.1109/SOSE.2013.24

DO - 10.1109/SOSE.2013.24

M3 - Conference contribution/Paper

SP - 49

EP - 60

BT - 2013 IEEE 7th International Symposium on Service Oriented System Engineering (SOSE)

PB - IEEE

ER -