Fault-tolerant dynamic deduplication for utility computing

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Published

Standard

Fault-tolerant dynamic deduplication for utility computing. / Leesakul, Waraporn ; Townend, Paul; Garraghan, Peter.
2014 IEEE 17th International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing. IEEE, 2014. p. 397-404.

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Harvard

Leesakul, W, Townend, P & Garraghan, P 2014, Fault-tolerant dynamic deduplication for utility computing. in 2014 IEEE 17th International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing. IEEE, pp. 397-404. https://doi.org/10.1109/ISORC.2014.55

APA

Leesakul, W., Townend, P., & Garraghan, P. (2014). Fault-tolerant dynamic deduplication for utility computing. In 2014 IEEE 17th International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing (pp. 397-404). IEEE. https://doi.org/10.1109/ISORC.2014.55

Vancouver

Leesakul W, Townend P, Garraghan P. Fault-tolerant dynamic deduplication for utility computing. In 2014 IEEE 17th International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing. IEEE. 2014. p. 397-404 doi: 10.1109/ISORC.2014.55

Author

Leesakul, Waraporn ; Townend, Paul ; Garraghan, Peter. / Fault-tolerant dynamic deduplication for utility computing. 2014 IEEE 17th International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing. IEEE, 2014. pp. 397-404

Bibtex

@inproceedings{db1eb1dfd3514160ab498816dd4fd7e8,

title = "Fault-tolerant dynamic deduplication for utility computing",

abstract = "Utility computing is an increasingly important paradigm, whereby computing resources are provided on-demand as utilities. An important component of utility computing is storage, data volumes are growing rapidly, and mechanisms to mitigate this growth need to be developed. Data deduplication is a promising technique for drastically reducing the amount of data stored in such system systems, however, current approachs are static in nature, using an amount of redundancy fixed at design time. This is inappropriate for truly dynamic modern systems. We propose a real-time adaptive deduplication system for Cloud and Utility computing that monitors in real-time for changing system, user, and environmental behaviour in order to fulfill a balance between changing storage efficiency, performance, and fault tolerance requirements. We evaluate our system through simulation, with experimental results showing that our system is both efficient and sclable. We also perform experimentation to evaluate the fault tolerance of the system by measuring Mean Time to Repair (MTTR), and using these values to calculate availability of the system. The results show that higher replication levels result in higher system availability, however, the number of files in the system also effects recovery time. We show that the tradeoff between replication levels and recovery time when the system overloads needs further investigation.",

keywords = "Dependability, Utility Computing, Fault-tolerance, Cloud Computing, Storage, Deduplication, Adaptive",

author = "Waraporn Leesakul and Paul Townend and Peter Garraghan",

year = "2014",

month = sep,

day = "18",

doi = "10.1109/ISORC.2014.55",

language = "English",

pages = "397--404",

booktitle = "2014 IEEE 17th International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing",

publisher = "IEEE",

}

RIS

TY - GEN

T1 - Fault-tolerant dynamic deduplication for utility computing

AU - Leesakul, Waraporn

AU - Townend, Paul

AU - Garraghan, Peter

PY - 2014/9/18

Y1 - 2014/9/18

N2 - Utility computing is an increasingly important paradigm, whereby computing resources are provided on-demand as utilities. An important component of utility computing is storage, data volumes are growing rapidly, and mechanisms to mitigate this growth need to be developed. Data deduplication is a promising technique for drastically reducing the amount of data stored in such system systems, however, current approachs are static in nature, using an amount of redundancy fixed at design time. This is inappropriate for truly dynamic modern systems. We propose a real-time adaptive deduplication system for Cloud and Utility computing that monitors in real-time for changing system, user, and environmental behaviour in order to fulfill a balance between changing storage efficiency, performance, and fault tolerance requirements. We evaluate our system through simulation, with experimental results showing that our system is both efficient and sclable. We also perform experimentation to evaluate the fault tolerance of the system by measuring Mean Time to Repair (MTTR), and using these values to calculate availability of the system. The results show that higher replication levels result in higher system availability, however, the number of files in the system also effects recovery time. We show that the tradeoff between replication levels and recovery time when the system overloads needs further investigation.

AB - Utility computing is an increasingly important paradigm, whereby computing resources are provided on-demand as utilities. An important component of utility computing is storage, data volumes are growing rapidly, and mechanisms to mitigate this growth need to be developed. Data deduplication is a promising technique for drastically reducing the amount of data stored in such system systems, however, current approachs are static in nature, using an amount of redundancy fixed at design time. This is inappropriate for truly dynamic modern systems. We propose a real-time adaptive deduplication system for Cloud and Utility computing that monitors in real-time for changing system, user, and environmental behaviour in order to fulfill a balance between changing storage efficiency, performance, and fault tolerance requirements. We evaluate our system through simulation, with experimental results showing that our system is both efficient and sclable. We also perform experimentation to evaluate the fault tolerance of the system by measuring Mean Time to Repair (MTTR), and using these values to calculate availability of the system. The results show that higher replication levels result in higher system availability, however, the number of files in the system also effects recovery time. We show that the tradeoff between replication levels and recovery time when the system overloads needs further investigation.

KW - Dependability

KW - Utility Computing

KW - Fault-tolerance

KW - Cloud Computing

KW - Storage

KW - Deduplication

KW - Adaptive

U2 - 10.1109/ISORC.2014.55

DO - 10.1109/ISORC.2014.55

M3 - Conference contribution/Paper

SP - 397

EP - 404

BT - 2014 IEEE 17th International Symposium on Object/Component/Service-Oriented Real-Time Distributed Computing

PB - IEEE

ER -

Research

Associated organisational unit

Links

Text available via DOI:

Keywords