Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review
Publication date | 2006 |
---|---|
Host publication | SRDS 2006: 25th IEEE Symposium on Reliable Distributed Systems, Proceedings |
Editors | S Kawada |
Place of Publication | LOS ALAMITOS |
Publisher | IEEE COMPUTER SOC |
Pages | 132-142 |
Number of pages | 11 |
ISBN (print) | 0-7695-2677-2 |
Original language | English |
Event | 25th IEEE Symposium on Reliable Distributed Systems, Leeds. Duration: 2/10/2006 → 4/10/2006 |
Conference | 25th IEEE Symposium on Reliable Distributed Systems |
---|---|
City | Leeds |
Period | 2/10/06 → 4/10/06 |
We present and evaluate a generic approach to the repair of overlay networks which identifies general principles of overlay repair and embodies these as a reusable service. At the heart of our approach is an algorithm that discovers the extent of a failed section of any type of overlay, and assigns responsibility to carry out the repair. The repair strategy itself is 'pluggable' and can be tailored to the requirements of a specific overlay type or instance. Our approach is efficient in terms of the number of repair-related message exchanges it incurs; scalable in that it involves only nodes in the locality of the failed section of the overlay; and resilient in that it correctly handles cases in which multiple adjacent nodes fail simultaneously, and it tolerates new failures that occur while a repair is underway. The benefits of our approach are that: (i) it extracts and encapsulates best practice in repair for overlays; (ii) it simplifies the design and implementation of new overlays (because repair issues can be treated orthogonally to basic functionality); and (iii) it supports tailorable levels of dependability for overlays, including pluggable repair strategies.
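The abstract does not give the algorithm itself, so the following is only an illustrative sketch of the general idea, not the paper's method. It assumes a centralised view of the overlay as an undirected adjacency map, a known set of failed nodes, and a simple "lowest identifier wins" rule for assigning repair responsibility. All names (`failed_section`, `elect_coordinator`, `chain_strategy`, `repair`) are hypothetical. The real service is distributed and message-based, which this sketch does not model.

```python
from collections import deque
from typing import Callable, Dict, List, Optional, Set, Tuple

Overlay = Dict[str, Set[str]]          # node id -> neighbour ids (undirected)
Link = Tuple[str, str]
# A pluggable strategy maps (overlay, failed section, live border) to new links.
RepairStrategy = Callable[[Overlay, Set[str], Set[str]], List[Link]]


def failed_section(overlay: Overlay, failed: Set[str], start: str) -> Tuple[Set[str], Set[str]]:
    """Breadth-first walk over adjacent failed nodes, beginning at `start`.

    Returns the connected failed section and its border, i.e. the live
    nodes directly adjacent to that section.
    """
    section: Set[str] = set()
    border: Set[str] = set()
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node in section:
            continue
        section.add(node)
        for nbr in overlay[node]:
            if nbr in failed:
                if nbr not in section:
                    queue.append(nbr)
            else:
                border.add(nbr)
    return section, border


def elect_coordinator(border: Set[str]) -> str:
    """Assumed rule: the border node with the smallest id carries out the repair."""
    return min(border)


def chain_strategy(overlay: Overlay, section: Set[str], border: Set[str]) -> List[Link]:
    """Hypothetical strategy: reconnect border nodes as a chain in id order."""
    ordered = sorted(border)
    return list(zip(ordered, ordered[1:]))


def repair(overlay: Overlay, failed: Set[str], start: str,
           strategy: RepairStrategy) -> Tuple[Optional[str], List[Link]]:
    """Find the failed section, pick a coordinator, and apply the strategy in place."""
    section, border = failed_section(overlay, failed, start)
    if not border:
        return None, []                # nothing live is left to reconnect
    coordinator = elect_coordinator(border)
    new_links = strategy(overlay, section, border)
    # Drop the failed nodes and every reference to them.
    for dead in section:
        for nbr in overlay.pop(dead):
            if nbr in overlay:
                overlay[nbr].discard(dead)
    # Install the links chosen by the strategy.
    for a, b in new_links:
        overlay[a].add(b)
        overlay[b].add(a)
    return coordinator, new_links
```

Only the failed section and its border are touched, which mirrors the locality property claimed in the abstract. Passing a different `strategy` function changes how the border is reconnected without altering the discovery step.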