Home > Research > Publications & Outputs > Providing Fault Tolerance via Complex Event Pro...

Electronic data

  • pft

    Rights statement: © ACM, 2019. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in Proceedings of the 9th International Conference on the Internet of Things, IoT 2019 http://doi.acm.org/10.1145/3365871.3365872

    Accepted author manuscript, 3.15 MB, PDF document

    Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Links

Text available via DOI:

View graph of relations

Providing Fault Tolerance via Complex Event Processing and Machine Learning for IoT Systems

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Published
Publication date22/10/2019
Host publicationProceedings of the 9th International Conference on the Internet of Things, IoT 2019
Place of PublicationNew York
PublisherACM
Number of pages7
ISBN (electronic)9781450372077
<mark>Original language</mark>English

Abstract

Fault-tolerance (FT) support is a key challenge for ensuring dependable Internet of Things (IoT) systems. Many existing FT-support mechanisms in IoT are static, tightly coupled, inflexible implementations that struggle to adapt in dynamic IoT environments. This paper proposes Complex Patterns of Failure (CPoF), an approach to providing reactive and proactive FT using Complex Event Processing (CEP) and Machine Learning (ML). Error-detection strategies are defined as nondeterministic finite automata (NFA) and implemented via CEP systems. Reactive-FT support is monitored and learned from to train ML models that proactively handle imminent future occurrences of known errors. We evaluated CPoF on an indoor agriculture system with experiments that used time and error correlations to preempt battery-depletion failures. We trained predictive models to learn from reactive-FT support and provide preemptive error recovery.

Bibliographic note

© ACM, 2019. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in Proceedings of the 9th International Conference on the Internet of Things, IoT 2019 http://doi.acm.org/10.1145/3365871.3365872