Home > Research > Publications & Outputs > XRL-SHAP-Cache

Electronic data

Links

Text available via DOI:

View graph of relations

XRL-SHAP-Cache: an explainable reinforcement learning approach for intelligent edge service caching in content delivery networks

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published
  • X. Xu
  • F. Wu
  • M. Bilal
  • X. Xia
  • W. Dou
  • L. Yao
  • W. Zhong
Close
Article number170303
<mark>Journal publication date</mark>27/06/2024
<mark>Journal</mark>Science China Information Sciences
Issue number7
Volume67
Publication StatusPublished
<mark>Original language</mark>English

Abstract

Content delivery networks (CDNs) play a pivotal role in the modern internet infrastructure by enabling efficient content delivery across diverse geographical regions. As an essential component of CDNs, the edge caching scheme directly influences the user experience by determining the caching and eviction of content on edge servers. With the emergence of 5G technology, traditional caching schemes have faced challenges in adapting to increasingly complex and dynamic network environments. Consequently, deep reinforcement learning (DRL) offers a promising solution for intelligent zero-touch network governance. However, the black-box nature of DRL models poses challenges in understanding and making trusting decisions. In this paper, we propose an explainable reinforcement learning (XRL)-based intelligent edge service caching approach, namely XRL-SHAP-Cache, which combines DRL with an explainable artificial intelligence (XAI) technique for cache management in CDNs. Instead of focusing solely on achieving performance gains, this study introduces a novel paradigm for providing interpretable caching strategies, thereby establishing a foundation for future transparent and trustworthy edge caching solutions. Specifically, a multi-level cache scheduling framework for CDNs was formulated theoretically, with the D3QN-based caching scheme serving as the targeted interpretable model. Subsequently, by integrating Deep-SHAP into our framework, the contribution of each state input feature to the agent’s Q-value output was calculated, thereby providing valuable insights into the decision-making process. The proposed XRL-SHAP-Cache approach was evaluated through extensive experiments to demonstrate the behavior of the scheduling agent in the face of different environmental inputs. The results demonstrate its strong explainability under various real-life scenarios while maintaining superior performance compared to traditional caching schemes in terms of cache hit ratio, quality of service (QoS), and space utilization.