Distributed service caching with deep reinforcement learning for sustainable edge computing in large-scale AI

Home > Research > Publications & Outputs > Distributed service caching with deep reinforce...

Computing and Communications

Associated organisational unit

Insight

Text available via DOI:

https://doi.org/10.1016/j.dcan.2024.11.009
Accepted author manuscript
Available under license: CC BY-NC-ND: Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License

View graph of relations

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Forthcoming

Wei Liu
Muhammad Bilal
Yuzhe Shi
Xiaolong Xu

More...

<mark>Journal publication date</mark>	17/11/2024
<mark>Journal</mark>	Digital Communications and Networks
Publication Status	Accepted/In press
<mark>Original language</mark>	English

Abstract

Increasing reliance on large-scale AI models has led to rising demand for intelligent services. The centralized cloud computing approach has limitations in terms of data transfer efficiency and response time, and as a result many service providers have begun to deploy edge servers to cache intelligent services in order to reduce transmission delay and communication energy consumption. However, finding the optimal service caching strategy remains a significant challenge due to the stochastic nature of service requests and the bulky nature of intelligent services. To deal with this we propose a distributed service caching scheme integrating deep reinforcement learning (DRL) with mobility prediction, which we refer to as DSDM. Specifically, we employ the D3QN (Deep Double Dueling Q-Network) framework to integrate Long Short-Term Memory (LSTM) predicted mobile device locations into the service caching replacement algorithm and adopt the distributed multi-agent approach for learning and training. Experimental results demonstrate that DSDM achieves significant performance improvements in reducing communication energy consumption compared to traditional methods across various scenarios.

Research

Associated organisational unit

Text available via DOI:

Distributed service caching with deep reinforcement learning for sustainable edge computing in large-scale AI

Abstract

Quick Links

Connect With Us

Faculties & Depts

Contact Us