Research output: Contribution to Journal/Magazine › Journal article › peer-review
Research output: Contribution to Journal/Magazine › Journal article › peer-review
}
TY - JOUR
T1 - Distributed service caching with deep reinforcement learning for sustainable edge computing in large-scale AI
AU - Liu, Wei
AU - Bilal, Muhammad
AU - Shi, Yuzhe
AU - Xu, Xiaolong
PY - 2024/11/17
Y1 - 2024/11/17
N2 - Increasing reliance on large-scale AI models has led to rising demand for intelligent services. The centralized cloud computing approach has limitations in terms of data transfer efficiency and response time, and as a result many service providers have begun to deploy edge servers to cache intelligent services in order to reduce transmission delay and communication energy consumption. However, finding the optimal service caching strategy remains a significant challenge due to the stochastic nature of service requests and the bulky nature of intelligent services. To deal with this we propose a distributed service caching scheme integrating deep reinforcement learning (DRL) with mobility prediction, which we refer to as DSDM. Specifically, we employ the D3QN (Deep Double Dueling Q-Network) framework to integrate Long Short-Term Memory (LSTM) predicted mobile device locations into the service caching replacement algorithm and adopt the distributed multi-agent approach for learning and training. Experimental results demonstrate that DSDM achieves significant performance improvements in reducing communication energy consumption compared to traditional methods across various scenarios.
AB - Increasing reliance on large-scale AI models has led to rising demand for intelligent services. The centralized cloud computing approach has limitations in terms of data transfer efficiency and response time, and as a result many service providers have begun to deploy edge servers to cache intelligent services in order to reduce transmission delay and communication energy consumption. However, finding the optimal service caching strategy remains a significant challenge due to the stochastic nature of service requests and the bulky nature of intelligent services. To deal with this we propose a distributed service caching scheme integrating deep reinforcement learning (DRL) with mobility prediction, which we refer to as DSDM. Specifically, we employ the D3QN (Deep Double Dueling Q-Network) framework to integrate Long Short-Term Memory (LSTM) predicted mobile device locations into the service caching replacement algorithm and adopt the distributed multi-agent approach for learning and training. Experimental results demonstrate that DSDM achieves significant performance improvements in reducing communication energy consumption compared to traditional methods across various scenarios.
U2 - 10.1016/j.dcan.2024.11.009
DO - 10.1016/j.dcan.2024.11.009
M3 - Journal article
JO - Digital Communications and Networks
JF - Digital Communications and Networks
SN - 2352-8648
ER -