Rights statement: ©2022 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
Accepted author manuscript, 1.26 MB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License
Final published version
Research output: Contribution to Journal/Magazine › Journal article › peer-review
Research output: Contribution to Journal/Magazine › Journal article › peer-review
}
TY - JOUR
T1 - Deep Reinforcement Learning-Based Control Framework for Multilateral Telesurgery
AU - Bacha, Sarah
AU - Bai, Weibang
AU - Wang, Ziwei
AU - Xiao, Bo
AU - Yeatman, Eric
N1 - ©2022 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
PY - 2022/5/31
Y1 - 2022/5/31
N2 - The upper boundary of time delay is often required in traditional telesurgery control design, which would result in infeasibility of telesurgery across regions. To overcome this issue, this paper introduces a new control framework based on deep deterministic policy gradient (DDPG) reinforcement learning (RL) algorithm. The developed framework effectively overcomes the phase difference and data loss caused by time delays, which facilitates the restoration of surgeon’s intention and interactive force. Kalman filter (KF) is employed to blend multiple surgeons’ commands and predict the final local commands, respectively. The control framework ensures synchronization tracking performance and transparency. Prior knowledge of time delay is therefore not required. Simulation and experiment results have demonstrated the merits of the proposed framework.
AB - The upper boundary of time delay is often required in traditional telesurgery control design, which would result in infeasibility of telesurgery across regions. To overcome this issue, this paper introduces a new control framework based on deep deterministic policy gradient (DDPG) reinforcement learning (RL) algorithm. The developed framework effectively overcomes the phase difference and data loss caused by time delays, which facilitates the restoration of surgeon’s intention and interactive force. Kalman filter (KF) is employed to blend multiple surgeons’ commands and predict the final local commands, respectively. The control framework ensures synchronization tracking performance and transparency. Prior knowledge of time delay is therefore not required. Simulation and experiment results have demonstrated the merits of the proposed framework.
U2 - 10.1109/TMRB.2022.3170786
DO - 10.1109/TMRB.2022.3170786
M3 - Journal article
VL - 4
SP - 352
EP - 355
JO - IEEE Transactions on Medical Robotics and Bionics
JF - IEEE Transactions on Medical Robotics and Bionics
SN - 2576-3202
IS - 2
ER -