Home > Research > Publications & Outputs > REMOTE

Electronic data

  • AAAI_22_Pose_Estimation

    Accepted author manuscript, 5.48 MB, PDF document

    Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Links

View graph of relations

REMOTE: Reinforced Motion Transformation Network for Semi-supervised 2D Pose Estimation in Videos

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Published

Standard

REMOTE: Reinforced Motion Transformation Network for Semi-supervised 2D Pose Estimation in Videos. / Ma, Xianzheng; Rahmani, Hossein; Fan, Zhipeng et al.
Proceedings of the 36th AAAI Conference on Artificial Intelligence. Palo Alto, Calif. : AAAI press, 2022. p. 1944-1952.

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Harvard

Ma, X, Rahmani, H, Fan, Z, Yang, B, Cheng, J & Liu, J 2022, REMOTE: Reinforced Motion Transformation Network for Semi-supervised 2D Pose Estimation in Videos. in Proceedings of the 36th AAAI Conference on Artificial Intelligence. AAAI press, Palo Alto, Calif. , pp. 1944-1952, 36th AAAI Conference on Artificial Intelligence, 22/02/22. <https://ojs.aaai.org/index.php/AAAI/article/view/20089>

APA

Ma, X., Rahmani, H., Fan, Z., Yang, B., Cheng, J., & Liu, J. (2022). REMOTE: Reinforced Motion Transformation Network for Semi-supervised 2D Pose Estimation in Videos. In Proceedings of the 36th AAAI Conference on Artificial Intelligence (pp. 1944-1952). AAAI press. https://ojs.aaai.org/index.php/AAAI/article/view/20089

Vancouver

Ma X, Rahmani H, Fan Z, Yang B, Cheng J, Liu J. REMOTE: Reinforced Motion Transformation Network for Semi-supervised 2D Pose Estimation in Videos. In Proceedings of the 36th AAAI Conference on Artificial Intelligence. Palo Alto, Calif. : AAAI press. 2022. p. 1944-1952 Epub 2022 Mar 1.

Author

Ma, Xianzheng ; Rahmani, Hossein ; Fan, Zhipeng et al. / REMOTE : Reinforced Motion Transformation Network for Semi-supervised 2D Pose Estimation in Videos. Proceedings of the 36th AAAI Conference on Artificial Intelligence. Palo Alto, Calif. : AAAI press, 2022. pp. 1944-1952

Bibtex

@inproceedings{759118a8444b42b6b3b9b8ae7e2a8e23,
title = "REMOTE: Reinforced Motion Transformation Network for Semi-supervised 2D Pose Estimation in Videos",
abstract = "Existing approaches for 2D pose estimation in videos often require a large number of dense annotations, which are costly and labor intensive to acquire. In this paper, we propose a semi-supervised REinforced MOtion Transformation nEtwork (REMOTE) to leverage a few labeled frames and temporal pose variations in videos, which enables effective learning of 2D pose estimation in sparsely annotated videos. Specifically, we introduce a Motion Transformer (MT) module to perform cross frame reconstruction, aiming to learn motion dynamic knowledge in videos. Besides, a novel reinforcement learning-based Frame Selection Agent (FSA) is designed within our framework, which is able to harness informative frame pairs on the fly to enhance the pose estimator under our cross reconstruction mechanism. We conduct extensive experiments that show the efficacy of our proposed REMOTE framework.",
author = "Xianzheng Ma and Hossein Rahmani and Zhipeng Fan and Bin Yang and Jun Cheng and Jun Liu",
year = "2022",
month = jun,
day = "28",
language = "English",
pages = "1944--1952",
booktitle = "Proceedings of the 36th AAAI Conference on Artificial Intelligence",
publisher = "AAAI press",
note = "36th AAAI Conference on Artificial Intelligence ; Conference date: 22-02-2022 Through 01-03-2022",
url = "https://aaai.org/Conferences/AAAI-22/",

}

RIS

TY - GEN

T1 - REMOTE

T2 - 36th AAAI Conference on Artificial Intelligence

AU - Ma, Xianzheng

AU - Rahmani, Hossein

AU - Fan, Zhipeng

AU - Yang, Bin

AU - Cheng, Jun

AU - Liu, Jun

PY - 2022/6/28

Y1 - 2022/6/28

N2 - Existing approaches for 2D pose estimation in videos often require a large number of dense annotations, which are costly and labor intensive to acquire. In this paper, we propose a semi-supervised REinforced MOtion Transformation nEtwork (REMOTE) to leverage a few labeled frames and temporal pose variations in videos, which enables effective learning of 2D pose estimation in sparsely annotated videos. Specifically, we introduce a Motion Transformer (MT) module to perform cross frame reconstruction, aiming to learn motion dynamic knowledge in videos. Besides, a novel reinforcement learning-based Frame Selection Agent (FSA) is designed within our framework, which is able to harness informative frame pairs on the fly to enhance the pose estimator under our cross reconstruction mechanism. We conduct extensive experiments that show the efficacy of our proposed REMOTE framework.

AB - Existing approaches for 2D pose estimation in videos often require a large number of dense annotations, which are costly and labor intensive to acquire. In this paper, we propose a semi-supervised REinforced MOtion Transformation nEtwork (REMOTE) to leverage a few labeled frames and temporal pose variations in videos, which enables effective learning of 2D pose estimation in sparsely annotated videos. Specifically, we introduce a Motion Transformer (MT) module to perform cross frame reconstruction, aiming to learn motion dynamic knowledge in videos. Besides, a novel reinforcement learning-based Frame Selection Agent (FSA) is designed within our framework, which is able to harness informative frame pairs on the fly to enhance the pose estimator under our cross reconstruction mechanism. We conduct extensive experiments that show the efficacy of our proposed REMOTE framework.

UR - https://www.aaai.org/AAAI22Papers/AAAI-5513.XianzhengM.pdf

M3 - Conference contribution/Paper

SP - 1944

EP - 1952

BT - Proceedings of the 36th AAAI Conference on Artificial Intelligence

PB - AAAI press

CY - Palo Alto, Calif.

Y2 - 22 February 2022 through 1 March 2022

ER -