A Characteristic Function-Based Method for Bottom-Up Human Pose Estimation

Computing and Communications

Electronic data

Qu_A_Characteristic_Function-Based_Method_for_Bottom-Up_Human_Pose_Estimation_CVPR_2023_paper
Accepted author manuscript, 8.4 MB, PDF document

Text available via DOI:

https://doi.org/10.1109/CVPR52729.2023.01250
Final published version

View graph of relations

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Published

Standard

A Characteristic Function-Based Method for Bottom-Up Human Pose Estimation. / Qu, Haoxuan; Cai, Yujun; Foo, Lingeng et al.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023. IEEE, 2023. p. 13009-13018.

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Harvard

Qu, H, Cai, Y, Foo, L, Kumar, A & Liu, J 2023, A Characteristic Function-Based Method for Bottom-Up Human Pose Estimation. in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023. IEEE, pp. 13009-13018. https://doi.org/10.1109/CVPR52729.2023.01250

APA

Qu, H., Cai, Y., Foo, L., Kumar, A., & Liu, J. (2023). A Characteristic Function-Based Method for Bottom-Up Human Pose Estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023 (pp. 13009-13018). IEEE. https://doi.org/10.1109/CVPR52729.2023.01250

Vancouver

Qu H, Cai Y, Foo L, Kumar A, Liu J. A Characteristic Function-Based Method for Bottom-Up Human Pose Estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023. IEEE. 2023. p. 13009-13018 Epub 2023 Jun 17. doi: 10.1109/CVPR52729.2023.01250

Author

Qu, Haoxuan ; Cai, Yujun ; Foo, Lingeng et al. / A Characteristic Function-Based Method for Bottom-Up Human Pose Estimation. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023. IEEE, 2023. pp. 13009-13018

Bibtex

@inproceedings{06e241210afb4fdeac956cdcabc6c1f7,

title = "A Characteristic Function-Based Method for Bottom-Up Human Pose Estimation",

abstract = "Most recent methods formulate the task of human pose estimation as a heatmap estimation problem, and use the overall L2 loss computed from the entire heatmap to optimize the heatmap prediction. In this paper, we show that in bottom-up human pose estimation where each heatmap often contains multiple body joints, using the overall L2 loss to optimize the heatmap prediction may not be the optimal choice. This is because, minimizing the overall L2 loss cannot always lead the model to locate all the body joints across different sub-regions of the heatmap more accurately. To cope with this problem, from a novel perspective, we propose a new bottom-up human pose estimation method that optimizes the heatmap prediction via minimizing the distance between two characteristic functions respectively constructed from the predicted heatmap and the groundtruth heatmap. Our analysis presented in this paper indicates that the distance between these two characteristic functions is essentially the upper bound of the L2 losses w.r.t. sub-regions of the predicted heatmap. Therefore, via minimizing the distance between the two characteristic functions, we can optimize the model to provide a more accurate localization result for the body joints in different sub-regions of the predicted heatmap. We show the effectiveness of our proposed method through extensive experiments on the COCO dataset and the CrowdPose dataset.",

author = "Haoxuan Qu and Yujun Cai and Lingeng Foo and Ajay Kumar and Jun Liu",

year = "2023",

month = aug,

day = "22",

doi = "10.1109/CVPR52729.2023.01250",

language = "English",

isbn = "9798350301304",

pages = "13009--13018",

booktitle = "Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023",

publisher = "IEEE",

}

RIS

TY - GEN

T1 - A Characteristic Function-Based Method for Bottom-Up Human Pose Estimation

AU - Qu, Haoxuan

AU - Cai, Yujun

AU - Foo, Lingeng

AU - Kumar, Ajay

AU - Liu, Jun

PY - 2023/8/22

Y1 - 2023/8/22

N2 - Most recent methods formulate the task of human pose estimation as a heatmap estimation problem, and use the overall L2 loss computed from the entire heatmap to optimize the heatmap prediction. In this paper, we show that in bottom-up human pose estimation where each heatmap often contains multiple body joints, using the overall L2 loss to optimize the heatmap prediction may not be the optimal choice. This is because, minimizing the overall L2 loss cannot always lead the model to locate all the body joints across different sub-regions of the heatmap more accurately. To cope with this problem, from a novel perspective, we propose a new bottom-up human pose estimation method that optimizes the heatmap prediction via minimizing the distance between two characteristic functions respectively constructed from the predicted heatmap and the groundtruth heatmap. Our analysis presented in this paper indicates that the distance between these two characteristic functions is essentially the upper bound of the L2 losses w.r.t. sub-regions of the predicted heatmap. Therefore, via minimizing the distance between the two characteristic functions, we can optimize the model to provide a more accurate localization result for the body joints in different sub-regions of the predicted heatmap. We show the effectiveness of our proposed method through extensive experiments on the COCO dataset and the CrowdPose dataset.

AB - Most recent methods formulate the task of human pose estimation as a heatmap estimation problem, and use the overall L2 loss computed from the entire heatmap to optimize the heatmap prediction. In this paper, we show that in bottom-up human pose estimation where each heatmap often contains multiple body joints, using the overall L2 loss to optimize the heatmap prediction may not be the optimal choice. This is because, minimizing the overall L2 loss cannot always lead the model to locate all the body joints across different sub-regions of the heatmap more accurately. To cope with this problem, from a novel perspective, we propose a new bottom-up human pose estimation method that optimizes the heatmap prediction via minimizing the distance between two characteristic functions respectively constructed from the predicted heatmap and the groundtruth heatmap. Our analysis presented in this paper indicates that the distance between these two characteristic functions is essentially the upper bound of the L2 losses w.r.t. sub-regions of the predicted heatmap. Therefore, via minimizing the distance between the two characteristic functions, we can optimize the model to provide a more accurate localization result for the body joints in different sub-regions of the predicted heatmap. We show the effectiveness of our proposed method through extensive experiments on the COCO dataset and the CrowdPose dataset.

U2 - 10.1109/CVPR52729.2023.01250

DO - 10.1109/CVPR52729.2023.01250

M3 - Conference contribution/Paper

SN - 9798350301304

SP - 13009

EP - 13018

BT - Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023

PB - IEEE

ER -

Research

Electronic data

Links

Text available via DOI: