Robust 3-D human detection in complex environments with a depth camera

Computing and Communications

Text available via DOI:

https://doi.org/10.1109/TMM.2018.2803526
Final published version

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

Standard

Robust 3-D human detection in complex environments with a depth camera. / Tian, L.; Li, M.; Hao, Y. et al.
In: IEEE Transactions on Multimedia, Vol. 20, No. 9, 30.09.2018, p. 2249-2261.

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Harvard

Tian, L, Li, M, Hao, Y, Liu, J, Zhang, G & Chen, YQ 2018, 'Robust 3-D human detection in complex environments with a depth camera', IEEE Transactions on Multimedia, vol. 20, no. 9, pp. 2249-2261. https://doi.org/10.1109/TMM.2018.2803526

APA

Tian, L., Li, M., Hao, Y., Liu, J., Zhang, G., & Chen, Y. Q. (2018). Robust 3-D human detection in complex environments with a depth camera. IEEE Transactions on Multimedia, 20(9), 2249-2261. https://doi.org/10.1109/TMM.2018.2803526

Vancouver

Tian L, Li M, Hao Y, Liu J, Zhang G, Chen YQ. Robust 3-D human detection in complex environments with a depth camera. IEEE Transactions on Multimedia. 2018 Sept 30;20(9):2249-2261. Epub 2018 Feb 7. doi: 10.1109/TMM.2018.2803526

Author

Tian, L. ; Li, M. ; Hao, Y. et al. / Robust 3-D human detection in complex environments with a depth camera. In: IEEE Transactions on Multimedia. 2018 ; Vol. 20, No. 9. pp. 2249-2261.

Bibtex

@article{e5a33d00f4fd4248b3973d006d2d5e57,

title = "Robust 3-D human detection in complex environments with a depth camera",

abstract = "Human detection has received great attention during the past few decades, which is yet still a challenging problem. In this paper, we focus on the problem of 3-D human detection, i.e., finding the human bodies and determining their 3-D coordinates in complex 3-D space using depth data only. Since the traditional sliding-window-based approaches for target localization are time-consuming and the recent deep-learning-based object detectors generate too many region proposals, we propose to utilize the candidate head-top locating stage to efficiently and quickly find the plausible head-top locations. In the second stage, we propose a Depth map, Multiorder depth template, and Height difference map representation encoding three channels of information for each candidate region to utilize the neural network pretrained on large-scale well-annotated datasets to classify the candidate regions. We evaluate our method on four publicly available challenging datasets. Extensive experimental results demonstrate that the proposed method is superior to the state-of-the-art methods while achieving real-time performance",

author = "L. Tian and M. Li and Y. Hao and Jun Liu and G. Zhang and Y.Q. Chen",

year = "2018",

month = sep,

day = "30",

doi = "10.1109/TMM.2018.2803526",

language = "English",

volume = "20",

pages = "2249--2261",

journal = "IEEE Transactions on Multimedia",

issn = "1520-9210",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "9",

}

RIS

TY - JOUR

T1 - Robust 3-D human detection in complex environments with a depth camera

AU - Tian, L.

AU - Li, M.

AU - Hao, Y.

AU - Liu, Jun

AU - Zhang, G.

AU - Chen, Y.Q.

PY - 2018/9/30

Y1 - 2018/9/30

N2 - Human detection has received great attention during the past few decades, which is yet still a challenging problem. In this paper, we focus on the problem of 3-D human detection, i.e., finding the human bodies and determining their 3-D coordinates in complex 3-D space using depth data only. Since the traditional sliding-window-based approaches for target localization are time-consuming and the recent deep-learning-based object detectors generate too many region proposals, we propose to utilize the candidate head-top locating stage to efficiently and quickly find the plausible head-top locations. In the second stage, we propose a Depth map, Multiorder depth template, and Height difference map representation encoding three channels of information for each candidate region to utilize the neural network pretrained on large-scale well-annotated datasets to classify the candidate regions. We evaluate our method on four publicly available challenging datasets. Extensive experimental results demonstrate that the proposed method is superior to the state-of-the-art methods while achieving real-time performance

AB - Human detection has received great attention during the past few decades, which is yet still a challenging problem. In this paper, we focus on the problem of 3-D human detection, i.e., finding the human bodies and determining their 3-D coordinates in complex 3-D space using depth data only. Since the traditional sliding-window-based approaches for target localization are time-consuming and the recent deep-learning-based object detectors generate too many region proposals, we propose to utilize the candidate head-top locating stage to efficiently and quickly find the plausible head-top locations. In the second stage, we propose a Depth map, Multiorder depth template, and Height difference map representation encoding three channels of information for each candidate region to utilize the neural network pretrained on large-scale well-annotated datasets to classify the candidate regions. We evaluate our method on four publicly available challenging datasets. Extensive experimental results demonstrate that the proposed method is superior to the state-of-the-art methods while achieving real-time performance

U2 - 10.1109/TMM.2018.2803526

DO - 10.1109/TMM.2018.2803526

M3 - Journal article

VL - 20

SP - 2249

EP - 2261

JO - IEEE Transactions on Multimedia

JF - IEEE Transactions on Multimedia

SN - 1520-9210

IS - 9

ER -

Research

Links

Text available via DOI: