Home > Research > Publications & Outputs > Deep learning based object detection from multi...

Links

Text available via DOI:

View graph of relations

Deep learning based object detection from multi-modal sensors: an overview

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

Deep learning based object detection from multi-modal sensors: an overview. / Liu, Ye; Meng, Shiyang; Wang, Hongzhang et al.
In: Multimedia Tools and Applications, Vol. 83, No. 7, 29.02.2024, p. 19841-19870.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

Liu, Y, Meng, S, Wang, H & Liu, J 2024, 'Deep learning based object detection from multi-modal sensors: an overview', Multimedia Tools and Applications, vol. 83, no. 7, pp. 19841-19870. https://doi.org/10.1007/s11042-023-16275-z

APA

Liu, Y., Meng, S., Wang, H., & Liu, J. (2024). Deep learning based object detection from multi-modal sensors: an overview. Multimedia Tools and Applications, 83(7), 19841-19870. https://doi.org/10.1007/s11042-023-16275-z

Vancouver

Liu Y, Meng S, Wang H, Liu J. Deep learning based object detection from multi-modal sensors: an overview. Multimedia Tools and Applications. 2024 Feb 29;83(7):19841-19870. Epub 2023 Jul 28. doi: 10.1007/s11042-023-16275-z

Author

Liu, Ye ; Meng, Shiyang ; Wang, Hongzhang et al. / Deep learning based object detection from multi-modal sensors : an overview. In: Multimedia Tools and Applications. 2024 ; Vol. 83, No. 7. pp. 19841-19870.

Bibtex

@article{fb3e72439a29485b9d67a38ed19a73b1,
title = "Deep learning based object detection from multi-modal sensors: an overview",
abstract = "Object detection is an important problem and has a wide range of applications. In recent years, deep learning based object detection with conventional RGB cameras has made great progress. At the same time, people are more and more aware of the limitations of RGB cameras. The progress of algorithms alone can not fundamentally resolve the challenges of object detection. Unmanned vehicles or mobile robot platforms are often equipped with a variety of sensors in addition to RGB camera, each of which have its own characteristics, and can expand the sensing range of RGB camera from different dimensions. For example, infrared thermal imaging camera and multispectral camera broaden sensing range from spectral dimension, while LiDARs and depth cameras are able to broaden sensing range from the spatial dimension. This paper mainly summarizes the deep learning based object detection methods under the condition of multi-modal sensors, and surveys and categorizes the methods from the perspective of data fusion manner. The datasets of different modality are summarized, and the advantages and disadvantages with different combination of sensors are also discussed in this paper.",
keywords = "Deep learning, Multi-modal, Object detection, Sensor fusion",
author = "Ye Liu and Shiyang Meng and Hongzhang Wang and Jun Liu",
note = "Publisher Copyright: {\textcopyright} The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023.",
year = "2024",
month = feb,
day = "29",
doi = "10.1007/s11042-023-16275-z",
language = "English",
volume = "83",
pages = "19841--19870",
journal = "Multimedia Tools and Applications",
issn = "1380-7501",
publisher = "Springer Netherlands",
number = "7",

}

RIS

TY - JOUR

T1 - Deep learning based object detection from multi-modal sensors

T2 - an overview

AU - Liu, Ye

AU - Meng, Shiyang

AU - Wang, Hongzhang

AU - Liu, Jun

N1 - Publisher Copyright: © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023.

PY - 2024/2/29

Y1 - 2024/2/29

N2 - Object detection is an important problem and has a wide range of applications. In recent years, deep learning based object detection with conventional RGB cameras has made great progress. At the same time, people are more and more aware of the limitations of RGB cameras. The progress of algorithms alone can not fundamentally resolve the challenges of object detection. Unmanned vehicles or mobile robot platforms are often equipped with a variety of sensors in addition to RGB camera, each of which have its own characteristics, and can expand the sensing range of RGB camera from different dimensions. For example, infrared thermal imaging camera and multispectral camera broaden sensing range from spectral dimension, while LiDARs and depth cameras are able to broaden sensing range from the spatial dimension. This paper mainly summarizes the deep learning based object detection methods under the condition of multi-modal sensors, and surveys and categorizes the methods from the perspective of data fusion manner. The datasets of different modality are summarized, and the advantages and disadvantages with different combination of sensors are also discussed in this paper.

AB - Object detection is an important problem and has a wide range of applications. In recent years, deep learning based object detection with conventional RGB cameras has made great progress. At the same time, people are more and more aware of the limitations of RGB cameras. The progress of algorithms alone can not fundamentally resolve the challenges of object detection. Unmanned vehicles or mobile robot platforms are often equipped with a variety of sensors in addition to RGB camera, each of which have its own characteristics, and can expand the sensing range of RGB camera from different dimensions. For example, infrared thermal imaging camera and multispectral camera broaden sensing range from spectral dimension, while LiDARs and depth cameras are able to broaden sensing range from the spatial dimension. This paper mainly summarizes the deep learning based object detection methods under the condition of multi-modal sensors, and surveys and categorizes the methods from the perspective of data fusion manner. The datasets of different modality are summarized, and the advantages and disadvantages with different combination of sensors are also discussed in this paper.

KW - Deep learning

KW - Multi-modal

KW - Object detection

KW - Sensor fusion

U2 - 10.1007/s11042-023-16275-z

DO - 10.1007/s11042-023-16275-z

M3 - Journal article

AN - SCOPUS:85165938559

VL - 83

SP - 19841

EP - 19870

JO - Multimedia Tools and Applications

JF - Multimedia Tools and Applications

SN - 1380-7501

IS - 7

ER -