Home > Research > Publications & Outputs > Semantic Segmentation of Terrestrial Laser Scan...

Electronic data

  • 3 Manuscript_Final

    Rights statement: ©2022 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

    Accepted author manuscript, 1.4 MB, PDF document

    Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Links

Text available via DOI:

View graph of relations

Semantic Segmentation of Terrestrial Laser Scanning Point Clouds Using Locally Enhanced Image-Based Geometric Representations

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published

Standard

Semantic Segmentation of Terrestrial Laser Scanning Point Clouds Using Locally Enhanced Image-Based Geometric Representations. / Cai, Yuanzhi; Fan, Lei; Atkinson, Peter M. et al.
In: IEEE Transactions on Geoscience and Remote Sensing, Vol. 60, 5702815, 24.03.2022, p. 1-15.

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Harvard

APA

Vancouver

Cai Y, Fan L, Atkinson PM, Zhang C. Semantic Segmentation of Terrestrial Laser Scanning Point Clouds Using Locally Enhanced Image-Based Geometric Representations. IEEE Transactions on Geoscience and Remote Sensing. 2022 Mar 24;60:1-15. 5702815. doi: 10.1109/tgrs.2022.3161982

Author

Cai, Yuanzhi ; Fan, Lei ; Atkinson, Peter M. et al. / Semantic Segmentation of Terrestrial Laser Scanning Point Clouds Using Locally Enhanced Image-Based Geometric Representations. In: IEEE Transactions on Geoscience and Remote Sensing. 2022 ; Vol. 60. pp. 1-15.

Bibtex

@article{c5d6d5cc2b96490db384bfeb22880a97,
title = "Semantic Segmentation of Terrestrial Laser Scanning Point Clouds Using Locally Enhanced Image-Based Geometric Representations",
abstract = "Point cloud data acquired using terrestrial laser scanning (TLS) often need to be semantically segmented to support many applications. To this end, various point-, voxel-, and image-based methods have been developed. For large-scale point cloud data, the former two types of methods often require extensive computational effort. In contrast, image-based methods are favorable from the perspective of computational efficiency. However, existing image-based methods are highly dependent on RGB information and do not provide an effective means of representing and utilizing the local geometric characteristics of point cloud data in images. This not only limits the overall segmentation accuracy but also prohibits their application to situations where the RGB information is absent. To overcome such issues, this research proposes a novel image enhancement method to reveal the local geometric characteristics in images derived by the projection of the point cloud coordinates. Based on this method, various feature channel combinations were investigated experimentally. It was found that the new combination IZeDe (i.e., intensity, enhanced Z -coordinate, and enhanced range images) outperformed the conventional I RGB and I RGB D channel combinations. As such, the approach can be used to replace the RGB channels for semantic segmentation. Using this new combination and the pretrained HR-EHNet considered, a mean intersection over union (mIoU) of 74.2% and an overall accuracy (OA) of 92.1% were achieved on the Semantic3D benchmark, which sets a new state of the art (SOTA) for the semantic segmentation accuracy of image-based methods.",
keywords = "General Earth and Planetary Sciences, Electrical and Electronic Engineering",
author = "Yuanzhi Cai and Lei Fan and Atkinson, {Peter M.} and Cheng Zhang",
note = "{\textcopyright}2022 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.",
year = "2022",
month = mar,
day = "24",
doi = "10.1109/tgrs.2022.3161982",
language = "English",
volume = "60",
pages = "1--15",
journal = "IEEE Transactions on Geoscience and Remote Sensing",
issn = "0196-2892",
publisher = "IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC",

}

RIS

TY - JOUR

T1 - Semantic Segmentation of Terrestrial Laser Scanning Point Clouds Using Locally Enhanced Image-Based Geometric Representations

AU - Cai, Yuanzhi

AU - Fan, Lei

AU - Atkinson, Peter M.

AU - Zhang, Cheng

N1 - ©2022 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

PY - 2022/3/24

Y1 - 2022/3/24

N2 - Point cloud data acquired using terrestrial laser scanning (TLS) often need to be semantically segmented to support many applications. To this end, various point-, voxel-, and image-based methods have been developed. For large-scale point cloud data, the former two types of methods often require extensive computational effort. In contrast, image-based methods are favorable from the perspective of computational efficiency. However, existing image-based methods are highly dependent on RGB information and do not provide an effective means of representing and utilizing the local geometric characteristics of point cloud data in images. This not only limits the overall segmentation accuracy but also prohibits their application to situations where the RGB information is absent. To overcome such issues, this research proposes a novel image enhancement method to reveal the local geometric characteristics in images derived by the projection of the point cloud coordinates. Based on this method, various feature channel combinations were investigated experimentally. It was found that the new combination IZeDe (i.e., intensity, enhanced Z -coordinate, and enhanced range images) outperformed the conventional I RGB and I RGB D channel combinations. As such, the approach can be used to replace the RGB channels for semantic segmentation. Using this new combination and the pretrained HR-EHNet considered, a mean intersection over union (mIoU) of 74.2% and an overall accuracy (OA) of 92.1% were achieved on the Semantic3D benchmark, which sets a new state of the art (SOTA) for the semantic segmentation accuracy of image-based methods.

AB - Point cloud data acquired using terrestrial laser scanning (TLS) often need to be semantically segmented to support many applications. To this end, various point-, voxel-, and image-based methods have been developed. For large-scale point cloud data, the former two types of methods often require extensive computational effort. In contrast, image-based methods are favorable from the perspective of computational efficiency. However, existing image-based methods are highly dependent on RGB information and do not provide an effective means of representing and utilizing the local geometric characteristics of point cloud data in images. This not only limits the overall segmentation accuracy but also prohibits their application to situations where the RGB information is absent. To overcome such issues, this research proposes a novel image enhancement method to reveal the local geometric characteristics in images derived by the projection of the point cloud coordinates. Based on this method, various feature channel combinations were investigated experimentally. It was found that the new combination IZeDe (i.e., intensity, enhanced Z -coordinate, and enhanced range images) outperformed the conventional I RGB and I RGB D channel combinations. As such, the approach can be used to replace the RGB channels for semantic segmentation. Using this new combination and the pretrained HR-EHNet considered, a mean intersection over union (mIoU) of 74.2% and an overall accuracy (OA) of 92.1% were achieved on the Semantic3D benchmark, which sets a new state of the art (SOTA) for the semantic segmentation accuracy of image-based methods.

KW - General Earth and Planetary Sciences

KW - Electrical and Electronic Engineering

U2 - 10.1109/tgrs.2022.3161982

DO - 10.1109/tgrs.2022.3161982

M3 - Journal article

VL - 60

SP - 1

EP - 15

JO - IEEE Transactions on Geoscience and Remote Sensing

JF - IEEE Transactions on Geoscience and Remote Sensing

SN - 0196-2892

M1 - 5702815

ER -