Multiattention Network for Semantic Segmentation of Fine-Resolution Remote Sensing Images

Home > Research > Publications & Outputs > Multiattention Network for Semantic Segmentatio...

Lancaster Environment Centre

Associated organisational units

Electronic data

Multi_Attention_Network_TGRS_accepted
Rights statement: ©2021 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
Accepted author manuscript, 3.79 MB, PDF document
Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Text available via DOI:

https://doi.org/10.1109/TGRS.2021.3093977
Final published version

Keywords

fine-resolution remote sensing images, attention mechanism, semantic segmentation

View graph of relations

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

Rui Li
Shunyi Zheng
Ce Zhang
Chenxi Duan
Jianlin Su
Peter Atkinson

More...

Article number	5607713
<mark>Journal publication date</mark>	31/01/2022
<mark>Journal</mark>	IEEE Transactions on Geoscience and Remote Sensing
Volume	60
Number of pages	13
Publication Status	Published
Early online date	15/07/21
<mark>Original language</mark>	English

Abstract

Semantic segmentation of remote sensing images plays an important role in land resource management, yield estimation, and economic assessment. Although the accuracy of semantic segmentation in remote sensing images has been increased significantly by deep convolutional neural networks, there are still several limitations contained in standard models. First, for encoder-decoder architectures such as U-Net, the utilization of multi-scale features causes the overuse of information, where similar low-level features are exploited at multiple scales over multiple times. Second, long-range dependencies of feature maps are not sufficiently explored, resulting in feature representations associated with each semantic class not being optimized. Third, even though the dot-product attention mechanism has been introduced and utilized in semantic segmentation to model long-range dependencies, the high time and space complexities of attention impede the actual usage of attention in application scenarios with large-scale input. This paper proposed a Multi-Attention-Network (MANet) to handle these issues by extracting contextual dependencies through multiple efficient attention modules. A novel attention mechanism of kernel attention with linear complexity is proposed to alleviate the large computational demand in attention. We integrate local feature maps extracted by ResNeXt-101 with their corresponding global dependencies and reweight interdependent channel maps adaptively based on kernel attention and channel attention. Numerical experiments on three large-scale fine resolution remote sensing images captured by variant satellites demonstrate that the performance of the proposed MANet outperforms the DeepLab V3+, PSPNet, FastFCN, and other benchmark approaches.

Bibliographic note

©2021 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Research

Associated organisational units

Electronic data

Links

Text available via DOI:

Keywords