Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation

Computing and Communications

Associated organisational unit

Artificial Intelligence

Electronic data

2402.17891v1
Final published version, 47.9 MB, PDF document
Available under license: CC BY: Creative Commons Attribution 4.0 International License

Keywords

cs.CV

View graph of relations

Research output: Working paper › Preprint

Published

Standard

Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation. / Yang, Xinyu ; Rahmani, Hossein ; Black, Sue et al.
Arxiv, 2024.

Research output: Working paper › Preprint

Bibtex

@techreport{fd2af1edd05747a0a1e8b0b9ef67bc02,

title = "Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation",

abstract = "Class activation maps (CAMs) are commonly employed in weakly supervised semantic segmentation (WSSS) to produce pseudo-labels. Due to incomplete or excessive class activation, existing studies often resort to offline CAM refinement, introducing additional stages or proposing offline modules. This can cause optimization difficulties for single-stage methods and limit generalizability. In this study, we aim to reduce the observed CAM inconsistency and error to mitigate reliance on refinement processes. We propose an end-to-end WSSS model incorporating guided CAMs, wherein our segmentation model is trained while concurrently optimizing CAMs online. Our method, Co-training with Swapping Assignments (CoSA), leverages a dual-stream framework, where one sub-network learns from the swapped assignments generated by the other. We introduce three techniques: i) soft perplexity-based regularization to penalize uncertain regions; ii) a threshold-searching approach to dynamically revise the confidence threshold; and iii) contrastive separation to address the coexistence problem. CoSA demonstrates exceptional performance, achieving mIoU of 76.2\% and 51.0\% on VOC and COCO validation datasets, respectively, surpassing existing baselines by a substantial margin. Notably, CoSA is the first single-stage approach to outperform all existing multi-stage methods including those with additional supervision. Code is avilable at \url{https://github.com/youshyee/CoSA}.",

keywords = "cs.CV",

author = "Xinyu Yang and Hossein Rahmani and Sue Black and Williams, {Bryan M.}",

year = "2024",

month = feb,

day = "27",

language = "English",

publisher = "Arxiv",

type = "WorkingPaper",

institution = "Arxiv",

}

RIS

TY - UNPB

T1 - Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation

AU - Yang, Xinyu

AU - Rahmani, Hossein

AU - Black, Sue

AU - Williams, Bryan M.

PY - 2024/2/27

Y1 - 2024/2/27

N2 - Class activation maps (CAMs) are commonly employed in weakly supervised semantic segmentation (WSSS) to produce pseudo-labels. Due to incomplete or excessive class activation, existing studies often resort to offline CAM refinement, introducing additional stages or proposing offline modules. This can cause optimization difficulties for single-stage methods and limit generalizability. In this study, we aim to reduce the observed CAM inconsistency and error to mitigate reliance on refinement processes. We propose an end-to-end WSSS model incorporating guided CAMs, wherein our segmentation model is trained while concurrently optimizing CAMs online. Our method, Co-training with Swapping Assignments (CoSA), leverages a dual-stream framework, where one sub-network learns from the swapped assignments generated by the other. We introduce three techniques: i) soft perplexity-based regularization to penalize uncertain regions; ii) a threshold-searching approach to dynamically revise the confidence threshold; and iii) contrastive separation to address the coexistence problem. CoSA demonstrates exceptional performance, achieving mIoU of 76.2\% and 51.0\% on VOC and COCO validation datasets, respectively, surpassing existing baselines by a substantial margin. Notably, CoSA is the first single-stage approach to outperform all existing multi-stage methods including those with additional supervision. Code is avilable at \url{https://github.com/youshyee/CoSA}.

AB - Class activation maps (CAMs) are commonly employed in weakly supervised semantic segmentation (WSSS) to produce pseudo-labels. Due to incomplete or excessive class activation, existing studies often resort to offline CAM refinement, introducing additional stages or proposing offline modules. This can cause optimization difficulties for single-stage methods and limit generalizability. In this study, we aim to reduce the observed CAM inconsistency and error to mitigate reliance on refinement processes. We propose an end-to-end WSSS model incorporating guided CAMs, wherein our segmentation model is trained while concurrently optimizing CAMs online. Our method, Co-training with Swapping Assignments (CoSA), leverages a dual-stream framework, where one sub-network learns from the swapped assignments generated by the other. We introduce three techniques: i) soft perplexity-based regularization to penalize uncertain regions; ii) a threshold-searching approach to dynamically revise the confidence threshold; and iii) contrastive separation to address the coexistence problem. CoSA demonstrates exceptional performance, achieving mIoU of 76.2\% and 51.0\% on VOC and COCO validation datasets, respectively, surpassing existing baselines by a substantial margin. Notably, CoSA is the first single-stage approach to outperform all existing multi-stage methods including those with additional supervision. Code is avilable at \url{https://github.com/youshyee/CoSA}.

KW - cs.CV

M3 - Preprint

BT - Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation

PB - Arxiv

ER -

Research

Associated organisational unit

Electronic data

Links

Keywords