Meta Spatio-Temporal Debiasing for Video Scene Graph Generation

Computing and Communications

Text available via DOI:

https://doi.org/10.1007/978-3-031-19812-0_22
Final published version

Keywords

Long-tailed bias, Meta learning, VidSGG

View graph of relations

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Published

Li Xu
Haoxuan Qu
Jason Kuen
Jiuxiang Gu
Jun Liu

More...

Publication date	30/10/2022
Host publication	Computer Vision – ECCV 2022 - 17th European Conference, 2022, Proceedings
Editors	Shai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, Tal Hassner
Place of Publication	Cham
Publisher	Springer
Pages	374-390
Number of pages	17
ISBN (electronic)	9783031198120
ISBN (print)	9783031198113
<mark>Original language</mark>	English
Event	17th European Conference on Computer Vision, ECCV 2022 - Tel Aviv, Israel Duration: 23/10/2022 → 27/10/2022

Conference

Conference	17th European Conference on Computer Vision, ECCV 2022
Country/Territory	Israel
City	Tel Aviv
Period	23/10/22 → 27/10/22

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	13687 LNCS
ISSN (Print)	0302-9743
ISSN (electronic)	1611-3349

Conference

Conference	17th European Conference on Computer Vision, ECCV 2022
Country/Territory	Israel
City	Tel Aviv
Period	23/10/22 → 27/10/22

Abstract

Video scene graph generation (VidSGG) aims to parse the video content into scene graphs, which involves modeling the spatio-temporal contextual information in the video. However, due to the long-tailed training data in datasets, the generalization performance of existing VidSGG models can be affected by the spatio-temporal conditional bias problem. In this work, from the perspective of meta-learning, we propose a novel Meta Video Scene Graph Generation (MVSGG) framework to address such a bias problem. Specifically, to handle various types of spatio-temporal conditional biases, our framework first constructs a support set and a group of query sets from the training data, where the data distribution of each query set is different from that of the support set w.r.t. a type of conditional bias. Then, by performing a novel meta training and testing process to optimize the model to obtain good testing performance on these query sets after training on the support set, our framework can effectively guide the model to learn to well generalize against biases. Extensive experiments demonstrate the efficacy of our proposed framework.

Research

Links

Text available via DOI:

Keywords