Home > Research > Publications & Outputs > Chaotic World

Links

Text available via DOI:

View graph of relations

Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Published

Standard

Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events. / Ong, Kian Eng; Ng, Xun Long; Li, Yanchao et al.
2023 IEEE/CVF International Conference on Computer Vision (ICCV). Institute of Electrical and Electronics Engineers Inc., 2024. p. 20156-20166 (Proceedings of the IEEE International Conference on Computer Vision).

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSNConference contribution/Paperpeer-review

Harvard

Ong, KE, Ng, XL, Li, Y, Ai, W, Zhao, K, Yeo, SY & Liu, J 2024, Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events. in 2023 IEEE/CVF International Conference on Computer Vision (ICCV). Proceedings of the IEEE International Conference on Computer Vision, Institute of Electrical and Electronics Engineers Inc., pp. 20156-20166, 2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023, Paris, France, 2/10/23. https://doi.org/10.1109/ICCV51070.2023.01849

APA

Ong, K. E., Ng, X. L., Li, Y., Ai, W., Zhao, K., Yeo, S. Y., & Liu, J. (2024). Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events. In 2023 IEEE/CVF International Conference on Computer Vision (ICCV) (pp. 20156-20166). (Proceedings of the IEEE International Conference on Computer Vision). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICCV51070.2023.01849

Vancouver

Ong KE, Ng XL, Li Y, Ai W, Zhao K, Yeo SY et al. Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events. In 2023 IEEE/CVF International Conference on Computer Vision (ICCV). Institute of Electrical and Electronics Engineers Inc. 2024. p. 20156-20166. (Proceedings of the IEEE International Conference on Computer Vision). Epub 2023 Jun 1. doi: 10.1109/ICCV51070.2023.01849

Author

Ong, Kian Eng ; Ng, Xun Long ; Li, Yanchao et al. / Chaotic World : A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events. 2023 IEEE/CVF International Conference on Computer Vision (ICCV). Institute of Electrical and Electronics Engineers Inc., 2024. pp. 20156-20166 (Proceedings of the IEEE International Conference on Computer Vision).

Bibtex

@inproceedings{758e1c72148e4cb790d8fba9908d0b7e,
title = "Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic Events",
abstract = "Understanding and analyzing human behaviors (actions and interactions of people), voices, and sounds in chaotic events is crucial in many applications, e.g., crowd management, emergency response services. Different from human behaviors in daily life, human behaviors in chaotic events are generally different in how they behave and influence others, and hence are often much more complex. However, currently there is lack of a large video dataset for analyzing human behaviors in chaotic situations. To this end, we create the first large and challenging multi-modal dataset, Chaotic World, that simultaneously provides different levels of fine-grained and dense spatio-temporal annotations of sounds, individual actions and group interaction graphs, and even text descriptions for each scene in each video, thereby enabling a thorough analysis of complicated behaviors in crowds and chaos. Our dataset consists of a total of 299,923 annotated instances for detecting human behaviors for Spatiotemporal Action Localization in chaotic events, 224,275 instances for identifying interactions between people for Behavior Graph Analysis in chaotic events, 336,390 instances for localizing relevant scenes of interest in long videos for Spatiotemporal Event Grounding, and 378,093 instances for triangulating the source of sound for Event Sound Source Localization. Given the practical complexity and challenges in chaotic events (e.g., large crowds, serious occlusions, complicated interaction patterns), our dataset shall be able to facilitate the community to develop, adapt, and evaluate various types of advanced models for analyzing human behaviors in chaotic events. We also design a simple yet effective IntelliCare model with a Dynamic Knowledge Pathfinder module that intelligently learns from multiple tasks and can analyze various aspects of a chaotic scene in a unified architecture. This method achieves promising results in experiments. Dataset and code can be found at https://github.com/sutdcv/Chaotic-World.",
author = "Ong, {Kian Eng} and Ng, {Xun Long} and Yanchao Li and Wenjie Ai and Kuangyi Zhao and Yeo, {Si Yong} and Jun Liu",
note = "Publisher Copyright: {\textcopyright} 2023 IEEE.; 2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023 ; Conference date: 02-10-2023 Through 06-10-2023",
year = "2024",
month = jan,
day = "15",
doi = "10.1109/ICCV51070.2023.01849",
language = "English",
series = "Proceedings of the IEEE International Conference on Computer Vision",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "20156--20166",
booktitle = "2023 IEEE/CVF International Conference on Computer Vision (ICCV)",

}

RIS

TY - GEN

T1 - Chaotic World

T2 - 2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023

AU - Ong, Kian Eng

AU - Ng, Xun Long

AU - Li, Yanchao

AU - Ai, Wenjie

AU - Zhao, Kuangyi

AU - Yeo, Si Yong

AU - Liu, Jun

N1 - Publisher Copyright: © 2023 IEEE.

PY - 2024/1/15

Y1 - 2024/1/15

N2 - Understanding and analyzing human behaviors (actions and interactions of people), voices, and sounds in chaotic events is crucial in many applications, e.g., crowd management, emergency response services. Different from human behaviors in daily life, human behaviors in chaotic events are generally different in how they behave and influence others, and hence are often much more complex. However, currently there is lack of a large video dataset for analyzing human behaviors in chaotic situations. To this end, we create the first large and challenging multi-modal dataset, Chaotic World, that simultaneously provides different levels of fine-grained and dense spatio-temporal annotations of sounds, individual actions and group interaction graphs, and even text descriptions for each scene in each video, thereby enabling a thorough analysis of complicated behaviors in crowds and chaos. Our dataset consists of a total of 299,923 annotated instances for detecting human behaviors for Spatiotemporal Action Localization in chaotic events, 224,275 instances for identifying interactions between people for Behavior Graph Analysis in chaotic events, 336,390 instances for localizing relevant scenes of interest in long videos for Spatiotemporal Event Grounding, and 378,093 instances for triangulating the source of sound for Event Sound Source Localization. Given the practical complexity and challenges in chaotic events (e.g., large crowds, serious occlusions, complicated interaction patterns), our dataset shall be able to facilitate the community to develop, adapt, and evaluate various types of advanced models for analyzing human behaviors in chaotic events. We also design a simple yet effective IntelliCare model with a Dynamic Knowledge Pathfinder module that intelligently learns from multiple tasks and can analyze various aspects of a chaotic scene in a unified architecture. This method achieves promising results in experiments. Dataset and code can be found at https://github.com/sutdcv/Chaotic-World.

AB - Understanding and analyzing human behaviors (actions and interactions of people), voices, and sounds in chaotic events is crucial in many applications, e.g., crowd management, emergency response services. Different from human behaviors in daily life, human behaviors in chaotic events are generally different in how they behave and influence others, and hence are often much more complex. However, currently there is lack of a large video dataset for analyzing human behaviors in chaotic situations. To this end, we create the first large and challenging multi-modal dataset, Chaotic World, that simultaneously provides different levels of fine-grained and dense spatio-temporal annotations of sounds, individual actions and group interaction graphs, and even text descriptions for each scene in each video, thereby enabling a thorough analysis of complicated behaviors in crowds and chaos. Our dataset consists of a total of 299,923 annotated instances for detecting human behaviors for Spatiotemporal Action Localization in chaotic events, 224,275 instances for identifying interactions between people for Behavior Graph Analysis in chaotic events, 336,390 instances for localizing relevant scenes of interest in long videos for Spatiotemporal Event Grounding, and 378,093 instances for triangulating the source of sound for Event Sound Source Localization. Given the practical complexity and challenges in chaotic events (e.g., large crowds, serious occlusions, complicated interaction patterns), our dataset shall be able to facilitate the community to develop, adapt, and evaluate various types of advanced models for analyzing human behaviors in chaotic events. We also design a simple yet effective IntelliCare model with a Dynamic Knowledge Pathfinder module that intelligently learns from multiple tasks and can analyze various aspects of a chaotic scene in a unified architecture. This method achieves promising results in experiments. Dataset and code can be found at https://github.com/sutdcv/Chaotic-World.

U2 - 10.1109/ICCV51070.2023.01849

DO - 10.1109/ICCV51070.2023.01849

M3 - Conference contribution/Paper

AN - SCOPUS:85185869976

T3 - Proceedings of the IEEE International Conference on Computer Vision

SP - 20156

EP - 20166

BT - 2023 IEEE/CVF International Conference on Computer Vision (ICCV)

PB - Institute of Electrical and Electronics Engineers Inc.

Y2 - 2 October 2023 through 6 October 2023

ER -