
Electronic data

  • Accepted Manuscript

    Rights statement: © ACM, 2022. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in ETRA '22: 2022 Symposium on Eye Tracking Research and Applications https://dl.acm.org/doi/10.1145/3517031.3529642

    Accepted author manuscript, 1.52 MB, PDF document

    Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Links

Text available via DOI: https://dl.acm.org/doi/10.1145/3517031.3529642


Real-time head-based deep-learning model for gaze probability regions in collaborative VR

Research output: Contribution in Book/Report/Proceedings - With ISBN/ISSN › Conference contribution/Paper › peer-review

Published
Publication date: 11/06/2022
Host publication: ACM Symposium on Eye Tracking Research and Applications
Place of Publication: New York
Publisher: ACM
Number of pages: 8
ISBN (electronic): 9781450392525
Original language: English
Event: ETRA '22: 2022 Symposium on Eye Tracking Research and Applications - Seattle Children’s Building Cure, Seattle, United States
Duration: 8/06/2022 - 11/06/2022
https://etra.acm.org/2022/

Symposium

Symposium: ETRA '22: 2022 Symposium on Eye Tracking Research and Applications
Country/Territory: United States
City: Seattle
Period: 8/06/22 - 11/06/22
Internet address: https://etra.acm.org/2022/

Abstract

Eye behaviour has gained much interest in the VR research community as an interaction input and as support for collaboration. Researchers have implemented gaze inference models that use head behaviour and saliency when eye tracking is unavailable. However, these solutions are resource-demanding and thus unfit for untethered devices, and their angular accuracy is around 7°, which can be a problem in high-density informative areas. To address this issue, we propose a lightweight deep-learning model that generates the probability density function of the gaze as a percentile contour. This solution allows us to introduce a visual attention representation based on a region rather than a point, and to manage the trade-off between the ambiguity of a region and the error of a point. We tested our model on untethered devices with real-time performance and evaluated its accuracy, which outperforms our identified baselines (average fixation map and head direction).
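The abstract describes turning a predicted gaze probability density into a percentile contour, i.e. a region of likely gaze rather than a single gaze point. As a minimal illustrative sketch (not the authors' implementation), the Python below shows one way to extract such a region from a 2D probability map by keeping the highest-probability cells until a chosen probability mass is covered; the 64x64 grid, the 90% mass level, and the function name are assumptions for illustration.

    # Illustrative sketch only, not the authors' code: extract a percentile
    # contour (highest-density region) from a predicted 2D gaze probability map.
    # Grid size and the 90% mass level are assumed for the example.
    import numpy as np

    def gaze_probability_region(prob_map, mass=0.9):
        """Boolean mask of the smallest set of grid cells whose summed
        probability reaches `mass`."""
        flat = prob_map.ravel()
        order = np.argsort(flat)[::-1]          # cells by descending probability
        cumulative = np.cumsum(flat[order])
        n_cells = np.searchsorted(cumulative, mass) + 1
        mask = np.zeros(flat.size, dtype=bool)
        mask[order[:n_cells]] = True
        return mask.reshape(prob_map.shape)

    # Example with a synthetic probability map over a 64 x 64 grid of gaze angles.
    rng = np.random.default_rng(0)
    logits = rng.normal(size=(64, 64))
    prob_map = np.exp(logits) / np.exp(logits).sum()
    region = gaze_probability_region(prob_map, mass=0.9)
    print(f"90% probability region covers {region.mean():.1%} of the grid")

Raising the mass level widens the region (more ambiguity, less chance of missing the true gaze point), which reflects the region-versus-point trade-off the abstract mentions.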

Bibliographic note

© ACM, 2022. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in ETRA '22: 2022 Symposium on Eye Tracking Research and Applications https://dl.acm.org/doi/10.1145/3517031.3529642