Home > Research > Publications & Outputs > Multi-Temporal Depth Motion Maps-Based Local Bi...

Associated organisational unit

Electronic data

  • Multi-Temporal Depth Motion Maps-Based Local Binary Patterns for 3D human action recognition

    Rights statement: ©2017 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

    Accepted author manuscript, 2.9 MB, PDF document

    Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Links

Text available via DOI:

View graph of relations

Multi-Temporal Depth Motion Maps-Based Local Binary Patterns for 3D Human Action Recognition

Research output: Contribution to Journal/MagazineJournal articlepeer-review

Published
  • Chen Chen
  • Mengyang Liu
  • Hong Liu
  • Baochang Zhang
  • Jungong Han
  • Nasser Kahtarnavaz
Close
<mark>Journal publication date</mark>7/11/2017
<mark>Journal</mark>IEEE Access
Volume5
Number of pages15
Pages (from-to)22590-22604
Publication StatusPublished
Early online date2/10/17
<mark>Original language</mark>English

Abstract

This paper presents a local spatio-temporal descriptor for action recognition from depth video sequences which is capable of distinguishing similar actions as well as coping with different speeds of actions. This descriptor is based on three processing stages. In the first stage, the shape and motion cues are captured from a weighted depth sequence by temporally overlapped depth segments, leading to three improved depth motion maps (DMMs) compared to previously introduced DMMs. In the second stage, the improved DMMs are partitioned into dense patches, from which the local binary patterns histogram features are extracted to characterize local rotation invariant texture information. In the final stage, a Fisher kernel is used for generating a compact feature representation, which is then combined with a kernel-based extreme learning machine (ELM) classifier. The developed solution is applied to five public domain datasets and is extensively evaluated. The results obtained demonstrate the effectiveness of this solution as compared to the existing approaches.

Bibliographic note

©2017 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.