Unconstrained Face Recognition Using A Set-to-Set Distance Measure on Deep Learned Features

Computing and Communications

Associated organisational unit

Data Science

Electronic data

07936556
Rights statement: ©2017 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
Accepted author manuscript, 26.9 MB, PDF document
Available under license: CC BY-NC: Creative Commons Attribution-NonCommercial 4.0 International License

Text available via DOI:

https://doi.org/10.1109/TCSVT.2017.2710120
Final published version

Keywords

Face recognition, IJB-A, kNN-average pooling, S2S distance, Deep learning, Face recognition algorithms, Human intervention, Illumination conditions, Recognition accuracy, Similarity scores

View graph of relations

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

Standard

Unconstrained Face Recognition Using A Set-to-Set Distance Measure on Deep Learned Features. / Zhao, Jiaojiao; Han, Jungong; Shao, Ling.
In: IEEE Transactions on Circuits and Systems for Video Technology, Vol. 28, No. 10, 10.2018, p. 2679-2689.

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Harvard

Zhao, J, Han, J & Shao, L 2018, 'Unconstrained Face Recognition Using A Set-to-Set Distance Measure on Deep Learned Features', IEEE Transactions on Circuits and Systems for Video Technology, vol. 28, no. 10, pp. 2679-2689. https://doi.org/10.1109/TCSVT.2017.2710120

APA

Zhao, J., Han, J., & Shao, L. (2018). Unconstrained Face Recognition Using A Set-to-Set Distance Measure on Deep Learned Features. IEEE Transactions on Circuits and Systems for Video Technology, 28(10), 2679-2689. https://doi.org/10.1109/TCSVT.2017.2710120

Vancouver

Zhao J, Han J, Shao L. Unconstrained Face Recognition Using A Set-to-Set Distance Measure on Deep Learned Features. IEEE Transactions on Circuits and Systems for Video Technology. 2018 Oct;28(10):2679-2689. Epub 2017 May 31. doi: 10.1109/TCSVT.2017.2710120

Author

Zhao, Jiaojiao ; Han, Jungong ; Shao, Ling. / Unconstrained Face Recognition Using A Set-to-Set Distance Measure on Deep Learned Features. In: IEEE Transactions on Circuits and Systems for Video Technology. 2018 ; Vol. 28, No. 10. pp. 2679-2689.

Bibtex

@article{94388b5aba2f4d8f9546c15a3782cdd2,

title = "Unconstrained Face Recognition Using A Set-to-Set Distance Measure on Deep Learned Features",

abstract = "Recently considerable efforts have been dedicated to unconstrained face recognition, which requires to identify faces {"}in the wild{"} for a set of images and/or video frames captured without human intervention. Unlike traditional face recognition that compares one-to-one medium (either a single image or a video frame) only, we consider a problem of matching sets with heterogeneous contents of both images and videos. In this paper, we propose a novel Set-to-Set (S2S) distance measure to calculate the similarity between two sets with the aim to improve the accuracy of face recognition in real-world situations such as extreme poses or severe illumination conditions. Our S2S distance adopts the kNN-average pooling for the similarity scores computed on all the media in two sets, making the identification far less susceptible to the poor representations (outliers) than traditional feature-average pooling and score-average pooling. Furthermore, we show that various metrics can be embedded into our S2S distance framework, including both predefined and learned ones. This allows to choose the appropriate metric depending on the recognition task in order to achieve the best results. To evaluate the proposed S2S distance, we conduct extensive experiments on the challenging set-based IJB-A face dataset, which demonstrate that our algorithm achieves the stateof- the-art results and is clearly superior to the baselines including several deep learning based face recognition algorithms.",

keywords = "Face recognition, IJB-A, kNN-average pooling, S2S distance, Deep learning, Face recognition algorithms, Human intervention, Illumination conditions, Recognition accuracy, Similarity scores",

author = "Jiaojiao Zhao and Jungong Han and Ling Shao",

note = "{\textcopyright}2017 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.",

year = "2018",

month = oct,

doi = "10.1109/TCSVT.2017.2710120",

language = "English",

volume = "28",

pages = "2679--2689",

journal = "IEEE Transactions on Circuits and Systems for Video Technology",

issn = "1051-8215",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "10",

}

RIS

TY - JOUR

T1 - Unconstrained Face Recognition Using A Set-to-Set Distance Measure on Deep Learned Features

AU - Zhao, Jiaojiao

AU - Han, Jungong

AU - Shao, Ling

N1 - ©2017 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

PY - 2018/10

Y1 - 2018/10

N2 - Recently considerable efforts have been dedicated to unconstrained face recognition, which requires to identify faces "in the wild" for a set of images and/or video frames captured without human intervention. Unlike traditional face recognition that compares one-to-one medium (either a single image or a video frame) only, we consider a problem of matching sets with heterogeneous contents of both images and videos. In this paper, we propose a novel Set-to-Set (S2S) distance measure to calculate the similarity between two sets with the aim to improve the accuracy of face recognition in real-world situations such as extreme poses or severe illumination conditions. Our S2S distance adopts the kNN-average pooling for the similarity scores computed on all the media in two sets, making the identification far less susceptible to the poor representations (outliers) than traditional feature-average pooling and score-average pooling. Furthermore, we show that various metrics can be embedded into our S2S distance framework, including both predefined and learned ones. This allows to choose the appropriate metric depending on the recognition task in order to achieve the best results. To evaluate the proposed S2S distance, we conduct extensive experiments on the challenging set-based IJB-A face dataset, which demonstrate that our algorithm achieves the stateof- the-art results and is clearly superior to the baselines including several deep learning based face recognition algorithms.

AB - Recently considerable efforts have been dedicated to unconstrained face recognition, which requires to identify faces "in the wild" for a set of images and/or video frames captured without human intervention. Unlike traditional face recognition that compares one-to-one medium (either a single image or a video frame) only, we consider a problem of matching sets with heterogeneous contents of both images and videos. In this paper, we propose a novel Set-to-Set (S2S) distance measure to calculate the similarity between two sets with the aim to improve the accuracy of face recognition in real-world situations such as extreme poses or severe illumination conditions. Our S2S distance adopts the kNN-average pooling for the similarity scores computed on all the media in two sets, making the identification far less susceptible to the poor representations (outliers) than traditional feature-average pooling and score-average pooling. Furthermore, we show that various metrics can be embedded into our S2S distance framework, including both predefined and learned ones. This allows to choose the appropriate metric depending on the recognition task in order to achieve the best results. To evaluate the proposed S2S distance, we conduct extensive experiments on the challenging set-based IJB-A face dataset, which demonstrate that our algorithm achieves the stateof- the-art results and is clearly superior to the baselines including several deep learning based face recognition algorithms.

KW - Face recognition

KW - IJB-A

KW - kNN-average pooling

KW - S2S distance

KW - Deep learning

KW - Face recognition algorithms

KW - Human intervention

KW - Illumination conditions

KW - Recognition accuracy

KW - Similarity scores

U2 - 10.1109/TCSVT.2017.2710120

DO - 10.1109/TCSVT.2017.2710120

M3 - Journal article

VL - 28

SP - 2679

EP - 2689

JO - IEEE Transactions on Circuits and Systems for Video Technology

JF - IEEE Transactions on Circuits and Systems for Video Technology

SN - 1051-8215

IS - 10

ER -

Research

Associated organisational unit

Electronic data

Links

Text available via DOI:

Keywords