Optimized Projection for Hashing

Computing and Communications

Electronic data

1-s2.0-S0167865518301478-main
Rights statement: This is the author’s version of a work that was accepted for publication in Pattern Recognition Letters. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Pattern Recognition Letters, 117, 2019 https://www.sciencedirect.com/journal/pattern-recognition-letters
Accepted author manuscript, 1.51 MB, PDF document
Available under license: CC BY-NC-ND: Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License

Text available via DOI:

https://doi.org/10.1016/j.patrec.2018.04.027
Final published version

Keywords

Hashing, Quantization, Algorithm

View graph of relations

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

Standard

Optimized Projection for Hashing. / Chu, Chaoqun; Gong, Dahan; Chen, Kai et al.
In: Pattern Recognition Letters, Vol. 117, 01.2019, p. 169-178.

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Harvard

Chu, C, Gong, D, Chen, K, Guo, Y, Han, J & Ding, G 2019, 'Optimized Projection for Hashing', Pattern Recognition Letters, vol. 117, pp. 169-178. https://doi.org/10.1016/j.patrec.2018.04.027

APA

Chu, C., Gong, D., Chen, K., Guo, Y., Han, J., & Ding, G. (2019). Optimized Projection for Hashing. Pattern Recognition Letters, 117, 169-178. https://doi.org/10.1016/j.patrec.2018.04.027

Vancouver

Chu C, Gong D, Chen K, Guo Y, Han J, Ding G. Optimized Projection for Hashing. Pattern Recognition Letters. 2019 Jan;117:169-178. Epub 2018 Apr 19. doi: 10.1016/j.patrec.2018.04.027

Author

Chu, Chaoqun ; Gong, Dahan ; Chen, Kai et al. / Optimized Projection for Hashing. In: Pattern Recognition Letters. 2019 ; Vol. 117. pp. 169-178.

Bibtex

@article{be60e2dd4c954e31bcda1724bbd8b7c5,

title = "Optimized Projection for Hashing",

abstract = "Hashing, which seeks for binary codes to represent data, has drawn increasing research interest in recent years. Most existing Hashing methods follow a projection-quantization framework which first projects high-dimensional data into compact low-dimensional space and then quantifies the compact data into binary codes. The projection step plays a key role in Hashing and academia has paid considerable attention to it. Previous works have proven that a good projection should simultaneously 1) preserve important information in original data, and 2) lead to compact representation with low quantization error. However, they adopted a greedy two-step strategy to consider the above two properties separately. In this paper, we empirically show that such a two-step strategy will result in a sub-optimal solution because the optimal solution to 1) limits the feasible set for the solution to 2). We put forward a novel projection learning method for Hashing, dubbed Optimized Projection (OPH). Specifically, we propose to learn the projection in a unified formulation which can find a good trade-off such that the overall performance can be optimized. A general framework is given such that OPH can be incorporated with different Hashing methods for different situations. We also introduce an effective gradient-based optimization algorithm for OPH. We carried out extensive experiments for Hashing-based Approximate Nearest Neighbor search and Content-based Data Retrieval on six benchmark datasets. The results show that OPH significantly outperforms several state-of-the-art related Hashing methods.",

keywords = "Hashing, Quantization, Algorithm",

author = "Chaoqun Chu and Dahan Gong and Kai Chen and Yuchen Guo and Jungong Han and Guiguang Ding",

note = "This is the author{\textquoteright}s version of a work that was accepted for publication in Pattern Recognition Letters. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Pattern Recognition Letters, 117, 2019 https://www.sciencedirect.com/journal/pattern-recognition-letters",

year = "2019",

month = jan,

doi = "10.1016/j.patrec.2018.04.027",

language = "English",

volume = "117",

pages = "169--178",

journal = "Pattern Recognition Letters",

issn = "0167-8655",

publisher = "Elsevier Science B.V.",

}

RIS

TY - JOUR

T1 - Optimized Projection for Hashing

AU - Chu, Chaoqun

AU - Gong, Dahan

AU - Chen, Kai

AU - Guo, Yuchen

AU - Han, Jungong

AU - Ding, Guiguang

N1 - This is the author’s version of a work that was accepted for publication in Pattern Recognition Letters. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Pattern Recognition Letters, 117, 2019 https://www.sciencedirect.com/journal/pattern-recognition-letters

PY - 2019/1

Y1 - 2019/1

N2 - Hashing, which seeks for binary codes to represent data, has drawn increasing research interest in recent years. Most existing Hashing methods follow a projection-quantization framework which first projects high-dimensional data into compact low-dimensional space and then quantifies the compact data into binary codes. The projection step plays a key role in Hashing and academia has paid considerable attention to it. Previous works have proven that a good projection should simultaneously 1) preserve important information in original data, and 2) lead to compact representation with low quantization error. However, they adopted a greedy two-step strategy to consider the above two properties separately. In this paper, we empirically show that such a two-step strategy will result in a sub-optimal solution because the optimal solution to 1) limits the feasible set for the solution to 2). We put forward a novel projection learning method for Hashing, dubbed Optimized Projection (OPH). Specifically, we propose to learn the projection in a unified formulation which can find a good trade-off such that the overall performance can be optimized. A general framework is given such that OPH can be incorporated with different Hashing methods for different situations. We also introduce an effective gradient-based optimization algorithm for OPH. We carried out extensive experiments for Hashing-based Approximate Nearest Neighbor search and Content-based Data Retrieval on six benchmark datasets. The results show that OPH significantly outperforms several state-of-the-art related Hashing methods.

AB - Hashing, which seeks for binary codes to represent data, has drawn increasing research interest in recent years. Most existing Hashing methods follow a projection-quantization framework which first projects high-dimensional data into compact low-dimensional space and then quantifies the compact data into binary codes. The projection step plays a key role in Hashing and academia has paid considerable attention to it. Previous works have proven that a good projection should simultaneously 1) preserve important information in original data, and 2) lead to compact representation with low quantization error. However, they adopted a greedy two-step strategy to consider the above two properties separately. In this paper, we empirically show that such a two-step strategy will result in a sub-optimal solution because the optimal solution to 1) limits the feasible set for the solution to 2). We put forward a novel projection learning method for Hashing, dubbed Optimized Projection (OPH). Specifically, we propose to learn the projection in a unified formulation which can find a good trade-off such that the overall performance can be optimized. A general framework is given such that OPH can be incorporated with different Hashing methods for different situations. We also introduce an effective gradient-based optimization algorithm for OPH. We carried out extensive experiments for Hashing-based Approximate Nearest Neighbor search and Content-based Data Retrieval on six benchmark datasets. The results show that OPH significantly outperforms several state-of-the-art related Hashing methods.

KW - Hashing

KW - Quantization

KW - Algorithm

U2 - 10.1016/j.patrec.2018.04.027

DO - 10.1016/j.patrec.2018.04.027

M3 - Journal article

VL - 117

SP - 169

EP - 178

JO - Pattern Recognition Letters

JF - Pattern Recognition Letters

SN - 0167-8655

ER -

Research

Electronic data

Links

Text available via DOI:

Keywords