TY - JOUR
T1 - An asymmetric distance model for cross-view feature mapping in person reidentification
AU - Chen, Ying Cong
AU - Zheng, Wei Shi
AU - Lai, Jian Huang
AU - Yuen, Pong C.
N1 - Funding Information:
This work was supported in part by the Computational Science Innovative Research Team Program, Guangdong Provincial Government of China, in part by the Natural Science Foundation of China under Grant 61472456, Grant 61522115, Grant 61573387, and Grant 6151101169, in part by the Guangzhou Pearl River Science and Technology Rising Star Project under Grant 2013J2200068, in part by the Guangdong Natural Science Funds for Distinguished Young Scholar under Grant S2013050014265, in part by the Guangdong Program under Grant 2015B010105005, and in part by the Hong Kong RGC General Research Fund under Grant HKBU 12202514.
Publisher copyright:
© 2016 IEEE.
PY - 2017/8
Y1 - 2017/8
N2 - Person reidentification, which matches person images of the same identity across nonoverlapping camera views, becomes an important component for cross-camera-view activity analysis. Most (if not all) person reidentification algorithms are designed based on appearance features. However, appearance features are not stable across nonoverlapping camera views under dramatic lighting change, and those algorithms assume that two cross-view images of the same person can be well represented either by exploring robust and invariant features or by learning matching distance. Such an assumption ignores the nature that images are captured under different camera views with different camera characteristics and environments, and thus, mostly there exists large discrepancy between the extracted features under different views. To solve this problem, we formulate an asymmetric distance model for learning camera-specific projections to transform the unmatched features of each view into a common space where discriminative features across view space are extracted. A cross-view consistency regularization is further introduced to model the correlation between view-specific feature transformations of different camera views, which reflects their nature relations and plays a significant role in avoiding overfitting. A kernel cross-view discriminant component analysis is also presented. Extensive experiments have been conducted to show that asymmetric distance modeling is important for person reidentification, which matches the concerns on cross-disjoint-view matching, reporting superior performance compared with related distance learning methods on six publically available data sets.
AB - Person reidentification, which matches person images of the same identity across nonoverlapping camera views, becomes an important component for cross-camera-view activity analysis. Most (if not all) person reidentification algorithms are designed based on appearance features. However, appearance features are not stable across nonoverlapping camera views under dramatic lighting change, and those algorithms assume that two cross-view images of the same person can be well represented either by exploring robust and invariant features or by learning matching distance. Such an assumption ignores the nature that images are captured under different camera views with different camera characteristics and environments, and thus, mostly there exists large discrepancy between the extracted features under different views. To solve this problem, we formulate an asymmetric distance model for learning camera-specific projections to transform the unmatched features of each view into a common space where discriminative features across view space are extracted. A cross-view consistency regularization is further introduced to model the correlation between view-specific feature transformations of different camera views, which reflects their nature relations and plays a significant role in avoiding overfitting. A kernel cross-view discriminant component analysis is also presented. Extensive experiments have been conducted to show that asymmetric distance modeling is important for person reidentification, which matches the concerns on cross-disjoint-view matching, reporting superior performance compared with related distance learning methods on six publically available data sets.
KW - Cross-view matching
KW - person reidentification
KW - visual surveillance
UR - http://www.scopus.com/inward/record.url?scp=85029366313&partnerID=8YFLogxK
U2 - 10.1109/TCSVT.2016.2515309
DO - 10.1109/TCSVT.2016.2515309
M3 - Journal article
AN - SCOPUS:85029366313
SN - 1051-8215
VL - 27
SP - 1661
EP - 1675
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
IS - 8
M1 - 7373616
ER -