TY - JOUR
T1 - Online Non-Negative Multi-Modality Feature Template Learning for RGB-Assisted Infrared Tracking
AU - Lan, Xiangyuan
AU - Ye, Mang
AU - Shao, Rui
AU - Zhong, Bineng
AU - Jain, Deepak Kumar
AU - Zhou, Huiyu
N1 - Funding Information:
This work was supported in part by the Hong Kong Baptist University Tier 1 Start-up Grant. The work of H. Zhou was supported in part by the UK EPSRC under Grant EP/N011074/1, in part by the Royal Society-Newton Advanced Fellowship under Grant NA160342, and in part by the European Union’s Horizon 2020 Research and Innovation Program under the Marie-Sklodowska-Curie Grant 720325. The work of B. Zhong was supported by the National Natural Science Foundation of China under Grant 61572205 and Grant 61802135. This work of D. K. Jain was supported in part by the Key Laboratory of Intelligent Air-Ground Cooperative Control for Universities in Chongqing and the Key Laboratory of Industrial IoT and Networked Control, Ministry of Education, College of Automation, Chongqing University of Posts and Telecommunications, Chongqing, China.
PY - 2019/5/14
Y1 - 2019/5/14
N2 - Infrared sensors have been deployed in many video surveillance systems because of the insensibility of their imaging procedure to some extreme conditions (e.g. low illumination condition, dim environment). To reduce human labor in video monitoring and perform intelligent infrared video understanding, an important issue we need to consider is how to locate the object of interest in consecutive video frames accurately. Therefore, developing a robust object tracking algorithm for infrared videos is necessary. However, the infrared information may not be reliable (e.g. thermal crossover), and appearance modeling with only the infrared modality may not be able to achieve good results. To address these issues, with the wide deployment of RGB-infrared camera systems, this paper proposes an infrared tracking framework in which information from RGB-modality will be exploited to assist the infrared object tracking. Specifically, within the tracking framework, in order to deal with the contaminated features caused by large appearance variations, an online non-negative feature template learning model is designed. The non-negative constraint enables the model to capture the local part-based characteristic of the target appearance. To ensure more important modality contribute more in appearance representation, an adaptive modality importance weight learning scheme is also incorporated in the proposed feature learning model. To guarantee the model optimality, an iterative optimization algorithm is derived. The experimental results on various RGB-infrared videos show the effectiveness of the proposed method.
AB - Infrared sensors have been deployed in many video surveillance systems because of the insensibility of their imaging procedure to some extreme conditions (e.g. low illumination condition, dim environment). To reduce human labor in video monitoring and perform intelligent infrared video understanding, an important issue we need to consider is how to locate the object of interest in consecutive video frames accurately. Therefore, developing a robust object tracking algorithm for infrared videos is necessary. However, the infrared information may not be reliable (e.g. thermal crossover), and appearance modeling with only the infrared modality may not be able to achieve good results. To address these issues, with the wide deployment of RGB-infrared camera systems, this paper proposes an infrared tracking framework in which information from RGB-modality will be exploited to assist the infrared object tracking. Specifically, within the tracking framework, in order to deal with the contaminated features caused by large appearance variations, an online non-negative feature template learning model is designed. The non-negative constraint enables the model to capture the local part-based characteristic of the target appearance. To ensure more important modality contribute more in appearance representation, an adaptive modality importance weight learning scheme is also incorporated in the proposed feature learning model. To guarantee the model optimality, an iterative optimization algorithm is derived. The experimental results on various RGB-infrared videos show the effectiveness of the proposed method.
KW - computer vision
KW - Optical image processing
KW - sensor fusion
UR - http://www.scopus.com/inward/record.url?scp=85067241789&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2019.2916895
DO - 10.1109/ACCESS.2019.2916895
M3 - Journal article
AN - SCOPUS:85067241789
SN - 2169-3536
VL - 7
SP - 67761
EP - 67771
JO - IEEE Access
JF - IEEE Access
M1 - 8713854
ER -