Abstract
We address the problem of people detection in RGB-D data where we leverage depth information to develop a region-of-interest (ROI) selection method that provides proposals to two color and depth CNNs. To combine the detections produced by the two CNNs, we propose a novel fusion approach based on the characteristics of depth images. We also present a new depth-encoding scheme, which not only encodes depth images into three channels but also enhances the information for classification. We conduct experiments on a publicly available RGB-D people dataset and show that our approach outperforms the baseline models that only use RGB data.
Original language | English |
---|---|
Title of host publication | Proceedings of the 15th IAPR International Conference on Machine Vision Applications, MVA 2017 |
Publisher | IEEE |
Pages | 306-309 |
Number of pages | 4 |
ISBN (Electronic) | 9784901122160 |
ISBN (Print) | 9781538604953 |
DOIs | |
Publication status | Published - May 2017 |
Event | 15th IAPR International Conference on Machine Vision Applications, MVA 2017 - Nagoya, Japan Duration: 8 May 2017 → 12 May 2017 https://ieeexplore.ieee.org/xpl/conhome/7981294/proceeding |
Publication series
Name | Proceedings of the IAPR International Conference on Machine Vision Applications, MVA |
---|
Conference
Conference | 15th IAPR International Conference on Machine Vision Applications, MVA 2017 |
---|---|
Country/Territory | Japan |
City | Nagoya |
Period | 8/05/17 → 12/05/17 |
Internet address |