Abstract
We address the problem of people detection in RGB-D data where we leverage depth information to develop a region-of-interest (ROI) selection method that provides proposals to two color and depth CNNs. To combine the detections produced by the two CNNs, we propose a novel fusion approach based on the characteristics of depth images. We also present a new depth-encoding scheme, which not only encodes depth images into three channels but also enhances the information for classification. We conduct experiments on a publicly available RGB-D people dataset and show that our approach outperforms the baseline models that only use RGB data.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 15th IAPR International Conference on Machine Vision Applications, MVA 2017 |
| Publisher | IEEE |
| Pages | 306-309 |
| Number of pages | 4 |
| ISBN (Electronic) | 9784901122160 |
| ISBN (Print) | 9781538604953 |
| DOIs | |
| Publication status | Published - May 2017 |
| Event | 15th IAPR International Conference on Machine Vision Applications, MVA 2017 - Nagoya, Japan Duration: 8 May 2017 → 12 May 2017 https://ieeexplore.ieee.org/xpl/conhome/7981294/proceeding |
Publication series
| Name | Proceedings of the IAPR International Conference on Machine Vision Applications, MVA |
|---|
Conference
| Conference | 15th IAPR International Conference on Machine Vision Applications, MVA 2017 |
|---|---|
| Country/Territory | Japan |
| City | Nagoya |
| Period | 8/05/17 → 12/05/17 |
| Internet address |