Abstract
Novel view synthesis typically requires a comprehensive set of multi-view images for either image-based rendering or scene representation-based optimization. However, achieving high-fidelity novel view rendering often demands a large number of images. To address this limitation, we propose NCDI-Diffusion, a novel diffusion-based view synthesis method that reduces the number of required images by leveraging the prior knowledge embedded in pre-trained diffusion models. Specifically, NCDI-Diffusion encapsulates both the contextual and directional information of a scene by utilizing neural descriptors, which are inversely derived from a limited set of positioned multi-view training images. These descriptors guide the diffusion model's image synthesis process, enabling the generation of high-quality novel views. Empirical results on the Forward-facing Dataset demonstrate the effectiveness of our approach to novel view synthesis.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
| Editors | Bhaskar D Rao, Isabel Trancoso, Gaurav Sharma, Neelesh B. Mehta |
| Publisher | IEEE |
| Pages | 1-5 |
| Number of pages | 5 |
| ISBN (Electronic) | 9798350368741 |
| ISBN (Print) | 9798350368758 |
| DOIs | |
| Publication status | Published - 6 Apr 2025 |
| Event | 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2025 - Hyderabad, India Duration: 6 Apr 2025 → 11 Apr 2025 https://ieeexplore.ieee.org/xpl/conhome/10887540/proceeding |
Publication series
| Name | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings |
|---|---|
| Publisher | IEEE |
| ISSN (Print) | 1520-6149 |
| ISSN (Electronic) | 2379-190X |
Conference
| Conference | 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2025 |
|---|---|
| Country/Territory | India |
| City | Hyderabad |
| Period | 6/04/25 → 11/04/25 |
| Internet address |
User-Defined Keywords
- Diffusion model
- Novel view synthesis
- Textual Inversion