Abstract
Novel view synthesis typically requires a comprehensive set of multi-view images for either image-based rendering or scene representation-based optimization. However, achieving high-fidelity novel view rendering often demands a large number of images. To address this limitation, we propose NCDI-Diffusion, a novel diffusion-based view synthesis method that reduces the number of required images by leveraging the prior knowledge embedded in pre-trained diffusion models. Specifically, NCDI-Diffusion encapsulates both the contextual and directional information of a scene by utilizing neural descriptors, which are inversely derived from a limited set of positioned multi-view training images. These descriptors guide the diffusion model's image synthesis process, enabling the generation of high-quality novel views. Empirical results on the Forward-facing Dataset demonstrate the effectiveness of our approach to novel view synthesis.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
Editors | Bhaskar D Rao, Isabel Trancoso, Gaurav Sharma, Neelesh B. Mehta |
Publisher | IEEE |
Number of pages | 5 |
ISBN (Electronic) | 9798350368741 |
ISBN (Print) | 9798350368758 |
DOIs | |
Publication status | Published - 6 Apr 2025 |
Event | 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2025 - Hyderabad, India Duration: 6 Apr 2025 → 11 Apr 2025 https://ieeexplore.ieee.org/xpl/conhome/10887540/proceeding |
Publication series
Name | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
---|---|
Publisher | IEEE |
Conference
Conference | 2025 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2025 |
---|---|
Country/Territory | India |
City | Hyderabad |
Period | 6/04/25 → 11/04/25 |
Internet address |
User-Defined Keywords
- Diffusion model
- Novel view synthesis
- Textual Inversion