ColorMamba: Towards High-quality NIR-to-RGB Spectral Translation with Mamba

Huiyu Zhai, Guang Jin, Xingxing Yang*, Guosheng Kang

*Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

Translating NIR to the visible spectrum is challenging due to cross-domain complexities. Current models struggle to balance a broad receptive field with computational efficiency, limiting practical use. Although the Selective Structured State Space Model, especially the improved version, Mamba, excels in generative tasks by capturing long-range dependencies with linear complexity, its default approach of converting 2D images into 1D sequences neglects local context. In this work, we propose a simple but effective backbone, dubbed ColorMamba, which first introduces Mamba into spectral translation tasks. To explore global long-range dependencies and local context for efficient spectral translation, we introduce learnable padding tokens to enhance the distinction of image boundaries and prevent potential confusion within the sequence model. Furthermore, local convolutional enhancement and agent attention are designed to improve the vanilla Mamba. Moreover, we exploit the HSV color to provide multi-scale guidance in the reconstruction process for more accurate spectral translation. Extensive experiments show that our ColorMamba achieves a 1.02 improvement in terms of PSNR compared with the state-of-the-art method. Our code is available at https://github.com/AlexYangxx/ColorMamba/.

Original languageEnglish
Title of host publicationProceedings of Asian Conference on Machine Learning 2024
EditorsVu Nguyen, Hsuan-Tien Lin
PublisherML Research Press
Pages765-780
Number of pages16
Publication statusPublished - 5 Dec 2024
Event16th Asian Conference on Machine Learning, ACML 2024 - Hanoi, Viet Nam
Duration: 5 Dec 20248 Dec 2024
https://proceedings.mlr.press/v260/

Publication series

NameProceedings of Machine Learning Research
PublisherML Research Press
Volume260
ISSN (Print)2640-3498

Conference

Conference16th Asian Conference on Machine Learning, ACML 2024
Country/TerritoryViet Nam
CityHanoi
Period5/12/248/12/24
Internet address

User-Defined Keywords

  • Colorization
  • Mamba
  • NIR Image
  • Spectral Translation
  • State Space Model

Fingerprint

Dive into the research topics of 'ColorMamba: Towards High-quality NIR-to-RGB Spectral Translation with Mamba'. Together they form a unique fingerprint.

Cite this