DeepSPG: Exploring Deep Semantic Prior Guidance for Low-light Image Enhancement with Multimodal Learning

  • Jialang Lu
  • Huayu Zhao
  • Huiyu Zhai
  • Xingxing Yang*
  • Shini Han

* Corresponding author for this work

Research output: Chapter in book/report/conference proceeding > Conference proceeding > peer-review

1 Citation (Scopus)

Abstract

It has long been believed that learning high-level semantics can benefit various downstream computer vision tasks. However, in the low-light image enhancement (LLIE) community, existing methods learn a brute-force mapping between the low-light and normal-light domains without considering the semantic information of different regions, especially extremely dark regions that suffer from severe information loss. To address this issue, we propose DeepSPG, a new deep semantic prior-guided framework for LLIE based on Retinex image decomposition, which explores informative semantic knowledge via a pre-trained semantic segmentation model and multimodal learning. Notably, we incorporate both an image-level semantic prior and a text-level semantic prior, and thus formulate a multimodal learning framework with combinatorial deep semantic prior guidance for LLIE. Specifically, we incorporate semantic knowledge to guide the enhancement process via three designs: (1) image-level semantic prior guidance that leverages hierarchical semantic features from a pre-trained semantic segmentation model; (2) text-level semantic prior guidance that integrates natural-language semantic constraints via a pre-trained vision-language model; and (3) a multi-scale semantic-aware structure that facilitates effective incorporation of semantic features. Our proposed DeepSPG demonstrates superior performance compared to state-of-the-art methods across five benchmark datasets. The implementation details and code are publicly available at https://github.com/Wenyuzhy/DeepSPG.
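For readers unfamiliar with the Retinex model that the abstract builds on, the following is a minimal sketch of the classical idea only: an image I is treated as the element-wise product of a reflectance map R and an illumination map L, and enhancement brightens L before recombining. Everything in it is an illustrative assumption rather than the DeepSPG method: the function names, the max-over-channels illumination estimate, and the gamma value are hypothetical, and the semantic and text-level guidance described above is not modelled.

```python
import numpy as np


def retinex_decompose(image, eps=1e-4):
    """Split an RGB image with values in [0, 1] into (reflectance, illumination).

    Assumption: illumination is approximated by the per-pixel maximum over
    the colour channels, a common hand-crafted estimate. DeepSPG's actual
    decomposition is not specified in the abstract and is not reproduced here.
    """
    image = np.asarray(image, dtype=np.float64)
    # H x W x 1 illumination map, floored at eps to avoid division by zero.
    illumination = np.maximum(image.max(axis=-1, keepdims=True), eps)
    # Reflectance is what remains after dividing out the illumination.
    reflectance = image / illumination
    return reflectance, illumination


def enhance(image, gamma=0.45):
    """Brighten the illumination with a gamma curve and recombine.

    gamma is a hypothetical value chosen for illustration; gamma < 1
    lifts dark illumination values more strongly than bright ones.
    """
    reflectance, illumination = retinex_decompose(image)
    brightened = illumination ** gamma
    return np.clip(reflectance * brightened, 0.0, 1.0)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    low_light = rng.uniform(0.0, 0.2, size=(4, 4, 3))  # synthetic dark image
    result = enhance(low_light)
    print("mean before:", low_light.mean(), "mean after:", result.mean())
```

A learned approach such as the one the abstract describes would replace the hand-crafted illumination estimate and gamma curve with networks, which is where semantic priors could inform how each region is brightened.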
Original language: English
Title of host publication: ICMR 2025 - Proceedings of the 2025 International Conference on Multimedia Retrieval
Publisher: Association for Computing Machinery (ACM)
Pages: 935-943
Number of pages: 9
ISBN (Electronic): 9798400718779
Publication status: Published - 30 Jun 2025
Event: 2025 International Conference on Multimedia Retrieval, ICMR 2025 - Chicago, United States
Duration: 30 Jun 2025 to 3 Jul 2025
https://dl.acm.org/doi/proceedings/10.1145/3731715

Publication series

Name: ICMR 2025 - Proceedings of the 2025 International Conference on Multimedia Retrieval

Conference

Conference: 2025 International Conference on Multimedia Retrieval, ICMR 2025
Country/Territory: United States
City: Chicago
Period: 30/06/25 to 3/07/25

User-Defined Keywords

  • Low-light image enhancement
  • Retinex decomposition
  • multimodal learning
  • semantic guidance
