Skip to main navigation Skip to search Skip to main content

PRO-VPT: Distribution-Adaptive Visual Prompt Tuning via Prompt Relocation

  • Chikai Shang
  • , Mengke Li
  • , Yiqun Zhang
  • , Zhen Chen
  • , Jinlin Wu
  • , Fangqing Gu*
  • , Yang Lu*
  • , Yiu-ming Cheung
  • *Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

Visual prompt tuning (VPT), i.e., fine-tuning some lightweight prompt tokens, provides an efficient and effective approach for adapting pre-trained models to various downstream tasks. However, most prior art indiscriminately uses a fixed prompt distribution across different tasks, neglecting the importance of each block varying depending on the task. In this paper, we introduce adaptive distribution optimization (ADO) by tackling two key questions: (1) How to appropriately and formally define ADO, and (2) How to design an adaptive distribution strategy guided by this definition? Through empirical analysis, we first confirm that properly adjusting the distribution significantly improves VPT performance, and further uncover a key insight that a nested relationship exists between ADO and VPT. Based on these findings, we propose a new VPT framework, termed PRO-VPT (iterative Prompt RelOcation-based VPT), which adaptively adjusts the distribution built upon a nested optimization formulation. Specifically, we develop a prompt relocation strategy derived from this formulation, comprising two steps: pruning idle prompts from prompt-saturated blocks, followed by allocating these prompts to the most prompt-needed blocks. By iteratively performing prompt relocation and VPT, our proposal can adaptively learn the optimal prompt distribution in a nested optimization-based manner, thereby unlocking the full potential of VPT. Extensive experiments demonstrate that our proposal significantly outperforms advanced VPT methods, e.g., PRO-VPT surpasses VPT by 1.6 pp and 2.0 pp average accuracy, leading prompt-based methods to state-of-the-art performance on VTAB-1k and FGVC benchmarks. The code is available at https://github.com/ckshang/PRO-VPT.
Original languageEnglish
Title of host publication2025 IEEE/CVF International Conference on Computer Vision (ICCV)
Place of PublicationHonolulu
PublisherIEEE
Pages1558-1568
Number of pages11
ISBN (Electronic)9798331587758
ISBN (Print)9798331587765
DOIs
Publication statusPublished - 19 Oct 2025
Event2025 IEEE/CVF International Conference on Computer Vision, ICCV 2025 - Honolulu, United States
Duration: 19 Oct 202523 Oct 2025
https://iccv.thecvf.com/virtual/2025/index.html (Conference website)
https://openaccess.thecvf.com/ICCV2025 (Conference papers)
https://ieeexplore.ieee.org/xpl/conhome/11443115/proceeding (Conference proceedings)

Publication series

NameIEEE/CVF International Conference on Computer Vision
PublisherIEEE
ISSN (Print)1550-5499
ISSN (Electronic)2380-7504

Conference

Conference2025 IEEE/CVF International Conference on Computer Vision, ICCV 2025
Country/TerritoryUnited States
CityHonolulu
Period19/10/2523/10/25
Internet address

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 9 - Industry, Innovation, and Infrastructure
    SDG 9 Industry, Innovation, and Infrastructure

Fingerprint

Dive into the research topics of 'PRO-VPT: Distribution-Adaptive Visual Prompt Tuning via Prompt Relocation'. Together they form a unique fingerprint.

Cite this