Abstract
Large language models (LLMs) have manifested unparalleled modeling capability on various tasks, e.g., multi-step reasoning, but the input to these models is mostly limited to plain text, which can be very long and contain noisy information. Long text can take a long time to process, and thus may not be efficient enough for recommender systems that require an immediate response. In LLM-based recommendation models, user and item IDs are usually filled into a template (i.e., a discrete prompt) to allow the models to understand a given task, but the models usually need extensive fine-tuning to bridge the user/item IDs and the template words and to unleash the power of LLMs for recommendation. To address these problems, we propose to distill the discrete prompt for a specific task into a set of continuous prompt vectors so as to bridge IDs and words and to reduce the inference time. We also design a training strategy that aims to improve the efficiency of training these models. Experimental results on three real-world datasets demonstrate the effectiveness of our PrOmpt Distillation (POD) approach on both sequential recommendation and top-N recommendation tasks. Although the training efficiency can be significantly improved, the improvement in inference efficiency is limited. This finding may inspire researchers in the community to further improve the inference efficiency of LLM-based recommendation models.
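The contrast the abstract draws between a discrete prompt (a word template filled with IDs) and a small set of continuous prompt vectors can be illustrated with a toy sketch. This is not the authors' implementation: the template wording, `EMBED_DIM`, `NUM_PROMPT_VECTORS`, and both helper functions are hypothetical, and the vectors are random placeholders rather than parameters learned by distillation. It only shows why replacing template words with a few vectors shortens the model input.

```python
import random

EMBED_DIM = 4            # hypothetical embedding size, chosen for illustration
NUM_PROMPT_VECTORS = 3   # hypothetical number of continuous vectors per task


def random_vector(rng, dim=EMBED_DIM):
    """Placeholder for a learned vector; a real model would train these."""
    return [rng.uniform(-1.0, 1.0) for _ in range(dim)]


def build_discrete_input(template, user_id, item_ids):
    """Fill the word template with IDs and split it into whitespace tokens."""
    return template.format(user=user_id, items=" ".join(item_ids)).split()


def build_distilled_input(prompt_vectors, id_embeddings, user_id, item_ids):
    """Prepend the task's continuous prompt vectors to the ID embeddings only."""
    ids = [user_id] + list(item_ids)
    return [list(v) for v in prompt_vectors] + [list(id_embeddings[i]) for i in ids]


rng = random.Random(0)
# Hypothetical sequential-recommendation template (assumption, not from the paper).
template = ("considering user {user} has interacted with items {items} "
            "what is the next item to recommend")
user_id, item_ids = "user_7", ["item_12", "item_5", "item_31"]

prompt_vectors = [random_vector(rng) for _ in range(NUM_PROMPT_VECTORS)]
id_embeddings = {i: random_vector(rng) for i in [user_id] + item_ids}

discrete = build_discrete_input(template, user_id, item_ids)
distilled = build_distilled_input(prompt_vectors, id_embeddings, user_id, item_ids)

# The distilled input holds one vector per prompt slot plus one per ID,
# so its length no longer depends on how many words the template has.
print(len(discrete), len(distilled))
```

In this sketch the distilled sequence length depends only on the number of prompt vectors and IDs, whereas the discrete sequence grows with every word in the template.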
Original language | English |
---|---|
Title of host publication | CIKM 2023 - Proceedings of the 32nd ACM International Conference on Information and Knowledge Management |
Publisher | Association for Computing Machinery (ACM) |
Pages | 1348-1357 |
Number of pages | 10 |
ISBN (Print) | 9798400701245 |
Publication status | Published - 21 Oct 2023 |
Event | 32nd ACM International Conference on Information and Knowledge Management, CIKM 2023, Birmingham, United Kingdom. Duration: 21 Oct 2023 → 25 Oct 2023. https://dl.acm.org/doi/proceedings/10.1145/3583780 |
Publication series
Name | International Conference on Information and Knowledge Management, Proceedings |
---|---|
Conference
Conference | 32nd ACM International Conference on Information and Knowledge Management, CIKM 2023 |
---|---|
Country/Territory | United Kingdom |
City | Birmingham |
Period | 21/10/23 → 25/10/23 |
Scopus Subject Areas
- Business, Management and Accounting (all)
- Decision Sciences (all)
User-Defined Keywords
- Explainable Recommendation
- Generative Recommendation
- Large Language Models
- Prompt Distillation
- Recommender Systems
- Sequential Recommendation
- Top-N Recommendation