Unlocking the Hidden Treasures: Enhancing Recommendations with Unlabeled Data

Yuhan Zhao, Rui Chen*, Qilong Han*, Hongtao Song, Li Chen

*Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

Collaborative filtering (CF) stands as a cornerstone in recommender systems, yet effectively leveraging the massive unlabeled data presents a significant challenge. Current research focuses on addressing the challenge of unlabeled data by extracting a subset that closely approximates negative samples. Regrettably, the remaining data are overlooked, failing to fully integrate this valuable information into the construction of user preferences. To address this gap, we introduce a novel positive-neutral-negative (PNN) learning paradigm. PNN introduces a neutral class, encompassing intricate items that are challenging to categorize directly as positive or negative samples. By training a model based on this triple-wise partial ranking, PNN offers a promising solution to learning complex user preferences. Through theoretical analysis, we connect PNN to one-way partial AUC (OPAUC) to validate its efficacy. Implementing the PNN paradigm is, however, technically challenging because: (1) it is difficult to classify unlabeled data into neutral or negative in the absence of supervised signals; (2) there does not exist any loss function that can handle set-level triple-wise ranking relationships. To address these challenges, we propose a semi-supervised learning method coupled with a user-aware attention model for knowledge acquisition and classification refinement. Additionally, a novel loss function with a two-step centroid ranking approach enables handling set-level rankings. Extensive experiments on four real-world datasets demonstrate that, when combined with PNN, a wide range of representative CF models can consistently and significantly boost their performance. Even with a simple matrix factorization, PNN can achieve comparable performance to sophisticated graph neutral networks. Our code is publicly available at https://github.com/Asa9aoTK/PNN-RecBole.
Original languageEnglish
Title of host publicationProceedings of the 18th ACM Conference on Recommender Systems, RecSys 2024
EditorsTommaso Di Noia, Pasquale Lops, Thorsten Joachims, Katrien Verbert, Pablo Castells, Zhenhua Dong, Ben London
PublisherAssociation for Computing Machinery (ACM)
Pages247-256
Number of pages10
ISBN (Electronic)9798400705052
ISBN (Print)9798400705052
DOIs
Publication statusPublished - 8 Oct 2024
EventThe 18th ACM Conference on Recommender Systems - Bari, Italy
Duration: 14 Oct 202418 Oct 2024
https://recsys.acm.org/recsys24/ (Conference website)
https://recsys.acm.org/recsys24/program/ (conference program)

Publication series

NameProceedings of the ACM Conference on Recommender Systems, RecSys
PublisherAssociation for Computing Machinery

Conference

ConferenceThe 18th ACM Conference on Recommender Systems
Abbreviated titleRecSys 2024
Country/TerritoryItaly
CityBari
Period14/10/2418/10/24
Internet address

User-Defined Keywords

  • Collaborative filtering
  • loss function
  • positive-neutral-negative learning

Fingerprint

Dive into the research topics of 'Unlocking the Hidden Treasures: Enhancing Recommendations with Unlabeled Data'. Together they form a unique fingerprint.

Cite this