Skip to main navigation Skip to search Skip to main content

Enhancing Sample Selection Against Label Noise by Cutting Mislabeled Easy Examples

  • Suqin Yuan
  • , Lei Feng*
  • , Bo Han
  • , Tongliang Liu*
  • *Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

Sample selection is a prevalent approach in learning with noisy labels, aiming to identify confident samples for training. Although existing sample selection methods have achieved decent results by reducing the noise rate of the selected subset, they often overlook that not all mislabeled examples harm the model's performance equally. In this paper, we demonstrate that mislabeled examples correctly predicted by the model early in the training process are particularly harmful to model performance. We refer to these examples as Mislabeled Easy Examples (MEEs). To address this, we propose Early Cutting, which introduces a recalibration step that employs the model's later training state to re-select the confident subset identified early in training, thereby avoiding misleading confidence from early learning and effectively filtering out MEEs. Experiments on the CIFAR, WebVision, and full ImageNet-1k datasets demonstrate that our method effectively improves sample selection and model performance by reducing MEEs.
Original languageEnglish
Title of host publication39th Conference on Neural Information Processing Systems, NeurIPS 2025
EditorsD. Belgrave, C. Zhang, H. Lin, R. Pascanu, P. Koniusz, M. Ghassemi, N. Chen
PublisherNeural Information Processing Systems Foundation
Pages1-29
Number of pages29
Publication statusPublished - Dec 2025
Event39th Conference on Neural Information Processing Systems, NeurIPS 2025 - San Diego, United States
Duration: 2 Dec 20257 Dec 2025
https://neurips.cc/Conferences/2025 (Conference website)
https://neurips.cc/virtual/2025/loc/san-diego/papers.html (Conference schedule)
https://proceedings.neurips.cc/paper_files/paper/2025 (Conference proceedings)

Publication series

NameAdvances in Neural Information Processing Systems
Volume38
NameNeurIPS Proceedings

Conference

Conference39th Conference on Neural Information Processing Systems, NeurIPS 2025
Abbreviated titleNeurIPS 2025
Country/TerritoryUnited States
CitySan Diego
Period2/12/257/12/25
Internet address

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 9 - Industry, Innovation, and Infrastructure
    SDG 9 Industry, Innovation, and Infrastructure

Fingerprint

Dive into the research topics of 'Enhancing Sample Selection Against Label Noise by Cutting Mislabeled Easy Examples'. Together they form a unique fingerprint.

Cite this