Subclass-Dominant Label Noise: A Counterexample for the Success of Early Stopping

Yingbin Bai, Zhongyi Han, Erkun Yang, Jun Yu, Bo Han, Dadong Wang, Tongliang Liu*

*Corresponding author for this work

Research output: Chapter in book/report/conference proceedingConference proceedingpeer-review

Abstract

In this paper, we empirically investigate a previously overlooked and widespread type of label noise, subclass-dominant label noise (SDN).Our findings reveal that, during the early stages of training, deep neural networks can rapidly memorize mislabeled examples in SDN.This phenomenon poses challenges in effectively selecting confident examples using conventional early stopping techniques.To address this issue, we delve into the properties of SDN and observe that long-trained representations are superior at capturing the high-level semantics of mislabeled examples, leading to a clustering effect where similar examples are grouped together.Based on this observation, we propose a novel method called NoiseCluster that leverages the geometric structures of long-trained representations to identify and correct SDN.Our experiments demonstrate that NoiseCluster outperforms state-of-the-art baselines on both synthetic and real-world datasets, highlighting the importance of addressing SDN in learning with noisy labels.The code is available at https://github.com/tmllab/2023_NeurIPS_SDN.

Original languageEnglish
Title of host publication37th Conference on Neural Information Processing Systems, NeurIPS 2023
EditorsA. Oh, T. Naumann, A. Globerson, K. Saenko, M. Hardt, S. Levine
PublisherNeural Information Processing Systems Foundation
Number of pages24
ISBN (Print)9781713899921
Publication statusPublished - 10 Dec 2023
Event37th Conference on Neural Information Processing Systems, NeurIPS 2023 - Ernest N. Morial Convention Center, New Orleans, United States
Duration: 10 Dec 202316 Dec 2023
https://proceedings.neurips.cc/paper_files/paper/2023 (Conference Paper Search)
https://openreview.net/group?id=NeurIPS.cc/2023/Conference#tab-accept-oral (Conference Paper Search)
https://neurips.cc/Conferences/2023 (Conference Website)

Publication series

NameAdvances in Neural Information Processing Systems
Volume36
ISSN (Print)1049-5258
NameNeurIPS Proceedings

Conference

Conference37th Conference on Neural Information Processing Systems, NeurIPS 2023
Country/TerritoryUnited States
CityNew Orleans
Period10/12/2316/12/23
Internet address

Scopus Subject Areas

  • Computer Networks and Communications
  • Information Systems
  • Signal Processing

Fingerprint

Dive into the research topics of 'Subclass-Dominant Label Noise: A Counterexample for the Success of Early Stopping'. Together they form a unique fingerprint.

Cite this