Abstract
Label Smoothing is a widely used technique in many areas. It can prevent the network from being over-confident. However, it hypotheses that the prior distribution of all classes is uniform. Here, we decide to abandon this hypothesis and propose a new smoothing method, called Smoothing with Fake Label. It shares a part of the prediction probability to a new fake class. Our experiment results show that the method can increase the performance of the models on most tasks and outperform the Label Smoothing on text classification and cross-lingual transfer tasks.
Original language | English |
---|---|
Title of host publication | CIKM 2021 - Proceedings of the 30th ACM International Conference on Information and Knowledge Management |
Publisher | Association for Computing Machinery (ACM) |
Pages | 3303-3307 |
Number of pages | 5 |
ISBN (Electronic) | 9781450384469 |
DOIs | |
Publication status | Published - 30 Oct 2021 |
Event | 30th ACM International Conference on Information and Knowledge Management, CIKM 2021 - Virtual, Online, Gold Coast, Queensland, Australia Duration: 1 Nov 2021 → 5 Nov 2021 https://www.cikm2021.org/ https://dl.acm.org/doi/proceedings/10.1145/3459637 |
Publication series
Name | Proceedings of the International Conference on Information and Knowledge Management |
---|---|
Publisher | Association for Computing Machinery |
Conference
Conference | 30th ACM International Conference on Information and Knowledge Management, CIKM 2021 |
---|---|
Country/Territory | Australia |
City | Gold Coast, Queensland |
Period | 1/11/21 → 5/11/21 |
Internet address |
User-Defined Keywords
- cross-lingual
- label smoothing
- machine translation
- neural networks
- text classification