Abstract
The maximum mean discrepancy (MMD) test could in principle detect any distributional discrepancy between two datasets. However, it has been shown that the MMD test is unaware of adversarial attacks–the MMD test failed to detect the discrepancy between natural data and adversarial data. Given this phenomenon, we raise a question: are natural and adversarial data really from different distributions? The answer is affirmative–the previous use of the MMD test on the purpose missed three key factors, and accordingly, we propose three components. Firstly, the Gaussian kernel has limited representation power, and we replace it with an effective deep kernel. Secondly, the test power of the MMD test was neglected, and we maximize it following asymptotic statistics. Finally, adversarial data may be non-independent, and we overcome this issue with the help of wild bootstrap. By taking care of the three factors, we verify that the MMD test is aware of adversarial attacks, which lights up a novel road for adversarial data detection based on two-sample tests.
Original language | English |
---|---|
Title of host publication | Proceedings of the 38th International Conference on Machine Learning (ICML 2021) |
Editors | Marina Meila, Tong Zhang |
Publisher | ML Research Press |
Pages | 3564-3575 |
Number of pages | 12 |
Publication status | Published - 18 Jul 2021 |
Event | 38th International Conference on Machine Learning, ICML 2021 - Virtual Duration: 18 Jul 2021 → 24 Jul 2021 https://icml.cc/virtual/2021/index.html https://icml.cc/Conferences/2021 https://proceedings.mlr.press/v139/ |
Publication series
Name | Proceedings of Machine Learning Research |
---|---|
Volume | 139 |
ISSN (Print) | 2640-3498 |
Conference
Conference | 38th International Conference on Machine Learning, ICML 2021 |
---|---|
Period | 18/07/21 → 24/07/21 |
Internet address |