Signal Feed

Latest Articles

View All
Why Adversarial Examples Are Features, Not Flaws: What Mislabeled Data Reveals About How Neural Networks Really Learn
Artificial Intelligence

Why Adversarial Examples Are Features, Not Flaws: What Mislabeled Data Reveals About How Neural Networks Really Learn

Section 3.2 of Ilyas et al. (2019) demonstrates that a model trained exclusively on adversarial examples achieves non-trivial generalization on the original test set. This paper shows that these experiments represent a specific instance of a broader phenomenon: adversarial vulnerability can emerge from features that are genuinely predictive yet brittle, rather than from model artifacts or noise. The finding reframes adversarial robustness not as a bug to be patched, but as a structural property of the data distribution itself.

Aug 06, 2019 574 views