CARSO: Blending Adversarial Training and Purification Improves Adversarial Robustness

Ballarin, Emanuele; Ansuini, Alessio; Bortolussi, Luca

arXiv:2306.06081 (cs)

[Submitted on 25 May 2023 (v1), last revised 17 Oct 2023 (this version, v3)]

Abstract: In this work, we propose CARSO, a novel adversarial defence mechanism for image classification that blends the paradigms of adversarial training and adversarial purification in a mutually beneficial, robustness-enhancing way. The method builds upon an adversarially-trained classifier and learns to map its internal representation, associated with a potentially perturbed input, onto a distribution of tentative clean reconstructions. Multiple samples from this distribution are classified by the adversarially-trained model itself, and an aggregation of its outputs constitutes the final robust prediction. Experimental evaluation on a well-established benchmark of varied, strong adaptive attacks, across different image datasets and classifier architectures, shows that CARSO is able to defend itself against foreseen and unforeseen threats, including adaptive end-to-end attacks devised for stochastic defences. At the cost of a tolerable reduction in clean accuracy, our method improves the state of the art for CIFAR-10 and CIFAR-100 $\ell_\infty$ robust classification accuracy against AutoAttack by a significant margin. Code and pre-trained models are available at this https URL.
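
The inference flow described in the abstract (extract the classifier's internal representation, sample several tentative reconstructions, classify each one, aggregate) can be illustrated with a minimal, dependency-free sketch. This is not the authors' implementation: `toy_classifier`, `toy_purifier`, and `robust_predict` are hypothetical stand-ins, the two-class setup is invented for illustration, and averaging class probabilities is an assumed aggregation rule, since the abstract does not specify one.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def toy_classifier(x):
    """Hypothetical stand-in for the adversarially-trained classifier.

    Returns (internal representation, class probabilities) for two classes.
    """
    mean = sum(x) / len(x)
    representation = [mean]  # toy "internal representation"
    return representation, softmax([-mean, mean])


def toy_purifier(representation, dim, rng, noise=0.1):
    """Hypothetical stand-in for the learned purifier.

    Draws one tentative reconstruction from a distribution that is
    conditioned on the classifier's internal representation.
    """
    return [representation[0] + rng.gauss(0.0, noise) for _ in range(dim)]


def robust_predict(x, n_samples=8, seed=0):
    """Sample reconstructions, classify each, and aggregate the outputs.

    Aggregation here is a plain average of probabilities (an assumption).
    """
    rng = random.Random(seed)
    representation, _ = toy_classifier(x)
    summed = [0.0, 0.0]
    for _ in range(n_samples):
        reconstruction = toy_purifier(representation, len(x), rng)
        _, probs = toy_classifier(reconstruction)
        summed = [s + p for s, p in zip(summed, probs)]
    averaged = [s / n_samples for s in summed]
    label = max(range(len(averaged)), key=averaged.__getitem__)
    return label, averaged
```

Calling `robust_predict([1.0, 1.0, 1.0, 1.0])` returns a predicted label together with the averaged class probabilities. In a real setting the toy functions would be replaced by trained networks and the inputs would be images.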

Comments:	19 pages, 1 figure, 9 tables
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:2306.06081 [cs.CV]
	(or arXiv:2306.06081v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.06081

Submission history

From: Emanuele Ballarin
[v1] Thu, 25 May 2023 09:04:31 UTC (635 KB)
[v2] Wed, 14 Jun 2023 00:28:09 UTC (616 KB)
[v3] Tue, 17 Oct 2023 15:20:47 UTC (217 KB)
