Conference item
Improving adversarial transferability via model alignment
- Abstract:
- Neural networks are susceptible to adversarial perturbations that are transferable across different models. In this paper, we introduce a novel model alignment technique aimed at improving a given source model’s ability in generating transferable adversarial perturbations. During the alignment process, the parameters of the source model are fine-tuned to minimize an alignment loss. This loss measures the divergence in the predictions between the source model and another, independently trained model, referred to as the witness model. To understand the effect of model alignment, we conduct a geometric analysis of the resulting changes in the loss landscape. Extensive experiments on the ImageNet dataset, using a variety of model architectures, demonstrate that perturbations generated from aligned source models exhibit significantly higher transferability than those from the original source model. Our source code is available at https://github.com/averyma/model-alignment.
- Publication status:
- Published
- Peer review status:
- Peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Accepted manuscript, pdf, 1.0MB, Terms of use)
-
- Publisher copy:
- 10.1007/978-3-031-73033-7_5
Authors
+ Engineering and Physical Sciences Research Council
More from this funder
- Funder identifier:
- https://ror.org/0439y7842
- Grant:
- EP/W002981/1
- Publisher:
- Springer
- Host title:
- Proceedings of the 18th European Conference on Computer Vision (ECCV 2024)
- Pages:
- 74–92
- Series:
- Lecture Notes in Computer Science
- Series number:
- 15120
- Publication date:
- 2024-10-31
- Acceptance date:
- 2024-07-01
- Event title:
- 18th European Conference on Computer Vision (ECCV 2024)
- Event location:
- Milan, Italy
- Event website:
- https://eccv.ecva.net/virtual/2024/index.html.
- Event start date:
- 2024-09-29
- Event end date:
- 2024-10-04
- DOI:
- EISSN:
-
1611-3349
- ISSN:
-
0302-9743
- Language:
-
English
- Pubs id:
-
2061373
- Local pid:
-
pubs:2061373
- Deposit date:
-
2024-11-12
Terms of use
- Copyright holder:
- Ma et al.
- Copyright date:
- 2024
- Rights statement:
- © 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG
- Notes:
- This paper was presented at the 18th European Conference on Computer Vision (ECCV 2024), 29th September - 4th October 2024, Milan, Italy. This is the accepted manuscript version of the article. The final version is available online from Springer at https://dx.doi.org/10.1007/978-3-031-73033-7_5
If you are the owner of this record, you can report an update to it here: Report update to this record