Internet publication
NormGrad: finding the pixels that matter for training
- Abstract:
- The different families of saliency methods, either based on contrastive signals, closed-form formulas mixing gradients with activations or on perturbation masks, all focus on which parts of an image are responsible for the model's inference. In this paper, we are rather interested by the locations of an image that contribute to the model's training. First, we propose a principled attribution method that we extract from the summation formula used to compute the gradient of the weights for a 1x1 convolutional layer. The resulting formula is fast to compute and can used throughout the network, allowing us to efficiently produce fined-grained importance maps. We will show how to extend it in order to compute saliency maps at any targeted point within the network. Secondly, to make the attribution really specific to the training of the model, we introduce a meta-learning approach for saliency methods by considering an inner optimisation step within the loss. This way, we do not aim at identifying the parts of an image that contribute to the model's output but rather the locations that are responsible for the good training of the model on this image. Conversely, we also show that a similar meta-learning approach can be used to extract the adversarial locations which can lead to the degradation of the model.
- Publication status:
- Published
- Peer review status:
- Not peer reviewed
Actions
Access Document
- Files:
-
-
(Preview, Version of record, pdf, 3.8MB, Terms of use)
-
- Publisher copy:
- 10.48550/arxiv.1910.08823
Authors
- Host title:
- arXiv
- Publication date:
- 2019-10-19
- DOI:
- Language:
-
English
- Pubs id:
-
1771176
- Local pid:
-
pubs:1771176
- Deposit date:
-
2024-11-22
Terms of use
- Copyright holder:
- Rebuffi et al
- Copyright date:
- 2019
- Rights statement:
- © 2019 The Author(s)
- Licence:
- Other
If you are the owner of this record, you can report an update to it here: Report update to this record