Why adversarial reprogramming works, when it fails, and how to tell the difference

Demontis, Ambra
;
Pintor, Maura;Biggio, Battista;Roli, Fabio
Last
2023-01-01

Abstract

Adversarial reprogramming allows repurposing a machine-learning model to perform a different task. For example, a model trained to recognize animals can be reprogrammed to recognize digits by embedding an adversarial program in the digit images provided as input. Recent work has shown that adversarial reprogramming may not only be used to abuse machine-learning models provided as a service, but also beneficially, to improve transfer learning when training data is scarce. However, the factors affecting its success are still largely unexplained. In this work, we develop a first-order linear model of adversarial reprogramming to show that its success inherently depends on the size of the average input gradient, which grows when input gradients are more aligned, and when inputs have higher dimensionality. The results of our experimental analysis, involving fourteen distinct reprogramming tasks, show that the above factors are correlated with the success and the failure of adversarial reprogramming.
2023
2023
Inglese
632
130
143
14
Esperti anonimi
internazionale
scientifica
Adversarial machine learning; Adversarial reprogramming; Neural networks; Transfer learning
Goal 9: Industry, Innovation, and Infrastructure
Zheng, Yang; Feng, Xiaoyi; Xia, Zhaoqiang; Jiang, Xiaoyue; Demontis, Ambra; Pintor, Maura; Biggio, Battista; Roli, Fabio
1.1 Articolo in rivista
info:eu-repo/semantics/article
1 Contributo su Rivista::1.1 Articolo in rivista
262
8
open
Files in This Item:
File Size Format  
why_adv_reprogramming.pdf

open access

Description: articolo online
Type: versione editoriale
Size 3.29 MB
Format Adobe PDF
3.29 MB Adobe PDF View/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Questionnaire and social

Share on:
Impostazioni cookie