Backdoor smoothing: Demystifying backdoor attacks on deep neural networks

Grosse, K;Lee, TS;Biggio, B;Park, Y;Backes, M;Molloy, I

2022-01-01

Abstract

Backdoor attacks mislead machine-learning models into outputting an attacker-specified class when presented with a specific trigger at test time. These attacks require poisoning the training data to compromise the learning algorithm, e.g., by injecting poisoning samples containing the trigger into the training set, along with the desired class label. Despite the increasing number of studies on backdoor attacks and defenses, the underlying factors affecting the success of backdoor attacks, along with their impact on the learning algorithm, are not yet well understood. In this work, we aim to shed light on this issue by showing that backdoor attacks induce a smoother decision function around the triggered samples, a phenomenon we refer to as backdoor smoothing. To quantify backdoor smoothing, we define a measure that evaluates the uncertainty associated with the predictions of a classifier around the input samples. Our experiments show that smoothness increases when the trigger is added to the input samples, and that this phenomenon is more pronounced for more successful attacks. We also provide preliminary evidence that backdoor triggers are not the only smoothing-inducing patterns, and that other artificial patterns can also be detected by our approach, paving the way towards understanding the limitations of current defenses and designing novel ones.
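
The idea of measuring prediction uncertainty around an input can be illustrated with a minimal sketch. This is not the measure defined in the paper, whose exact form is not given in this record. It assumes Gaussian perturbations and the Shannon entropy of the predicted labels, and the names `local_label_entropy`, `toy_predict`, `sigma` and `n_samples` are hypothetical. A low value suggests a locally smooth (stable) decision function, a high value suggests a nearby decision boundary.

```python
import math
import random
from collections import Counter


def local_label_entropy(predict, x, sigma=0.1, n_samples=200, seed=0):
    """Entropy (in bits) of the labels predicted on noisy copies of x.

    Assumption: Gaussian noise with standard deviation `sigma` is used as
    the neighbourhood around x; the paper may use a different scheme.
    """
    rng = random.Random(seed)
    counts = Counter()
    for _ in range(n_samples):
        # Perturb every feature independently and record the predicted label.
        noisy = [xi + rng.gauss(0.0, sigma) for xi in x]
        counts[predict(noisy)] += 1
    entropy = 0.0
    for count in counts.values():
        p = count / n_samples
        entropy -= p * math.log2(p)
    return entropy


def toy_predict(x):
    """Hypothetical stand-in classifier: the label is the sign of feature 0."""
    return int(x[0] > 0.0)


# A point on the decision boundary versus a point far away from it.
near_boundary = local_label_entropy(toy_predict, [0.0, 0.0])
far_from_boundary = local_label_entropy(toy_predict, [5.0, 0.0])
print(near_boundary > far_from_boundary)
```

Under the abstract's claim, one would expect a score of this kind to drop when a successful trigger is added to an input; the toy classifier here only demonstrates the mechanics, not that effect.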

Year of publication: 2022
Language(s): English
Journal title: COMPUTERS & SECURITY
Volume: 120
Article number: 102814
First page: 1
Last page: 9
Number of pages: 9
DOI: https://dx.doi.org/10.1016/j.cose.2022.102814
ISI UT code: WOS:000841160900007
Scopus code: 2-s2.0-85133250556
Referee: Anonymous experts
Journal relevance: international
Prevailing characterization: scientific
Keywords: ML security; Deep learning backdoors; ML poisoning; Training time attacks; Training time defenses
International co-authors: yes
All authors: Grosse, K; Lee, TS; Biggio, B; Park, Y; Backes, M; Molloy, I
Type: 1.1 Journal article
Type: info:eu-repo/semantics/article
Type: 1 Contribution in journal::1.1 Journal article
Faculty-site type: 262
Number of authors: 6
Fulltext: partially_open

Files in This Item:

File	Size	Format
1-s2.0-S0167404822002085-main-2.pdf (archive managers only; description: online article; type: publisher's version)	1.48 MB	Adobe PDF
preprint.pdf (open access; type: pre-print version)	3.59 MB	Adobe PDF

University of Cagliari