Gianluca Cosentino

UniCa About Professors and Researchers Gianluca Cosentino Research Research outcomes (IRIS)

Gianluca Cosentino

Robustness-Congruent Adversarial Training for Secure Machine Learning Model Updates

Angioni, Daniele^First;Demetrio, Luca^Second;Pintor, Maura;Oneto, Luca;Anguita, Davide;Biggio, Battista^Penultimate;Roli, Fabio^Last

2025-01-01

Abstract

Machine-learning models demand periodic updates to improve their average accuracy, exploiting novel architectures and additional data. However, a newly updated model may commit mistakes that the previous model did not make. Such misclassifications are referred to as negative flips, experienced by users as a regression of performance. In this work, we show that this problem also affects robustness to adversarial examples, hindering the development of secure model update practices. In particular, when updating a model to improve its adversarial robustness, previously ineffective adversarial attacks on some inputs may become successful, causing a regression in the perceived security of the system. We propose a novel technique, named robustness-congruent adversarial training, to address this issue. It amounts to fine-tuning a model with adversarial training, while constraining it to retain higher robustness on the samples for which no adversarial example was found before update. We show that our algorithm and, more generally, learning with non-regression constraints, provides a theoretically-grounded framework to train consistent estimators. Our experiments on robust models for computer vision confirm that both accuracy and robustness, even if improved after model update, can be affected by negative flips, and our robustness-congruent adversarial training can mitigate the problem, outperforming competing baseline methods.

Short Card

Tab complete

Full Sheet(DC)

         Anno di pubblicazione 
       
        2025 
       
         Lingua/e 
       
        Inglese 
       
         Titolo della Rivista 
       
        IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 
       
         Volume 
       
        47 
       
         Fascicolo 
       
        9 
       
         Da pagina 
       
        7457 
       
         A pagina 
       
        7469 
       
         Numero di pagine 
       
        13 
       
         Codice DOI 
       
        https://dx.doi.org/10.1109/TPAMI.2025.3573237 
       
         Codice UT ISI 
       
        WOS:001547756300018 
       
         Codice Scopus 
       
        2-s2.0-105005944705 
       
         PMID 
       
        40408194 
       
         URL 
       
        https://ieeexplore.ieee.org/document/11014530 
       
         Referee 
       
        Esperti anonimi 
       
         Rilevanza della rivista 
       
        internazionale 
       
         Caratterizzazione prevalente 
       
        scientifica 
       
         Parole chiave 
       
        Adversarial Examples; Adversarial Robustness; Machine Learning; Regression Testing 
       
         Presenza di coautori internazionali 
       
        no 
       
         Tutti gli autori 
       
        Angioni, Daniele; Demetrio, Luca; Pintor, Maura; Oneto, Luca; Anguita, Davide; Biggio, Battista; Roli, Fabio
         
         Tipologia 
       
        1.1 Articolo in rivista 
       
         Tipologia 
       
        info:eu-repo/semantics/article 
       
         Tipologia 
       
        1 Contributo su Rivista::1.1 Articolo in rivista 
       
         Tipologia sito docente 
       
        262 
       
         Numero autori 
       
        7 
       
         Fulltext 
       
        partially_open 
       
         Titolo del progetto 
       
           Project Title  European Lighthouse on Secure and Safe AI 
         
           Acronym  ELSA 
         
           Funder Name  European Commission 
         
           Funding Stream  Horizon Europe Framework Programme 
         
           Award Number  101070617 
         
           Project Title  A COMPREHENSIVE TRUSTWORTHY FRAMEWORK FOR CONNECTED MACHINE LEARNING AND SECURE INTERCONNECTED AI SOLUTIONS 
         
           Acronym  CoEvolution 
         
           Funder Name  European Commission 
         
           Funding Stream  Horizon Europe Framework Programme 
         
           Award Number  101168560 
         
         Type: 
       
        1.1 Articolo in rivista

Files in This Item:

File	Size	Format
Robustness-Congruent_Adversarial_Training_for_Secure_Machine_Learning_Model_Updates.pdf open access Type: Author’s Accepted Manuscript AAM, Post-print, (version accepted by the publisher) Size 3.72 MB Format Adobe PDF View/Open	3.72 MB	Adobe PDF	View/Open
Robustness-Congruent_Adversarial_Training_for_Secure_Machine_Learning_Model_Updates_FINAL.pdf Solo gestori archivio Type: versione editoriale Size 1.63 MB Format Adobe PDF & nbsp; View / Open Request a copy	1.63 MB	Adobe PDF	& nbsp; View / Open Request a copy

University of Cagliari

University of Cagliari