Poisoning Attacks on Algorithmic Fairness

Biggio, B.
2021-01-01

Abstract

Research in adversarial machine learning has shown how the performance of machine learning models can be seriously compromised by injecting even a small fraction of poisoning points into the training data. While the effects on model accuracy of such poisoning attacks have been widely studied, their potential effects on other model performance metrics remain to be evaluated. In this work, we introduce an optimization framework for poisoning attacks against algorithmic fairness, and develop a gradient-based poisoning attack aimed at introducing classification disparities among different groups in the data. We empirically show that our attack is effective not only in the white-box setting, in which the attacker has full access to the target model, but also in a more challenging black-box scenario in which the attacks are optimized against a substitute model and then transferred to the target model. We believe that our findings pave the way towards the definition of an entirely novel set of adversarial attacks targeting algorithmic fairness in different scenarios, and that investigating such vulnerabilities will help design more robust algorithms and countermeasures in the future.
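To make the abstract's construction concrete, the following is a minimal, self-contained sketch of a gradient-based poisoning attack on a fairness metric. It is an illustration under stated assumptions, not the paper's actual attack: the victim is a hand-rolled logistic regression, the attacker's objective is the demographic-parity gap, the gradient is approximated by finite differences (a gradient-based attack like the one described in the abstract would compute it analytically through the training problem), and all names (train_logreg, parity_gap, poison) are hypothetical.

    # Illustrative sketch only: the model, fairness metric, and all names
    # below are assumptions for exposition, not the paper's actual attack.
    import numpy as np

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def train_logreg(X, y, lr=0.1, epochs=200):
        # Plain logistic regression fitted by batch gradient descent.
        w = np.zeros(X.shape[1])
        for _ in range(epochs):
            p = sigmoid(X @ w)
            w -= lr * X.T @ (p - y) / len(y)
        return w

    def parity_gap(w, X, group):
        # Demographic-parity gap: difference in mean predicted score
        # between the two groups. Larger = more disparate treatment.
        p = sigmoid(X @ w)
        return abs(p[group == 1].mean() - p[group == 0].mean())

    def attack_loss(Xp, yp, X, y, group):
        # Attacker objective: parity gap of the model retrained on
        # clean + poisoned data (the inner problem of a bilevel attack).
        w = train_logreg(np.vstack([X, Xp]), np.concatenate([y, yp]))
        return parity_gap(w, X, group)

    def poison(X, y, group, n_poison=5, step=0.5, iters=20, eps=1e-3):
        # Gradient ascent on the attacker objective. Finite differences
        # stand in here for analytically derived gradients.
        rng = np.random.default_rng(0)
        Xp = X[rng.choice(len(X), n_poison)].copy()
        yp = rng.integers(0, 2, n_poison).astype(float)
        for _ in range(iters):
            base = attack_loss(Xp, yp, X, y, group)
            grad = np.zeros_like(Xp)
            for i in range(n_poison):
                for j in range(Xp.shape[1]):
                    Xq = Xp.copy()
                    Xq[i, j] += eps
                    grad[i, j] = (attack_loss(Xq, yp, X, y, group) - base) / eps
            Xp = Xp + step * grad  # move points to widen the parity gap
        return Xp, yp

    # Demo on synthetic data: column 0 is a binary protected attribute.
    rng = np.random.default_rng(1)
    X = rng.normal(size=(200, 3))
    X[:, 0] = rng.integers(0, 2, 200)
    group = X[:, 0].astype(int)
    y = (X[:, 1] > 0).astype(float)

    w_clean = train_logreg(X, y)
    Xp, yp = poison(X, y, group)
    w_pois = train_logreg(np.vstack([X, Xp]), np.concatenate([y, yp]))
    print("parity gap, clean model   :", parity_gap(w_clean, X, group))
    print("parity gap, poisoned model:", parity_gap(w_pois, X, group))

The point the sketch preserves is the bilevel structure of such attacks: every evaluation of the attacker's objective retrains the victim on clean-plus-poisoned data, so the poisoning points are optimized against the very model they induce. A real attack would additionally constrain the poisoning points to the feasible feature space (e.g., keeping the protected attribute binary).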
Publication year: 2021
Language: English
Book series: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ISBN: 978-3-030-67657-5; 978-3-030-67658-2
Publisher: Springer Science and Business Media Deutschland GmbH
Volume: 12457
Pages: 162-177 (16 pages)
Conference: European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2020
Contribution type: Contribution
Peer review: Anonymous experts
Conference year: 2020
Conference venue: Online
Relevance: international
Nature: scientific
Keywords: Adversarial machine learning; Algorithmic discrimination; Algorithmic fairness; Machine learning security; Poisoning attacks
Classification: 4 Contribution in Conference Proceedings (Proceeding)::4.1 Contribution in conference proceedings
Authors: Solans, D.; Biggio, B.; Castillo, C.
273
Number of authors: 3
Type: 4.1 Contribution in conference proceedings
Access: partially_open
info:eu-repo/semantics/conferencePaper
Files in this product:

Solans2021_Chapter_PoisoningAttacksOnAlgorithmicF.pdf
Access: archive administrators only (request a copy)
Type: publisher's version
Size: 1.44 MB
Format: Adobe PDF

solans20-ecml.pdf
Access: open access (view/open)
Type: pre-print version
Size: 1.07 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.
