$content.nome.text $content.cognome.text

Poisoning Attacks on Algorithmic Fairness

Solans D.;Biggio B.;Castillo C.

2021-01-01

Abstract

Research in adversarial machine learning has shown how the performance of machine learning models can be seriously compromised by injecting even a small fraction of poisoning points into the training data. While the effects on model accuracy of such poisoning attacks have been widely studied, their potential effects on other model performance metrics remain to be evaluated. In this work, we introduce an optimization framework for poisoning attacks against algorithmic fairness, and develop a gradient-based poisoning attack aimed at introducing classification disparities among different groups in the data. We empirically show that our attack is effective not only in the white-box setting, in which the attacker has full access to the target model, but also in a more challenging black-box scenario in which the attacks are optimized against a substitute model and then transferred to the target model. We believe that our findings pave the way towards the definition of an entirely novel set of adversarial attacks targeting algorithmic fairness in different scenarios, and that investigating such vulnerabilities will help design more robust algorithms and countermeasures in the future.

Short Card

Tab complete

Full Sheet(DC)

         Anno 
       
        2021 
       
         Codice ISBN 
       
        978-3-030-67657-5
978-3-030-67658-2 
       
         Parole chiave 
       
        Adversarial machine learning
Algorithmic discrimination
Algorithmic fairness
Machine learning security
Poisoning attacks 
       
         Type: 
       
        4.1 Contributo in Atti di convegno

Files in This Item:

File	Size	Format
Solans2021_Chapter_PoisoningAttacksOnAlgorithmicF.pdf Solo gestori archivio Type: versione editoriale Size 1.44 MB Format Adobe PDF & nbsp; View / Open Request a copy	1.44 MB	Adobe PDF	& nbsp; View / Open Request a copy
solans20-ecml.pdf open access Type: versione pre-print Size 1.07 MB Format Adobe PDF View/Open	1.07 MB	Adobe PDF	View/Open

University of Cagliari

University of Cagliari

Poisoning Attacks on Algorithmic Fairness

Solans D.;Biggio B.;Castillo C.

2021-01-01

Abstract

Short Card

Tab complete

Full Sheet(DC)

Poisoning Attacks on Algorithmic Fairness

Solans D.;Biggio B.;Castillo C.

2021-01-01

Abstract

Short Card Tab complete Full Sheet(DC)

Questionnaire and social

Short Card

Tab complete

Full Sheet(DC)