Using Pareto simulated annealing to address algorithmic bias in machine learning

William Blanzeisky; Pádraig Cunningham

doi:10.1017/S0269888922000029

Abstract: Algorithmic bias arises in machine learning when models that may have reasonable overall accuracy are biased in favor of ‘good’ outcomes for one side of a sensitive category, for example gender or race. The bias will manifest as an underestimation of good outcomes for the under-represented minority. In a sense, we should not be surprised that a model might be biased when it has not been ‘asked’ not to be; reasonable accuracy can be achieved by ignoring the under-represented minority. A common strategy to address this issue is to include fairness as a component in the learning objective. In this paper, we consider including fairness as an additional criterion in model training and propose a multi-objective optimization strategy using Pareto Simulated Annealing that optimizes for both accuracy and underestimation bias. Our experiments show that this strategy can identify families of models with members representing different accuracy/fairness tradeoffs. We demonstrate the effectiveness of this strategy on two synthetic and two real-world datasets.
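The idea of a family of models representing different accuracy/fairness tradeoffs can be illustrated with a small Pareto-front filter. This is only a minimal sketch, not the authors' implementation: the names `dominates` and `pareto_front` are hypothetical, the scores are made up for illustration, and it assumes both objectives are expressed so that higher is better.

```python
from typing import List, Tuple

# (accuracy, fairness) pair for one candidate model.
# Assumption: both scores are oriented so that higher is better.
Candidate = Tuple[float, float]


def dominates(a: Candidate, b: Candidate) -> bool:
    """True if a is at least as good as b on both objectives
    and strictly better on at least one."""
    no_worse = a[0] >= b[0] and a[1] >= b[1]
    strictly_better = a[0] > b[0] or a[1] > b[1]
    return no_worse and strictly_better


def pareto_front(candidates: List[Candidate]) -> List[Candidate]:
    """Keep only the candidates that no other candidate dominates."""
    return [
        c for c in candidates
        if not any(dominates(other, c) for other in candidates)
    ]


# Made-up scores for five hypothetical models (illustration only).
models = [(0.90, 0.60), (0.85, 0.80), (0.80, 0.95), (0.84, 0.70), (0.78, 0.90)]
front = pareto_front(models)
print(front)
```

In this toy example, the models that survive the filter each give up some accuracy for more fairness or the reverse, so none is strictly preferable to another. A search procedure such as Pareto Simulated Annealing aims to approximate a front of this kind rather than return a single model.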


Received: 31 August 2021

Revised: 24 March 2022

Accepted: 25 March 2022

Published online: 04 May 2022


Competing interests

The authors declare none.

Acknowledgments

This work was funded by Science Foundation Ireland through the SFI Centre for Research Training in Machine Learning (Grant No.18/CRT/6183) with support from Microsoft Ireland.

https://github.com/williamblanzeisky/ParetoSimulatedAnnealing

https://scikit-learn.org/

Rights and permissions

This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.


Cite this article

William Blanzeisky, Pádraig Cunningham. 2022. Using Pareto simulated annealing to address algorithmic bias in machine learning. The Knowledge Engineering Review. 37: doi: 10.1017/S0269888922000029
