Explainable AI As Evidence of Fair Decisions
Overview
This paper proposes that explanations are valuable to those impacted by a model's decisions (model patients) to the extent that they provide evidence that a past adverse decision was unfair. Under this proposal, we should favor models and explainability methods which generate counterfactuals of two types. The first type of counterfactual is evidence of fairness: a set of states under the control of the patient which (if changed) would have led to a beneficial decision. The second type of counterfactual is evidence of unfairness: a set of irrelevant group or behavioral attributes which (if changed) would have led to a beneficial decision. Each of these counterfactual statements is related to fairness under the Liberal Egalitarian idea that treating one person differently than another is justified only on the basis of features which were plausibly under each person's control. Other aspects of an explanation, such as feature importance and actionable recourse, are not essential under this view, and need not be a goal of explainable AI.
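To make the two counterfactual types concrete, the sketch below shows a toy decision rule and a check for whether changing a single feature would have flipped an adverse decision. This is an illustration only, not drawn from the paper: the loan setting, feature names, thresholds, and the decision function are all hypothetical assumptions.

```python
# Hypothetical toy example of the two counterfactual types.
# All feature names, thresholds, and the decision rule are invented for illustration.

def decide(applicant):
    """Toy decision rule: approve if income is high enough relative to debt,
    but (unfairly) penalize members of group 'B'."""
    score = applicant["income"] - 0.5 * applicant["debt"]
    if applicant["group"] == "B":  # an irrelevant attribute leaking into the decision
        score -= 20
    return "approve" if score >= 50 else "deny"

def counterfactual_flips(applicant, feature, new_value):
    """True if changing one feature would have turned a denial into an approval,
    i.e. the adverse decision was sensitive to that feature."""
    altered = dict(applicant, **{feature: new_value})
    return decide(applicant) == "deny" and decide(altered) == "approve"

applicant = {"income": 75, "debt": 30, "group": "B"}  # denied under the toy rule

# Type 1 (evidence of fairness): a state under the applicant's control (debt)
# which, if changed, would have led to the beneficial decision.
print(counterfactual_flips(applicant, "debt", 0))     # True

# Type 2 (evidence of unfairness): an irrelevant group attribute which,
# if changed, would also have led to the beneficial decision.
print(counterfactual_flips(applicant, "group", "A"))  # True
```

In this toy case, the first counterfactual tells the patient the decision tracked something they could control, while the second reveals that an irrelevant attribute also determined the outcome, which on the proposal above is precisely the kind of evidence of unfairness an explanation should surface.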