Should counterfactual explanations always be data instances?
File(s)XLoKR-2022-0630-final.pdf (175.98 KB)
Accepted version
Author(s)
Jiang, Jay
Rago, antonio
Toni, Francesca
Type
Conference Paper
Abstract
Counterfactual explanations (CEs) are an increasingly popular way of explaining machine learning classifiers. Predominantly, they amount to data instances pointing to potential changes to the inputs that would lead to alternative outputs. In this position paper we question the widespread assumption that CEs should always be data instances, and argue instead that in some cases they may be better understood in terms of special types of relations between input features and classification variables. We illustrate how a special type of these relations, amounting to critical influences, can characterise and guide the search for data instances deemed suitable as CEs. These relations also provide compact indications of which input features - rather than their specific values in data instances - have counterfactual value.
Date Issued
2022-07-31
Date Acceptance
2022-06-11
Citation
2022
Copyright Statement
© The Author(s) 2022.
Source
XLoKR 2022: The Third Workshop on Explainable Logic-Based Knowledge Representation
Publication Status
Published
Start Date
2022-07-31
Finish Date
2022-07-31
Coverage Spatial
Haifa, Israel