Human-grounded evaluations of explanation methods for text classification
File(s)D19-1523.pdf (455.81 KB)
Published version
Author(s)
Lertvittayakumjorn, Piyawat
Toni, Francesca
Type
Conference Paper
Abstract
Due to the black-box nature of deep learning models, methods for explaining the models’ results are crucial to gain trust from humans and support collaboration between AIs
and humans. In this paper, we consider several model-agnostic and model-specific explanation methods for CNNs for text classification and conduct three human-grounded evaluations, focusing on different purposes of explanations: (1) revealing model behavior, (2)
justifying model predictions, and (3) helping humans investigate uncertain predictions.
The results highlight dissimilar qualities of the
various explanation methods we consider and
show the degree to which these methods could
serve for each purpose.
and humans. In this paper, we consider several model-agnostic and model-specific explanation methods for CNNs for text classification and conduct three human-grounded evaluations, focusing on different purposes of explanations: (1) revealing model behavior, (2)
justifying model predictions, and (3) helping humans investigate uncertain predictions.
The results highlight dissimilar qualities of the
various explanation methods we consider and
show the degree to which these methods could
serve for each purpose.
Date Acceptance
2019-08-13
Citation
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp.5195-5205
Publisher
ACL Anthology
Start Page
5195
End Page
5205
Journal / Book Title
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
Copyright Statement
© 2019 Association for Computational Linguistics. Licensed on a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/)
License URL
Identifier
https://www.aclweb.org/anthology/D19-1523
Source
2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing
Subjects
cs.CL
cs.CL
cs.AI
cs.LG
Publication Status
Published
Start Date
2019-11-03
Finish Date
2019-11-07
Coverage Spatial
Hong Kong, China