A surprisingly robust Trick for the winograd schema challenge
File(s)P19-1478.pdf (227.83 KB)
Published version
Author(s)
Kocijan, Vid
Cretu, Ana-Maria
Camburu, Oana-Maria
Yordanov, Yordan
Lukasiewicz, Thomas
Type
Conference Paper
Abstract
The Winograd Schema Challenge (WSC) dataset WSC273 and its inference counterpart WNLI are popular benchmarks for natural language understanding and commonsense reasoning. In this paper, we show that the performance of three language models on WSC273 consistently and robustly improves when fine-tuned on a similar pronoun disambiguation problem dataset (denoted WSCR). We additionally generate a large unsupervised WSC-like dataset. By fine-tuning the BERT language model both on the introduced and on the WSCR dataset, we achieve overall accuracies of 72.5% and 74.7% on WSC273 and WNLI, improving the previous state-of-the-art solutions by 8.8% and 9.6%, respectively. Furthermore, our fine-tuned models are also consistently more accurate on the “complex” subsets of WSC273, introduced by Trichelair et al. (2018).
Date Issued
2019-08-02
Date Acceptance
2019-07-01
Citation
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, 2019
Publisher
Association for Computational Linguistics
Journal / Book Title
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
Copyright Statement
© 2019 Association for Computational Linguistics
. Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).
. Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).
Identifier
https://www.aclweb.org/anthology/P19-1478/
Source
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
Publication Status
Published
Start Date
2019-07-28
Finish Date
2019-08-02
Date Publish Online
2019-08-02