Neural and FST-based approaches to grammatical error correction
File(s)W19-4424.pdf (391.99 KB)
Published version
Author(s)
Yuan, Zheng
Stahlberg, Felix
Rei, Marek
Byrne, Bill
Yannakoudakis, Helen
Type
Conference Paper
Abstract
In this paper, we describe our submission to the BEA 2019 shared task on grammatical error correction. We present a system pipeline that utilises both error detection and correction models. The input text is first corrected by two complementary neural machine translation systems: one using convolutional networks and multi-task learning, and another using a neural Transformer-based system. Training is performed on publicly available data, along with artificial examples generated through back-translation. The n-best lists of these two machine translation systems are then combined and scored using a finite state transducer (FST). Finally, an unsupervised re-ranking system is applied to the n-best output of the FST. The re-ranker uses a number of error detection features to re-rank the FST n-best list and identify the final 1-best correction hypothesis. Our system achieves 66.75% F 0.5 on error correction (ranking 4th), and 82.52% F 0.5 on token-level error detection (ranking 2nd) in the restricted track of the shared task.
Date Issued
2019-08-02
Date Acceptance
2019-08-02
Citation
Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications, 2019, Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications, pp.228-239
Publisher
Association for Computational Linguistics
Start Page
228
End Page
239
Journal / Book Title
Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications
Volume
Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications
Copyright Statement
© 2019 Association for Computational Linguistics. Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License.
License URL
Identifier
https://www.aclweb.org/anthology/W19-4424/
Source
Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications
Publication Status
Published
Start Date
2019-08-02
Finish Date
2019-08
Coverage Spatial
Florence, Italy
Date Publish Online
2019-08-02