An extended sequence tagging vocabulary for grammatical error correction
File(s)2023.findings-eacl.119.pdf (467.36 KB)
Published version
Author(s)
Mesham, Stuart
Bryant, Christopher
Rei, Marek
Yuan, Zheng
Type
Conference Paper
Abstract
We extend a current sequence-tagging approach to Grammatical Error Correction (GEC) by introducing specialised tags for spelling correction and morphological inflection using the SymSpell and LemmInflect algorithms. Our approach improves generalisation: the proposed new tagset allows a smaller number of tags to correct a larger range of errors. Our results show a performance improvement both overall and in the targeted error categories. We further show that ensembles trained with our new tagset outperform those trained with the baseline tagset on the public BEA benchmark.
Date Issued
2023
Date Acceptance
2023-05-02
Citation
Findings of the Association for Computational Linguistics: EACL 2023, 2023, pp.1608-1619
Publisher
Association for Computational Linguistics
Start Page
1608
End Page
1619
Journal / Book Title
Findings of the Association for Computational Linguistics: EACL 2023
Copyright Statement
ACL materials are Copyright © 1963–2024 ACL; other materials are copyrighted by their respective copyright holders. Materials prior to 2016 here are licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 International License. Permission is granted to make copies for the purposes of teaching and research. Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License.
License URL
Identifier
http://dx.doi.org/10.18653/v1/2023.findings-eacl.119
Source
The 17th Conference of the European Chapter of the Association for Computational Linguistics
Publication Status
Published
Start Date
2023-05-02
Finish Date
2023-05-06
Date Publish Online
2023