Personalized machine translation: preserving original author traits
File(s)document(7).pdf (226.99 KB)
Published version
Author(s)
Rabinovich, E
Mirkin, S
Patel, RN
Specia, L
Wintner, S
Type
Conference Paper
Abstract
© 2017 Association for Computational Linguistics. The language that we produce reflects our personality, and various personal and demographic characteristics can be detected in natural language texts. We focus on one particular personal trait of the author, gender, and study how it is manifested in original texts and in translations. We show that author's gender has a powerful, clear signal in originals texts, but this signal is obfuscated in human and machine translation. We then propose simple domainadaptation techniques that help retain the original gender traits in the translation, without harming the quality of the translation, thereby creating more personalized machine translation systems.
Date Issued
2017-04-07
Date Acceptance
2017-04-03
Citation
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers, 2017, 1 (E17-1), pp.1074-1084
ISBN
9781510838604
Publisher
The Association for Computational Linguistics
Start Page
1074
End Page
1084
Journal / Book Title
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers
Volume
1
Issue
E17-1
Copyright Statement
© 2017 Association for Computational Linguistics. Licensed on a Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/).
Identifier
http://aclweb.org/anthology/E17-1101
Source
15th Conference of the European Chapter of the Association for Computational Linguistics
Subjects
cs.CL
Publication Status
Published
Start Date
2017-04-03
Finish Date
2017-04-07
Coverage Spatial
Valencia, Spain