A roadmap to neural automatic post-editing: an empirical approach

Dimitar Shterionov; Félix do Carmo; Joss Moorkens; Murhaf Hossari; Joachim Wagner; Eric Paquin; Dag Schmidtke; Declan Groves; Andy Way

doi:10.1007/s10590-020-09249-7

A roadmap to neural automatic post-editing: an empirical approach

Mach Transl. 2020;34(2):67-96. doi: 10.1007/s10590-020-09249-7. Epub 2020 Sep 3.

Authors

Dimitar Shterionov^{1

2}, Félix do Carmo^{3

2}, Joss Moorkens⁴, Murhaf Hossari², Joachim Wagner², Eric Paquin², Dag Schmidtke⁵, Declan Groves⁵, Andy Way²

Affiliations

¹ Department of Cognitive Science and Artificial Intelligence, Tilburg University, Tilburg, The Netherlands.
² ADAPT Centre, School of Computing, Dublin City University, Dublin, Ireland.
³ Centre for Translation Studies, University of Surrey, Surrey, UK.
⁴ ADAPT Centre and School of Applied Language and Intercultural Studies, Dublin City University, Dublin, Ireland.
⁵ Microsoft, South County Business Park, Leopardstown, Dublin, Ireland.

Abstract

In a translation workflow, machine translation (MT) is almost always followed by a human post-editing step, where the raw MT output is corrected to meet required quality standards. To reduce the number of errors human translators need to correct, automatic post-editing (APE) methods have been developed and deployed in such workflows. With the advances in deep learning, neural APE (NPE) systems have outranked more traditional, statistical, ones. However, the plethora of options, variables and settings, as well as the relation between NPE performance and train/test data makes it difficult to select the most suitable approach for a given use case. In this article, we systematically analyse these different parameters with respect to NPE performance. We build an NPE "roadmap" to trace the different decision points and train a set of systems selecting different options through the roadmap. We also propose a novel approach for APE with data augmentation. We then analyse the performance of 15 of these systems and identify the best ones. In fact, the best systems are the ones that follow the newly-proposed method. The work presented in this article follows from a collaborative project between Microsoft and the ADAPT centre. The data provided by Microsoft originates from phrase-based statistical MT (PBSMT) systems employed in production. All tested NPE systems significantly increase the translation quality, proving the effectiveness of neural post-editing in the context of a commercial translation workflow that leverages PBSMT.

Keywords: Automatic post-editing; Deep learning; Empirical evaluation; Machine Translation; Multi-source; Neural post-editing.