Redirigiendo al acceso original de articulo en 15 segundos...
Inicio  /  Informatics  /  Vol: 6 Par: 3 (2019)  /  Artículo
ARTÍCULO
TITULO

Translation Quality and Error Recognition in Professional Neural Machine Translation Post-Editing

Jennifer Vardaro    
Moritz Schaeffer and Silvia Hansen-Schirra    

Resumen

This study aims to analyse how translation experts from the German department of the European Commission?s Directorate-General for Translation (DGT) identify and correct different error categories in neural machine translated texts (NMT) and their post-edited versions (NMTPE). The term translation expert encompasses translator, post-editor as well as revisor. Even though we focus on neural machine-translated segments, translator and post-editor are used synonymously because of the combined workflow using CAT-Tools as well as machine translation. Only the distinction between post-editor, which refers to a DGT translation expert correcting the neural machine translation output, and revisor, which refers to a DGT translation expert correcting the post-edited version of the neural machine translation output, is important and made clear whenever relevant. Using an automatic error annotation tool and the more fine-grained manual error annotation framework to identify characteristic error categories in the DGT texts, a corpus analysis revealed that quality assurance measures by post-editors and revisors of the DGT are most often necessary for lexical errors. More specifically, the corpus analysis showed that, if post-editors correct mistranslations, terminology or stylistic errors in an NMT sentence, revisors are likely to correct the same error type in the same post-edited sentence, suggesting that the DGT experts were being primed by the NMT output. Subsequently, we designed a controlled eye-tracking and key-logging experiment to compare participants? eye movements for test sentences containing the three identified error categories (mistranslations, terminology or stylistic errors) and for control sentences without errors. We examined the three error types? effect on early (first fixation durations, first pass durations) and late eye movement measures (e.g., total reading time and regression path durations). Linear mixed-effects regression models predict what kind of behaviour of the DGT experts is associated with the correction of different error types during the post-editing process.

 Artículos similares

       
 
Séamus Lankford, Haithem Afli and Andy Way    
In this study, a human evaluation is carried out on how hyperparameter settings impact the quality of Transformer-based Neural Machine Translation (NMT) for the low-resourced English?Irish pair. SentencePiece models using both Byte Pair Encoding (BPE) an... ver más
Revista: Information

 
Aliya Turganbayeva, Diana Rakhimova, Vladislav Karyukin, Aidana Karibayeva and Asem Turarbek    
The problems of machine translation are constantly arising. While the most advanced translation platforms, such as Google and Yandex, allow for high-quality translations of languages with simple grammatical structures, more morphologically rich languages... ver más
Revista: Information

 
Giorgia Giacomini, Caterina Graziani, Veronica Lachi, Pietro Bongini, Niccolò Pancino, Monica Bianchini, Davide Chiarugi, Angelo Valleriani and Paolo Andreini    
In recent years, the Ribosome profiling technique (Ribo?seq) has emerged as a powerful method for globally monitoring the translation process in vivo at single nucleotide resolution. Based on deep sequencing of mRNA fragments, Ribo?seq allows to obtain p... ver más
Revista: Algorithms

 
Junjie Liu, Yong Yang, Xiaochao Fan, Ge Ren, Liang Yang and Qian Ning    
The rapid identification of offensive language in social media is of great significance for preventing viral spread and reducing the spread of malicious information, such as cyberbullying and content related to self-harm. In existing research, the public... ver más

 
Zhen Li, Heng Yao, Ran Shi, Tong Qiao and Chuan Qin    
In daily life, when taking photos of scenes containing glass, the images of the dominant transmission layer and the weak reflection layer are often blended, which are difficult to be uncoupled. Meanwhile, because the reflection layer contains sufficient ... ver más
Revista: Applied Sciences