Redirigiendo al acceso original de articulo en 23 segundos...
Inicio  /  Informatics  /  Vol: 8 Par: 1 (2021)  /  Artículo
ARTÍCULO
TITULO

Towards a Better Integration of Fuzzy Matches in Neural Machine Translation through Data Augmentation

Arda Tezcan    
Bram Bulté and Bram Vanroy    

Resumen

We identify a number of aspects that can boost the performance of Neural Fuzzy Repair (NFR), an easy-to-implement method to integrate translation memory matches and neural machine translation (NMT). We explore various ways of maximising the added value of retrieved matches within the NFR paradigm for eight language combinations, using Transformer NMT systems. In particular, we test the impact of different fuzzy matching techniques, sub-word-level segmentation methods and alignment-based features on overall translation quality. Furthermore, we propose a fuzzy match combination technique that aims to maximise the coverage of source words. This is supplemented with an analysis of how translation quality is affected by input sentence length and fuzzy match score. The results show that applying a combination of the tested modifications leads to a significant increase in estimated translation quality over all baselines for all language combinations.

 Artículos similares

       
 
Minhaz Farid Ahmed, Mazlin Bin Mokhtar, Chen Kim Lim, Izzati Afiqah Binti Che Suza, Ku Adriani Ku Ayob, Rd. Puteri Khairani Khirotdin and Nuriah Abd Majid    
Malaysia has numerous policies, institutions, and experts with foresight and vision for its development. Nevertheless, river basin management has been lacking due to several factors such as insufficient proactive leadership roles of institutions, as well... ver más
Revista: Water

 
Jiamei Wu, Chenxu Zhang, Huifen Yang, Pan Chen and Jian Cao    
The development of phytoremediation technology is constrained by gentle phytoextraction efficiency and slow biomass accumulation. In this study, a combined remediation of pioneer plants and solid waste towards Cd- and As-contaminated farmland soil was ex... ver más
Revista: Applied Sciences

 
Emily Kate Parsons, Emmanouil Panaousis, George Loukas and Georgia Sakellari    
The Internet of Things (IoT) continues to grow at a rapid pace, becoming integrated into the daily operations of individuals and organisations. IoT systems automate crucial services within daily life that users may rely on, which makes the assurance of s... ver más
Revista: Applied Sciences

 
Chiahsin Lin    
For the second year in a row, the theme is ?reef coral biotechnology?, specifically the interface between basic science and conservation. It has never been more important to attempt to leverage what we know about these beautiful, albeit highly imperiled ... ver más
Revista: Applied Sciences

 
George Ioannou, Georgios Alexandridis and Andreas Stafylopatis    
Importance sampling, a variant of online sampling, is often used in neural network training to improve the learning process, and, in particular, the convergence speed of the model. We study, here, the performance of a set of batch selection algorithms, n... ver más
Revista: Algorithms