Abstract:
Neural Machine Translation (NMT) has seen tremendous growth over the past decade and has already entered a mature phase. Although it is the most widely used solution for Machine Translation, its performance on low-resource language pairs remains sub-optimal compared to that on high-resource pairs, due to the unavailability of large parallel corpora. Therefore, the implementation of NMT techniques for low-resource language pairs has been receiving the spotlight recently, leading to substantial research on this topic. This article presents a detailed survey of research advancements in low-resource language NMT (LRL-NMT), along with a quantitative analysis to identify the most popular techniques. Based on our findings, we provide guidelines for selecting a suitable NMT technique for a given LRL data setting. We also present a holistic view of the LRL-NMT research landscape and provide recommendations to further enhance research efforts.
Citation:
Ranathunga, S., Lee, E.-S. A., Prifti Skenduli, M., Shekhar, R., Alam, M., & Kaur, R. (2023). Neural Machine Translation for Low-resource Languages: A Survey. ACM Computing Surveys, 55(11), Article 229, 1–37. https://doi.org/10.1145/3567592