82519

Автор(ы): 

Автор(ов): 

2

Параметры публикации

Тип публикации: 

Доклад

Название: 

Neural Machine Translation System for Lezgian, Russian and Azerbaijani Languages

Электронная публикация: 

Да

ISBN/ISSN: 

2767-9535

DOI: 

10.1109/ispras64596.2024.10899143

Наименование конференции: 

  • 2024 Ivannikov Ispras Open Conference (ISPRAS)

Наименование источника: 

  • Proceedings of the Ivannikov Memorial Workshop (IVMEM), 2024

Город: 

  • Москва

Издательство: 

  • IEEE

Год издания: 

2024

Страницы: 

https://ieeexplore.ieee.org/document/10899143
Аннотация
We release the first neural machine translation system for translation between Russian, Azerbaijani and the endangered Lezgian languages, as well as monolingual and parallel datasets collected and aligned for training and evaluating the system. Multiple experiments are conducted to identify how different sets of training language pairs and data domains can influence the resulting translation quality. We achieve BLEU scores of 26.14 for Lezgian-Azerbaijani, 22.89 for Azerbaijani-Lezgian, 29.48 for Lezgian-Russian and 24.25 for Russian-Lezgian pairs. The quality of zero-shot translation is assessed on a Large Language Model, showing its high level of fluency in Lezgian. However, the model often refuses to translate, justifying itself with its incompetence. We contribute our translation model along with the collected parallel and monolingual corpora and sentence encoder for the Lezgian language.

Библиографическая ссылка: 

Асваров А., Грабовой А.В. Neural Machine Translation System for Lezgian, Russian and Azerbaijani Languages / Proceedings of the Ivannikov Memorial Workshop (IVMEM), 2024. М.: IEEE, 2024. С. https://ieeexplore.ieee.org/document/10899143.