تصف هذه الورقة تقديم ISTIC إلى مهمة الترجمة الآلية الثلاثية من الترجمة الآلية الروسية إلى الصينية ل WMT '2021. من أجل الاستفادة الكاملة من الشركة المقدمة وتعزيز أداء الترجمة من الروسية إلى الصينية، يتم استخدام طريقة المحور في موقعناالنظام الذي خط أنابيب الترجمة الروسية إلى الإنجليزية والمترجم الإنجليزي إلى الصيني لتشكيل مترجم روسي إلى صيني.يعتمد نظامنا على بنية المحولات ويتم اعتماد العديد من الاستراتيجيات الفعالة لتحسين جودة الترجمة، بما في ذلك تصفية Corpus ومعالجة البيانات ومجمع النظام وفرقة النموذج.
This paper describes the ISTIC's submission to the Triangular Machine Translation Task of Russian-to-Chinese machine translation for WMT' 2021. In order to fully utilize the provided corpora and promote the translation performance from Russian to Chinese, the pivot method is used in our system which pipelines the Russian-to-English translator and the English-to-Chinese translator to form a Russian-to-Chinese translator. Our system is based on the Transformer architecture and several effective strategies are adopted to improve the quality of translation, including corpus filtering, data pre-processing, system combination and model ensemble.
References used
https://aclanthology.org/
This paper describes DUT-NLP Lab's submission to the WMT-21 triangular machine translation shared task. The participants are not allowed to use other data and the translation direction of this task is Russian-to-Chinese. In this task, we use the Tran
This paper describes Mininglamp neural machine translation systems of the WMT2021 news translation tasks. We have participated in eight directions translation tasks for news text including Chinese to/from English, Hausa to/from English, German to/fro
This paper describes NiuTrans neural machine translation systems of the WMT 2021 news translation tasks. We made submissions to 9 language directions, including English2Chinese, Japanese, Russian, Icelandic and English2Hausa tasks. Our primary system
This paper describes TenTrans large-scale multilingual machine translation system for WMT 2021. We participate in the Small Track 2 in five South East Asian languages, thirty directions: Javanese, Indonesian, Malay, Tagalog, Tamil, English. We mainly
This paper describes ANVITA-1.0 MT system, architected for submission to WAT2021 MultiIndicMT shared task by mcairt team, where the team participated in 20 translation directions: English→Indic and Indic→English; Indic set comprised of 10 Indian lang