Research papers and master's and doctoral theses about supervised neural machine

Data and Parameter Scaling Laws for Neural Machine Translation

202 - Association for Computational Linguistics 2021 Article

We observe that the development cross-entropy loss of supervised neural machine translation models scales like a power law with the amount of training data and the number of non-embedding parameters in the model. We discuss some practical implications of these results, such as predicting the BLEU achieved by large-scale models and predicting the ROI of labeling data in low-resource language pairs.
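To make the power-law claim concrete, here is a minimal sketch of fitting such a curve and extrapolating from it. It assumes the simplest form, loss = a * x^(-alpha), with no constant offset; the abstract does not give the exact functional form, so that choice, the function names `fit_power_law` and `predict_loss`, and all numbers below are illustrative assumptions rather than the paper's method or results.

```python
import math

def fit_power_law(xs, ys):
    """Fit y = a * x**(-alpha) by ordinary least squares in log-log space.

    Assumption: a pure power law with no irreducible-loss offset.
    Returns (a, alpha).
    """
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    n = len(lx)
    mean_x = sum(lx) / n
    mean_y = sum(ly) / n
    sxx = sum((u - mean_x) ** 2 for u in lx)
    sxy = sum((u - mean_x) * (v - mean_y) for u, v in zip(lx, ly))
    slope = sxy / sxx                  # slope of log y against log x
    intercept = mean_y - slope * mean_x
    return math.exp(intercept), -slope

def predict_loss(a, alpha, x):
    """Extrapolate the fitted curve to a new data or parameter budget x."""
    return a * x ** (-alpha)

# Hypothetical data: synthetic losses generated from a = 20, alpha = 0.25.
sizes = [1e5, 1e6, 1e7, 1e8]
losses = [20.0 * s ** -0.25 for s in sizes]

a, alpha = fit_power_law(sizes, losses)
print(f"a = {a:.3f}, alpha = {alpha:.3f}")
print(f"extrapolated loss at 1e9: {predict_loss(a, alpha, 1e9):.4f}")
```

The same fit can be run with the number of non-embedding parameters on the x-axis instead of the amount of training data. Estimating the return on labeling more data then amounts to comparing `predict_loss` at the current and the proposed dataset size.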

neural terminology translation, parameter scaling laws, supervised neural machine, phosphoric acid industry

Integrating Unsupervised Data Generation into Self-Supervised Neural Machine Translation for Low-Resource Languages

261 - Association for Computational Linguistics 2021 Article

For most language combinations, parallel data is either scarce or simply unavailable. To address this, unsupervised machine translation (UMT) exploits large amounts of monolingual data by using synthetic data generation techniques such as back-translation and noising, while self-supervised NMT (SSNMT) identifies parallel sentences in smaller comparable data and trains on them. To date, the inclusion of UMT data generation techniques in SSNMT has not been investigated. We show that including UMT techniques in SSNMT significantly outperforms SSNMT (up to +4.3 BLEU, af2en) as well as statistical (+50.8 BLEU) and hybrid UMT (+51.5 BLEU) baselines on related, distantly related, and unrelated language pairs.
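As an illustration of the "noising" idea mentioned above, the sketch below corrupts a tokenized sentence by randomly dropping words and lightly shuffling word order, a common recipe in unsupervised MT. The abstract does not say which noise functions the authors use, so the function `add_noise`, its parameters, and the default values are hypothetical.

```python
import random

def add_noise(tokens, drop_prob=0.1, max_shift=3, rng=None):
    """Return a noisy copy of `tokens` (hypothetical helper, not the paper's code).

    1. Word dropout: each token is removed with probability `drop_prob`.
    2. Local shuffle: each kept token gets the sort key
       index + uniform(0, max_shift), so tokens only move a few positions.
    """
    rng = rng or random.Random()
    kept = [t for t in tokens if rng.random() >= drop_prob]
    if not kept and tokens:
        # Never return an empty sentence for a non-empty input.
        kept = [rng.choice(tokens)]
    keys = [i + rng.uniform(0, max_shift) for i in range(len(kept))]
    order = sorted(range(len(kept)), key=lambda i: keys[i])
    return [kept[i] for i in order]

sentence = "parallel data is either scarce or simply unavailable".split()
noisy = add_noise(sentence, rng=random.Random(0))
print(" ".join(noisy))
```

A pair (noisy sentence, original sentence) can then serve as a synthetic training example, in the same spirit as pairing a back-translated sentence with its original.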

India case study, self-supervised neural machine, integrating unsupervised data, phosphoric acid industry
