Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Sentence Concatenation Approach to Data Augmentation for Neural Machine Translation

نهج تسلسل الجملة لتعزيز البيانات للترجمة الآلية العصبية

599 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

Recently, neural machine translation is widely used for its high translation accuracy, but it is also known to show poor performance at long sentence translation. Besides, this tendency appears prominently for low resource languages. We assume that these problems are caused by long sentences being few in the train data. Therefore, we propose a data augmentation method for handling long sentences. Our method is simple; we only use given parallel corpora as train data and generate long sentences by concatenating two sentences. Based on our experiments, we confirm improvements in long sentence translation by proposed data augmentation despite the simplicity. Moreover, the proposed method improves translation quality more when combined with back-translation.

References used

https://aclanthology.org/

rate research

Counterfactual Data Augmentation for Neural Machine Translation

664 - Association for Computation Linguistics 2021 مقالة

We propose a data augmentation method for neural machine translation. It works by interpreting language models and phrasal alignment causally. Specifically, it creates augmented parallel translation corpora by generating (path-specific) counterfactua l aligned phrases. We generate these by sampling new source phrases from a masked language model, then sampling an aligned counterfactual target phrase by noting that a translation language model can be interpreted as a Gumbel-Max Structural Causal Model (Oberst and Sontag, 2019). Compared to previous work, our method takes both context and alignment into account to maintain the symmetry between source and target sequences. Experiments on IWSLT'15 English → Vietnamese, WMT'17 English → German, WMT'18 English → Turkish, and WMT'19 robust English → French show that the method can improve the performance of translation, backtranslation and translation robustness.

القدرة على الاحترام صناعة حمض الفوسفور

mixSeq: A Simple Data Augmentation Methodfor Neural Machine Translation

778 - Association for Computation Linguistics 2021 مقالة

Data augmentation, which refers to manipulating the inputs (e.g., adding random noise,masking specific parts) to enlarge the dataset,has been widely adopted in machine learning. Most data augmentation techniques operate on a single input, which limit s the diversity of the training corpus. In this paper, we propose a simple yet effective data augmentation technique for neural machine translation, mixSeq, which operates on multiple inputs and their corresponding targets. Specifically, we randomly select two input sequences,concatenate them together as a longer input aswell as their corresponding target sequencesas an enlarged target, and train models on theaugmented dataset. Experiments on nine machine translation tasks demonstrate that such asimple method boosts the baselines by a non-trivial margin. Our method can be further combined with single input based data augmentation methods to obtain further improvements.

augmentation methodfor neural data augmentation methodfor methodfor neural machine طريقة تكبير للجدل طريقة تكبير البيانات ل طريقة للآلة العصبية صناعة حمض الفوسفور المزيد..

Data and Parameter Scaling Laws for Neural Machine Translation

714 - Association for Computation Linguistics 2021 مقالة

We observe that the development cross-entropy loss of supervised neural machine translation models scales like a power law with the amount of training data and the number of non-embedding parameters in the model. We discuss some practical implication s of these results, such as predicting BLEU achieved by large scale models and predicting the ROI of labeling data in low-resource language pairs.

ترجمة المصطلحات العصبية parameter scaling laws supervised neural machine القوانين المعلمة القياس آلة عصبية خاضعة للإشراف صناعة حمض الفوسفور

Data Augmentation by Concatenation for Low-Resource Translation: A Mystery and a Solution

616 - Association for Computation Linguistics 2021 مقالة

In this paper, we investigate the driving factors behind concatenation, a simple but effective data augmentation method for low-resource neural machine translation. Our experiments suggest that discourse context is unlikely the cause for concatenatio n improving BLEU by about +1 across four language pairs. Instead, we demonstrate that the improvement comes from three other factors unrelated to discourse: context diversity, length diversity, and (to a lesser extent) position shifting.

تمرين effective data augmentation data augmentation method تكبير البيانات الفعالة طريقة تكبير البيانات صناعة حمض الفوسفور

Syntax-Based Attention Masking for Neural Machine Translation

655 - Association for Computation Linguistics 2021 مقالة

We present a simple method for extending transformers to source-side trees. We define a number of masks that limit self-attention based on relationships among tree nodes, and we allow each attention head to learn which mask or masks to use. On transl ation from English to various low-resource languages, and translation in both directions between English and German, our method always improves over simple linearization of the source-side parse tree and almost always improves over a sequence-to-sequence baseline, by up to +2.1 BLEU.

النصوص الفلسفية syntax-based attention masking masking for neural اخفاء الاهتمام بناء على بناء الجملة اخفاء للآلام العصبية صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Sentence Concatenation Approach to Data Augmentation for Neural Machine Translation

نهج تسلسل الجملة لتعزيز البيانات للترجمة الآلية العصبية

Ask ChatGPT about the research

Read More

suggested questions