New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Sentiment Preservation in Review Translation using Curriculum-based Re-inforcement Framework

تحفظ المعنويات في ترجمة المراجعة باستخدام إطار إعادة المعلومات القائمة على المناهج الدراسية

322 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

curriculum-based re-inforcement framework curriculum-based re-inforcement sentiment preservation إطار إعادة التنسيق القائم على المناهج الدراسية التعزيز القائم على المناهج الدراسية الحفاظ على المعنويات صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

غالبا ما تفشل أنظمة الترجمة الآلية في الحفاظ على خصائص أسلوبية وبراغمية مختلفة لنص المصدر (E.G. المشاعر والمشاعر والسمات الجنسانية وغيرها) إلى الهدف وخاصة في سيناريو منخفض الموارد. يمكن أن تؤثر هذه الخسارة على أداء أي مهمة معالجة اللغة الطبيعية المصب (NLP) ومثل تحليل المعرفات وهذا يعتمد بشدة على إخراج أنظمة MT. أصبحت القابلية للإصابة بفقدان القطبية أكثر شدة عندما يعمل نظام MT لترجمة محتوى مصدر يفتقر إلى بنية لغة شرعية (على سبيل المثال نص المراجعة). لذلك، يجب أن نجد طرقا لتقليل الآثار غير المرغوب فيها لتفقد المعنويات في الترجمة دون المساومة مع الكفاية. في عملنا الحالي، نقدم إطارا عميقا لتعليم التعلم (RL) مع التعلم من المناهج الدراسية (وفقا لصعوبات المكافأة) لضبط معايير نظام MT العصبي المدرب مسبقا بحيث الترجمة التي تم إنشاؤها يقوم بنجاح بترميز المعنويات الأساسية للمصدر دون المساس بالكفاية على عكس الأساليب السابقة. نقوم بتقييم أسلوبنا المقترح على مجموعات البيانات المراجعة باللغة الإنجليزية - الهندية والفرنسية - الإنجليزية (مجال مطعم) ووجدت أن طريقتنا تجلب تحسنا كبيرا على العديد من خطوط الأساس في مهام الترجمة الآلية وتصنيف المعنويات.

Machine Translation (MT) systems often fail to preserve different stylistic and pragmatic properties of the source text (e.g. sentiment and emotion and gender traits and etc.) to the target and especially in a low-resource scenario. Such loss can affect the performance of any downstream Natural Language Processing (NLP) task and such as sentiment analysis and that heavily relies on the output of the MT systems. The susceptibility to sentiment polarity loss becomes even more severe when an MT system is employed for translating a source content that lacks a legitimate language structure (e.g. review text). Therefore and we must find ways to minimize the undesirable effects of sentiment loss in translation without compromising with the adequacy. In our current work and we present a deep re-inforcement learning (RL) framework in conjunction with the curriculum learning (as per difficulties of the reward) to fine-tune the parameters of a pre-trained neural MT system so that the generated translation successfully encodes the underlying sentiment of the source without compromising the adequacy unlike previous methods. We evaluate our proposed method on the English--Hindi (product domain) and French--English (restaurant domain) review datasets and and found that our method brings a significant improvement over several baselines in the machine translation and and sentiment classification tasks.

References used

https://aclanthology.org/

rate research

Analyzing Curriculum Learning for Sentiment Analysis along Task Difficulty, Pacing and Visualization Axes

304 - Association for Computation Linguistics 2021 مقالة

While Curriculum Learning (CL) has recently gained traction in Natural language Processing Tasks, it is still not adequately analyzed. Previous works only show their effectiveness but fail short to explain and interpret the internal workings fully. I n this paper, we analyze curriculum learning in sentiment analysis along multiple axes. Some of these axes have been proposed by earlier works that need more in-depth study. Such analysis requires understanding where curriculum learning works and where it does not. Our axes of analysis include Task difficulty on CL, comparing CL pacing techniques, and qualitative analysis by visualizing the movement of attention scores in the model as curriculum phases progress. We find that curriculum learning works best for difficult tasks and may even lead to a decrement in performance for tasks with higher performance without curriculum learning. We see that One-Pass curriculum strategies suffer from catastrophic forgetting and attention movement visualization within curriculum pacing. This shows that curriculum learning breaks down the challenging main task into easier sub-tasks solved sequentially.

curriculum learning analyzing curriculum learning تعلم المناهج الدراسية معالجة اللغة الطبيعية تحليل تعلم المناهج الدراسية صناعة حمض الفوسفور

Active Curriculum Learning

193 - Association for Computation Linguistics 2021 مقالة

This paper investigates and reveals the relationship between two closely related machine learning disciplines, namely Active Learning (AL) and Curriculum Learning (CL), from the lens of several novel curricula. This paper also introduces Active Curri culum Learning (ACL) which improves AL by combining AL with CL to benefit from the dynamic nature of the AL informativeness concept as well as the human insights used in the design of the curriculum heuristics. Comparison of the performance of ACL and AL on two public datasets for the Named Entity Recognition (NER) task shows the effectiveness of combining AL and CL using our proposed framework.

active curriculum learning active curriculum المناهج الدراسية النشطة التعلم المناهج الدراسية النشطة صناعة حمض الفوسفور

Competence-based Curriculum Learning for Multilingual Machine Translation

282 - Association for Computation Linguistics 2021 مقالة

Currently, multilingual machine translation is receiving more and more attention since it brings better performance for low resource languages (LRLs) and saves more space. However, existing multilingual machine translation models face a severe challe nge: imbalance. As a result, the translation performance of different languages in multilingual translation models are quite different. We argue that this imbalance problem stems from the different learning competencies of different languages. Therefore, we focus on balancing the learning competencies of different languages and propose Competence-based Curriculum Learning for Multilingual Machine Translation, named CCL-M. Specifically, we firstly define two competencies to help schedule the high resource languages (HRLs) and the low resource languages: 1) Self-evaluated Competence, evaluating how well the language itself has been learned; and 2) HRLs-evaluated Competence, evaluating whether an LRL is ready to be learned according to HRLs' Self-evaluated Competence. Based on the above competencies, we utilize the proposed CCL-M algorithm to gradually add new languages into the training set in a curriculum learning manner. Furthermore, we propose a novel competence-aware dynamic balancing sampling strategy for better selecting training samples in multilingual training. Experimental results show that our approach has achieved a steady and significant performance gain compared to the previous state-of-the-art approach on the TED talks dataset.

لغة إزالة السموم صناعة حمض الفوسفور

Semi-Supervised Learning Based on Auto-generated Lexicon Using XAI in Sentiment Analysis

337 - Association for Computation Linguistics 2021 مقالة

In this study, we proposed a novel Lexicon-based pseudo-labeling method utilizing explainable AI(XAI) approach. Existing approach have a fundamental limitation in their robustness because poor classifier leads to inaccurate soft-labeling, and it lead to poor classifier repetitively. Meanwhile, we generate the lexicon consists of sentiment word based on the explainability score. Then we calculate the confidence of unlabeled data with lexicon and add them into labeled dataset for the robust pseudo-labeling approach. Our proposed method has three contributions. First, the proposed methodology automatically generates a lexicon based on XAI and performs independent pseudo-labeling, thereby guaranteeing higher performance and robustness compared to the existing one. Second, since lexicon-based pseudo-labeling is performed without re-learning in most of models, time efficiency is considerably increased, and third, the generated high-quality lexicon can be available for sentiment analysis of data from similar domains. The effectiveness and efficiency of our proposed method were verified through quantitative comparison with the existing pseudo-labeling method and qualitative review of the generated lexicon.

semi-supervised learning based semi-supervised learning learning based التعلم شبه الإشراف على أساس التعلم شبه الإشرافه التعلم مقرها صناعة حمض الفوسفور المزيد..

Automatic Speech-Based Checklist for Medical Simulations

385 - Association for Computation Linguistics 2021 مقالة

Medical simulators provide a controlled environment for training and assessing clinical skills. However, as an assessment platform, it requires the presence of an experienced examiner to provide performance feedback, commonly preformed using a task s pecific checklist. This makes the assessment process inefficient and expensive. Furthermore, this evaluation method does not provide medical practitioners the opportunity for independent training. Ideally, the process of filling the checklist should be done by a fully-aware objective system, capable of recognizing and monitoring the clinical performance. To this end, we have developed an autonomous and a fully automatic speech-based checklist system, capable of objectively identifying and validating anesthesia residents' actions in a simulation environment. Based on the analyzed results, our system is capable of recognizing most of the tasks in the checklist: F1 score of 0.77 for all of the tasks, and F1 score of 0.79 for the verbal tasks. Developing an audio-based system will improve the experience of a wide range of simulation platforms. Furthermore, in the future, this approach may be implemented in the operation room and emergency room. This could facilitate the development of automatic assistive technologies for these domains.

medical simulators provide automatic speech-based checklist checklist المحاكاة الطبية توفر قائمة المراجعة القائمة على الكلام قائمة تدقيق صناعة حمض الفوسفور المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Sentiment Preservation in Review Translation using Curriculum-based Re-inforcement Framework

تحفظ المعنويات في ترجمة المراجعة باستخدام إطار إعادة المعلومات القائمة على المناهج الدراسية

Ask ChatGPT about the research

Read More

suggested questions