Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Understanding the Impact of UGC Specificities on Translation Quality

فهم تأثير خصوصيات UGC على جودة الترجمة

601 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

This work takes a critical look at the evaluation of user-generated content automatic translation, the well-known specificities of which raise many challenges for MT. Our analyses show that measuring the average-case performance using a standard metric on a UGC test set falls far short of giving a reliable image of the UGC translation quality. That is why we introduce a new data set for the evaluation of UGC translation in which UGC specificities have been manually annotated using a fine-grained typology. Using this data set, we conduct several experiments to measure the impact of different kinds of UGC specificities on translation quality, more precisely than previously possible.

References used

https://aclanthology.org/

rate research

Noisy UGC Translation at the Character Level: Revisiting Open-Vocabulary Capabilities and Robustness of Char-Based Models

880 - Association for Computation Linguistics 2021 مقالة

This work explores the capacities of character-based Neural Machine Translation to translate noisy User-Generated Content (UGC) with a strong focus on exploring the limits of such approaches to handle productive UGC phenomena, which almost by definit ion, cannot be seen at training time. Within a strict zero-shot scenario, we first study the detrimental impact on translation performance of various user-generated content phenomena on a small annotated dataset we developed and then show that such models are indeed incapable of handling unknown letters, which leads to catastrophic translation failure once such characters are encountered. We further confirm this behavior with a simple, yet insightful, copy task experiment and highlight the importance of reducing the vocabulary size hyper-parameter to increase the robustness of character-based models for machine translation.

revisiting open-vocabulary capabilities revisiting open-vocabulary open-vocabulary capabilities إعادة النظر في قدرات المفردات المفتوحة إعادة النظر في المفردات المفتوحة قدرات المفردات المفتوحة صناعة حمض الفوسفور المزيد..

Understanding the Impact of Evidence-Aware Sentence Selection for Fact Checking

630 - Association for Computation Linguistics 2021 مقالة

Fact Extraction and VERification (FEVER) is a recently introduced task that consists of the following subtasks (i) document retrieval, (ii) sentence retrieval, and (iii) claim verification. In this work, we focus on the subtask of sentence retrieval. Specifically, we propose an evidence-aware transformer-based model that outperforms all other models in terms of FEVER score by using a subset of training instances. In addition, we conduct a large experimental study to get a better understanding of the problem, while we summarize our findings by presenting future research challenges.

evidence-aware sentence selection fact checking sentence selection اختيار عقوبة الاطلاع على الأدلة التحقق من حقيقة اختيار الجملة صناعة حمض الفوسفور المزيد..

Regressive Ensemble for Machine Translation Quality Evaluation

980 - Association for Computation Linguistics 2021 مقالة

This work introduces a simple regressive ensemble for evaluating machine translation quality based on a set of novel and established metrics. We evaluate the ensemble using a correlation to expert-based MQM scores of the WMT 2021 Metrics workshop. In both monolingual and zero-shot cross-lingual settings, we show a significant performance improvement over single metrics. In the cross-lingual settings, we also demonstrate that an ensemble approach is well-applicable to unseen languages. Furthermore, we identify a strong reference-free baseline that consistently outperforms the commonly-used BLEU and METEOR measures and significantly improves our ensemble's performance.

translation quality evaluation machine translation quality تقييم جودة الترجمة جودة الترجمة الجودة تقييم الجودة صناعة حمض الفوسفور

Understanding and Interpreting the Impact of User Context in Hate Speech Detection

849 - Association for Computation Linguistics 2021 مقالة

As hate speech spreads on social media and online communities, research continues to work on its automatic detection. Recently, recognition performance has been increasing thanks to advances in deep learning and the integration of user features. This work investigates the effects that such features can have on a detection model. Unlike previous research, we show that simple performance comparison does not expose the full impact of including contextual- and user information. By leveraging explainability techniques, we show (1) that user features play a role in the model's decision and (2) how they affect the feature space learned by the model. Besides revealing that---and also illustrating why---user features are the reason for performance gains, we show how such techniques can be combined to better understand the model and to detect unintended bias.

لغة عنصرية understanding and interpreting فهم وتفسير صناعة حمض الفوسفور

Direct Exploitation of Attention Weights for Translation Quality Estimation

656 - Association for Computation Linguistics 2021 مقالة

The paper presents our submission to the WMT2021 Shared Task on Quality Estimation (QE). We participate in sentence-level predictions of human judgments and post-editing effort. We propose a glass-box approach based on attention weights extracted fro m machine translation systems. In contrast to the previous works, we directly explore attention weight matrices without replacing them with general metrics (like entropy). We show that some of our models can be trained with a small amount of a high-cost labelled data. In the absence of training data our approach still demonstrates a moderate linear correlation, when trained with synthetic data.

محاكاة متعددة اللغات متعددة اللغات direct exploitation الاستغلال المباشر صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Understanding the Impact of UGC Specificities on Translation Quality

فهم تأثير خصوصيات UGC على جودة الترجمة

Ask ChatGPT about the research

Read More

suggested questions