New community

Subscribe to the gold package and get unlimited access to Shamra Academy

NUIG-DSI's submission to The GEM Benchmark 2021

تقديم نويج DSI إلى المعيار GEM 2021

172 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

gem benchmark gem benchmark paper gem مؤشر جوهرة جوهرة مرجع الورق جوهرة صناعة حمض الفوسفور

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

This paper describes the submission by NUIG-DSI to the GEM benchmark 2021. We participate in the modeling shared task where we submit outputs on four datasets for data-to-text generation, namely, DART, WebNLG (en), E2E and CommonGen. We follow an approach similar to the one described in the GEM benchmark paper where we use the pre-trained T5-base model for our submission. We train this model on additional monolingual data where we experiment with different masking strategies specifically focused on masking entities, predicates and concepts as well as a random masking strategy for pre-training. In our results we find that random masking performs the best in terms of automatic evaluation metrics, though the results are not statistically significantly different compared to other masking strategies.

References used

https://aclanthology.org/

rate research

SimpleNER Sentence Simplification System for GEM 2021

350 - Association for Computation Linguistics 2021 مقالة

This paper describes SimpleNER, a model developed for the sentence simplification task at GEM-2021. Our system is a monolingual Seq2Seq Transformer architecture that uses control tokens pre-pended to the data, allowing the model to shape the generate d simplifications according to user desired attributes. Additionally, we show that NER-tagging the training data before use helps stabilize the effect of the control tokens and significantly improves the overall performance of the system. We also employ pretrained embeddings to reduce data sparsity and allow the model to produce more generalizable outputs.

simplener sentence simplification sentence simplification system sentence simplification task Simplener الجملة تبسيط نظام تبسيط الجملة جملة تبسيط المهمة صناعة حمض الفوسفور المزيد..

IST-Unbabel 2021 Submission for the Quality Estimation Shared Task

367 - Association for Computation Linguistics 2021 مقالة

We present the joint contribution of IST and Unbabel to the WMT 2021 Shared Task on Quality Estimation. Our team participated on two tasks: Direct Assessment and Post-Editing Effort, encompassing a total of 35 submissions. For all submissions, our ef forts focused on training multilingual models on top of OpenKiwi predictor-estimator architecture, using pre-trained multilingual encoders combined with adapters. We further experiment with and uncertainty-related objectives and features as well as training on out-of-domain direct assessment data.

الاستغلال المباشر صناعة حمض الفوسفور

ICL's Submission to the WMT21 Critical Error Detection Shared Task

553 - Association for Computation Linguistics 2021 مقالة

This paper presents Imperial College London's submissions to the WMT21 Quality Estimation (QE) Shared Task 3: Critical Error Detection. Our approach builds on cross-lingual pre-trained representations in a sequence classification model. We further im prove the base classifier by (i) adding a weighted sampler to deal with unbalanced data and (ii) introducing feature engineering, where features related to toxicity, named-entities and sentiment, which are potentially indicative of critical errors, are extracted using existing tools and integrated to the model in different ways. We train models with one type of feature at a time and ensemble those models that improve over the base classifier on the development (dev) set. Our official submissions achieve very competitive results, ranking second for three out of four language pairs.

detection shared task critical error detection error detection shared الكشف عن المهمة المشتركة كشف خطأ حرج كشف خطأ صناعة حمض الفوسفور المزيد..

Are References Really Needed? Unbabel-IST 2021 Submission for the Metrics Shared Task

360 - Association for Computation Linguistics 2021 مقالة

In this paper, we present the joint contribution of Unbabel and IST to the WMT 2021 Metrics Shared Task. With this year's focus on Multidimensional Quality Metric (MQM) as the ground-truth human assessment, our aim was to steer COMET towards higher c orrelations with MQM. We do so by first pre-training on Direct Assessments and then fine-tuning on z-normalized MQM scores. In our experiments we also show that reference-free COMET models are becoming competitive with reference-based models, even outperforming the best COMET model from 2020 on this year's development data. Additionally, we present COMETinho, a lightweight COMET model that is 19x faster on CPU than the original model, while also achieving state-of-the-art correlations with MQM. Finally, in the QE as a metric'' track, we also participated with a QE model trained using the OpenKiwi framework leveraging MQM scores and word-level annotations.

references really needed المراجع حاجة حقا صناعة حمض الفوسفور

Hybrid Statistical Machine Translation for English-Myanmar: UTYCC Submission to WAT-2021

422 - Association for Computation Linguistics 2021 مقالة

In this paper we describe our submissions to WAT-2021 (Nakazawa et al., 2021) for English-to-Myanmar language (Burmese) task. Our team, ID: YCC-MT1'', focused on bringing transliteration knowledge to the decoder without changing the model. We manuall y extracted the transliteration word/phrase pairs from the ALT corpus and applying XML markup feature of Moses decoder (i.e. -xml-input exclusive, -xml-input inclusive). We demonstrate that hybrid translation technique can significantly improve (around 6 BLEU scores) the baseline of three well-known Phrase-based SMT'', Operation Sequence Model'' and Hierarchical Phrase-based SMT''. Moreover, this simple hybrid method achieved the second highest results among the submitted MT systems for English-to-Myanmar WAT2021 translation share task according to BLEU (Papineni et al., 2002) and AMFM scores (Banchs et al., 2015).

statistical machine translation hybrid statistical machine statistical machine ترجمة آلة إحصائية الهجين آلة الإحصاء الآلة الإحصائية صناعة حمض الفوسفور المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

NUIG-DSI's submission to The GEM Benchmark 2021

تقديم نويج DSI إلى المعيار GEM 2021

Ask ChatGPT about the research

Read More

suggested questions