Research papers, master and doctoral theses about إنجليزي

Adam Mickiewicz University's English-Hausa Submissions to the WMT 2021 News Translation Task

137 - Association for Computation Linguistics 2021 مقالة

This paper presents the Adam Mickiewicz University's (AMU) submissions to the WMT 2021 News Translation Task. The submissions focus on the English↔Hausa translation directions, which is a low-resource translation scenario between distant languages. O ur approach involves thorough data cleaning, transfer learning using a high-resource language pair, iterative training, and utilization of monolingual data via back-translation. We experiment with NMT and PB-SMT approaches alike, using the base Transformer architecture for all of the NMT models while utilizing PB-SMT systems as comparable baseline solutions.

adam mickiewicz university mickiewicz university english-hausa university english-hausa submissions جامعة آدم ميكايكز جامعة ميكايكز إنجليزي هوسا التقديمات English-Hausa الجامعة صناعة حمض الفوسفور المزيد..

Detection of Puffery on the English Wikipedia

227 - Association for Computation Linguistics 2021 مقالة

On Wikipedia, an online crowdsourced encyclopedia, volunteers enforce the encyclopedia's editorial policies. Wikipedia's policy on maintaining a neutral point of view has inspired recent research on bias detection, including weasel words'' and hedges ''. Yet to date, little work has been done on identifying puffery,'' phrases that are overly positive without a verifiable source. We demonstrate that collecting training data for this task requires some care, and construct a dataset by combining Wikipedia editorial annotations and information retrieval techniques. We compare several approaches to predicting puffery, and achieve 0.963 f1 score by incorporating citation features into a RoBERTa model. Finally, we demonstrate how to integrate our model with Wikipedia's public infrastructure to give back to the Wikipedia editor community.

english wikipedia wikipedia الإنجليزية ويكيبيديا ويكيبيديا إنجليزي صناعة حمض الفوسفور

Detecting Multilingual COVID-19 Misinformation on Social Media via Contextualized Embeddings

267 - Association for Computation Linguistics 2021 مقالة

We present machine learning classifiers to automatically identify COVID-19 misinformation on social media in three languages: English, Bulgarian, and Arabic. We compared 4 multitask learning models for this task and found that a model trained with En glish BERT achieves the best results for English, and multilingual BERT achieves the best results for Bulgarian and Arabic. We experimented with zero shot, few shot, and target-only conditions to evaluate the impact of target-language training data on classifier performance, and to understand the capabilities of different models to generalize across languages in detecting misinformation online. This work was performed as a submission to the shared task, NLP4IF 2021: Fighting the COVID-19 Infodemic. Our best models achieved the second best evaluation test results for Bulgarian and Arabic among all the participating teams and obtained competitive scores for English.

contextualized embeddings english bert achieves إنجليزي بيرت يحقق صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد