تصف هذه الورقة النموذج الفائز في المهمة المشتركة باللغة العربية NLP4IF لمحاربة المعكرية CovID-19.الهدف من المهمة المشتركة هو التحقق من التضليل حول Covid-19 في تغريدات عربية.تم تصنيف نموذجنا المقترح الأول مع درجة F1 من 0.780 ونتيجة دقة من 0.762.تم تجربة مجموعة متنوعة من النماذج اللغوية المدربة المستندة إلى المحولات من خلال هذه الدراسة.يعد النموذج الأفضل سجل فرقة من نماذج عربيرت والقاعدة في عربيه، وأربرت.تتمثل إحدى النتائج الرئيسية في الدراسة في إظهار التأثير يمكن أن يكون للمعالجة المسبقة في درجة كل نموذج.بالإضافة إلى وصف النموذج الفائز، تظهر الدراسة الحالية تحليل الأخطاء.
This paper describes the winning model in the Arabic NLP4IF shared task for fighting the COVID-19 infodemic. The goal of the shared task is to check disinformation about COVID-19 in Arabic tweets. Our proposed model has been ranked 1st with an F1-Score of 0.780 and an Accuracy score of 0.762. A variety of transformer-based pre-trained language models have been experimented with through this study. The best-scored model is an ensemble of AraBERT-Base, Asafya-BERT, and ARBERT models. One of the study's key findings is showing the effect the pre-processing can have on every model's score. In addition to describing the winning model, the current study shows the error analysis.
References used
https://aclanthology.org/
This paper provides a detailed overview of the system and its outcomes, which were produced as part of the NLP4IF Shared Task on Fighting the COVID-19 Infodemic at NAACL 2021. This task is accomplished using a variety of techniques. We used state-of-
We present the results and the main findings of the NLP4IF-2021 shared tasks. Task 1 focused on fighting the COVID-19 infodemic in social media, and it was offered in Arabic, Bulgarian, and English. Given a tweet, it asked to predict whether that twe
The spread of COVID-19 has been accompanied with widespread misinformation on social media. In particular, Twitterverse has seen a huge increase in dissemination of distorted facts and figures. The present work aims at identifying tweets regarding CO
The massive spread of false information on social media has become a global risk especially in a global pandemic situation like COVID-19. False information detection has thus become a surging research topic in recent months. In recent years, supervis
In this paper, we describe our system for the shared task on Fighting the COVID-19 Infodemic in the English Language. Our proposed architecture consists of a multi-output classification model for the seven tasks, with a task-wise multi-head attention