مع ظهور جائحة Covid-19، تم دمج الجوانب السياسية والطبية المتمثلة في التضليل حيث حصلت المشكلة على ارتفاع إلى مستوى جديد تماما لتصبح أول انفجار عالمي. تم إعلان محاربة هذا المعف النقود أحد أهم مجالات التركيز في منظمة الصحة العالمية، مع وجود مخاطر تتراوح من تعزيز العلاجات المزيفة والشائعات ونظريات المؤامرة لنشر كراهية الأجانب والذعر. يتطلب معالجة القضية حل عدد من المشاكل الصعبة مثل تحديد الرسائل التي تحتوي على مطالبات، وتحديد الجدارة الشيكية والوصيل لها، وإمكاناتها لإيذاءها وكذلك طبيعة هذا الضرر، لنذكر عدد قليل فقط. لمعالجة هذه الفجوة، نقوم بإصدار مجموعة بيانات كبيرة من 16 ألف تغريدات مشروح يدويا لتحليل التهيئة الدقيقة التي يركز عليها (ط) على CovID-19، (II) تجمع بين وجهات النظر ومصالح الصحفيين، ومساجي الحقائق، ومنصات وسائل التواصل الاجتماعي ، صانعي السياسات، والمجتمع، و (3) يغطي العربية والكبلانية والهولندية والإنجليزية. أخيرا، نظهر نتائج تقييم قوية باستخدام محولات مسبقا، وبالتالي تؤكد الأداة المساعدة العملة لمجموعة البيانات في أحادي التوتر مقابل تعدد اللغات، ومهمة واحدة مقابل إعدادات متعددة.
With the emergence of the COVID-19 pandemic, the political and the medical aspects of disinformation merged as the problem got elevated to a whole new level to become the first global infodemic. Fighting this infodemic has been declared one of the most important focus areas of the World Health Organization, with dangers ranging from promoting fake cures, rumors, and conspiracy theories to spreading xenophobia and panic. Addressing the issue requires solving a number of challenging problems such as identifying messages containing claims, determining their check-worthiness and factuality, and their potential to do harm as well as the nature of that harm, to mention just a few. To address this gap, we release a large dataset of 16K manually annotated tweets for fine-grained disinformation analysis that (i) focuses on COVID-19, (ii) combines the perspectives and the interests of journalists, fact-checkers, social media platforms, policy makers, and society, and (iii) covers Arabic, Bulgarian, Dutch, and English. Finally, we show strong evaluation results using pretrained Transformers, thus confirming the practical utility of the dataset in monolingual vs. multilingual, and single task vs. multitask settings.
References used
https://aclanthology.org/
The massive spread of false information on social media has become a global risk especially in a global pandemic situation like COVID-19. False information detection has thus become a surging research topic in recent months. In recent years, supervis
This paper provides a detailed overview of the system and its outcomes, which were produced as part of the NLP4IF Shared Task on Fighting the COVID-19 Infodemic at NAACL 2021. This task is accomplished using a variety of techniques. We used state-of-
This paper describes the winning model in the Arabic NLP4IF shared task for fighting the COVID-19 infodemic. The goal of the shared task is to check disinformation about COVID-19 in Arabic tweets. Our proposed model has been ranked 1st with an F1-Sco
We present the results and the main findings of the NLP4IF-2021 shared tasks. Task 1 focused on fighting the COVID-19 infodemic in social media, and it was offered in Arabic, Bulgarian, and English. Given a tweet, it asked to predict whether that twe
The objective of this work was the introduction of an effective approach based on the AraBERT language model for fighting Tweets COVID-19 Infodemic. It was arranged in the form of a two-step pipeline, where the first step involved a series of pre-pro