تصف هذه الورقة تقديم فريقنا من أجل تعدين وسائل التواصل الاجتماعي للصحة (SMM4H) 2021 المهمة المشتركة.شاركنا في ثلاث مجموعات فرعية: تصنيف تأثير المخدرات السلبي، والتقرير الذاتي Covid-19، وأعراض Covid-19.يعتمد نظامنا على نموذج Bert المدرب مسبقا على النص الخاص بالمجال.بالإضافة إلى ذلك، نقوم بإجراء تنظيف البيانات والتكبير، بالإضافة إلى تحسين فرط التنفس وفرقة نموذجية لتعزيز أداء بيرت.حققنا الرتبة الأولى في كل من تأثيرات المخدرات الضارة ومهام التقرير الذاتي CovID-19.
This paper describes our team's submission for the Social Media Mining for Health (SMM4H) 2021 shared task. We participated in three subtasks: Classifying adverse drug effect, COVID-19 self-report, and COVID-19 symptoms. Our system is based on BERT model pre-trained on the domain-specific text. In addition, we perform data cleaning and augmentation, as well as hyperparameter optimization and model ensemble to further boost the BERT performance. We achieved the first rank in both classifying adverse drug effects and COVID-19 self-report tasks.
References used
https://aclanthology.org/
We bring the data from the social networking site Twitter
pages, and then we have worked on cleaning and processing
operation to the text of for the classification process texts retrieved
contain a lot of noise and information is useful for the pr
Detecting offensive language on Twitter has many applications ranging from detecting/predicting bullying to measuring polarization. In this paper, we focus on building a large Arabic offensive tweet dataset. We introduce a method for building a datas
The 2020 US Elections have been, more than ever before, characterized by social media campaigns and mutual accusations. We investigate in this paper if this manifests also in online communication of the supporters of the candidates Biden and Trump, b
The mix-up method (Zhang et al., 2017), one of the methods for data augmentation, is known to be easy to implement and highly effective. Although the mix-up method is intended for image identification, it can also be applied to natural language proce
Customers of machine learning systems demand accountability from the companies employing these algorithms for various prediction tasks. Accountability requires understanding of system limit and condition of erroneous predictions, as customers are oft