في نمو العالم اليوم والتكنولوجيا المتقدمة، تلعب شبكات وسائل التواصل الاجتماعي دورا مهما في التأثير على الأرواح البشرية.الرقابة هي الإطاحة عن الكلام أو ناقل الحركة العام أو التفاصيل الأخرى التي تلعب دورا كبيرا في وسائل التواصل الاجتماعي.قد يتم اعتبار المحتوى ضارا أو حساسا أو غير مريح.السلطات مثل المعاهد والحكومات وغيرها من المنظمات تصرف الرقابة.نفذت هذه الورقة نموذجا يساعد على تصنيف التغريدات الرقابة والكشف عنها كتصنيف ثنائي.تصف الورقة تقديمها إلى مهمة مشتركة للرقابة في ورشة عمل NLP4IF 2021.استخدمنا العديد من النماذج المدربة المستندة إلى المحولات، وتخرج XLNet دقة أفضل بين الجميع.نحن نضقل النموذج للحصول على أداء أفضل وحققت دقة معقولة، وتحسب مقاييس الأداء الأخرى.
In the growth of today's world and advanced technology, social media networks play a significant role in impacting human lives. Censorship is the overthrowing of speech, public transmission, or other details that play a vast role in social media. The content may be considered harmful, sensitive, or inconvenient. Authorities like institutes, governments, and other organizations conduct Censorship. This paper has implemented a model that helps classify censored and uncensored tweets as a binary classification. The paper describes submission to the Censorship shared task of the NLP4IF 2021 workshop. We used various transformer-based pre-trained models, and XLNet outputs a better accuracy among all. We fine-tuned the model for better performance and achieved a reasonable accuracy, and calculated other performance metrics.
References used
https://aclanthology.org/
In this study, we study language change in Chinese Biji by using a classification task: classifying Ancient Chinese texts by time periods. Specifically, we focus on a unique genre in classical Chinese literature: Biji (literally notebook'' or brush n
The reported work is a description of our participation in the Classification of COVID19 tweets containing symptoms'' shared task, organized by the Social Media Mining for Health Applications (SMM4H)'' workshop. The literature describes two machine l
This study describes our proposed model design for SMM4H 2021 shared tasks. We fine-tune the language model of RoBERTa transformers and their connecting classifier to complete the classification tasks of tweets for adverse pregnancy outcomes (Task 4)
In this paper, we propose a knowledge infusion mechanism to incorporate domain knowledge into language transformers. Weakly supervised data is regarded as the main source for knowledge acquisition. We pre-train the language models to capture masked k
We use Hypergraph Attention Networks (HyperGAT) to recognize multiple labels of Chinese humor texts. We firstly represent a joke as a hypergraph. The sequential hyperedge and semantic hyperedge structures are used to construct hyperedges. Then, atten