In this paper, we describe our approach to using pre-trained models for hope speech detection. We participated in Task 2: Hope Speech Detection for Equality, Diversity and Inclusion at LT-EDI-2021 @ EACL2021. The goal of this task is to predict whether a text contains hope speech, and also to identify samples in the dataset that are not in the target language. We describe our approach to fine-tuning RoBERTa for hope speech detection in English and XLM-RoBERTa for hope speech detection in Tamil and Malayalam, two low-resource Indic languages. We report the performance of our approach on classifying text into three classes: hope-speech, non-hope and not-language. Our approach ranked 1st in English (F1 = 0.93), 1st in Tamil (F1 = 0.61) and 3rd in Malayalam (F1 = 0.83).
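To make the reported metric concrete, the following is a minimal sketch of how an F1 score over the three classes could be computed. The abstract does not state which averaging was used, so the support-weighted average here is an assumption, and the function names and the toy labels are hypothetical illustrations rather than the authors' evaluation code.

```python
from collections import Counter

# Class names as given in the abstract; the exact label strings are assumed.
LABELS = ["hope-speech", "non-hope", "not-language"]


def per_class_f1(gold, pred, label):
    """F1 for one class, treating it as the positive class."""
    tp = sum(1 for g, p in zip(gold, pred) if g == label and p == label)
    fp = sum(1 for g, p in zip(gold, pred) if g != label and p == label)
    fn = sum(1 for g, p in zip(gold, pred) if g == label and p != label)
    if tp == 0:
        # No true positives: precision or recall is zero (or undefined).
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def weighted_f1(gold, pred, labels=LABELS):
    """Average of per-class F1, weighted by each class's share of gold labels.

    Assumption: weighted averaging; the source does not specify it.
    """
    support = Counter(gold)
    total = len(gold)
    return sum(per_class_f1(gold, pred, l) * support[l] / total for l in labels)


# Toy data for illustration only; not taken from the shared-task dataset.
gold = ["hope-speech", "non-hope", "non-hope", "not-language"]
pred = ["hope-speech", "non-hope", "hope-speech", "not-language"]
score = weighted_f1(gold, pred)
```

With weighted averaging, frequent classes dominate the score, which matters when one class (typically non-hope) is much larger than the others.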