Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Challenges in Designing Games with a Purpose for Abusive Language Annotation

التحديات في تصميم الألعاب بغرض التعريفي باللغة التعريفي

661 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

In this paper we discuss several challenges related to the development of a 3D game, whose goal is to raise awareness on cyberbullying while collecting linguistic annotation on offensive language. The game is meant to be used by teenagers, thus raising a number of issues that need to be tackled during development. For example, the game aesthetics should be appealing for players belonging to this age group, but at the same time all possible solutions should be implemented to meet privacy requirements. Also, the task of linguistic annotation should be possibly hidden, adopting so-called orthogonal game mechanics, without affecting the quality of collected data. While some of these challenges are being tackled in the game development, some others are discussed in this paper but still lack an ultimate solution.

References used

https://aclanthology.org/

rate research

Challenges in Detoxifying Language Models

926 - Association for Computation Linguistics 2021 مقالة

Large language models (LM) generate remarkably fluent text and can be efficiently adapted across NLP tasks. Measuring and guaranteeing the quality of generated text in terms of safety is imperative for deploying LMs in the real world; to this end, pr ior work often relies on automatic evaluation of LM toxicity. We critically discuss this approach, evaluate several toxicity mitigation strategies with respect to both automatic and human evaluation, and analyze consequences of toxicity mitigation in terms of model bias and LM quality. We demonstrate that while basic intervention strategies can effectively optimize previously established automatic metrics on the REALTOXICITYPROMPTS dataset, this comes at the cost of reduced LM coverage for both texts about, and dialects of, marginalized groups. Additionally, we find that human raters often disagree with high automatic toxicity scores after strong toxicity reduction interventions---highlighting further the nuances involved in careful evaluation of LM toxicity.

detoxifying language models challenges in detoxifying detoxifying language نماذج لغة إزالة السموم التحديات في إزالة السموم لغة إزالة السموم صناعة حمض الفوسفور المزيد..

ParsiNLU: A Suite of Language Understanding Challenges for Persian

1024 - Association for Computation Linguistics 2021 مقالة

Abstract Despite the progress made in recent years in addressing natural language understanding (NLU) challenges, the majority of this progress remains to be concentrated on resource-rich languages like English. This work focuses on Persian language, one of the widely spoken languages in the world, and yet there are few NLU datasets available for this language. The availability of high-quality evaluation datasets is a necessity for reliable assessment of the progress on different NLU tasks and domains. We introduce ParsiNLU, the first benchmark in Persian language that includes a range of language understanding tasks---reading comprehension, textual entailment, and so on. These datasets are collected in a multitude of ways, often involving manual annotations by native speakers. This results in over 14.5k new instances across 6 distinct NLU tasks. Additionally, we present the first results on state-of-the-art monolingual and multilingual pre-trained language models on this benchmark and compare them with human performance, which provides valuable insights into our ability to tackle natural language understanding challenges in Persian. We hope ParsiNLU fosters further research and advances in Persian language understanding.1

language understanding challenges persian language لغة فهم التحديات اللغة الفارسية صناعة حمض الفوسفور

Overcoming the challenges in morphological annotation of Turkish in universal dependencies framework

1078 - Association for Computation Linguistics 2021 مقالة

This paper presents several challenges faced when annotating Turkish treebanks in accordance with the Universal Dependencies (UD) guidelines and proposes solutions to address them. Most of these challenges stem from the lack of adequate support in th e UD framework to accurately represent null morphemes and complex derivations, which results in a significant loss of information for Turkish. This loss negatively impacts the tools that are developed based on these treebanks. We raised and discussed these issues within the community on the official UD portal. This paper presents these issues and our proposals to more accurately represent morphosyntactic information for Turkish while adhering to guidelines of UD. This work aims to contribute to the representation of Turkish and other agglutinative languages in UD-based treebanks, which in turn aids to develop more accurately annotated datasets for such languages.

universal dependencies framework morphological annotation إطار التبعيات العالمي التوضيحية المورفولوجية صناعة حمض الفوسفور

HateBERT: Retraining BERT for Abusive Language Detection in English

1071 - Association for Computation Linguistics 2021 مقالة

We introduce HateBERT, a re-trained BERT model for abusive language detection in English. The model was trained on RAL-E, a large-scale dataset of Reddit comments in English from communities banned for being offensive, abusive, or hateful that we hav e curated and made available to the public. We present the results of a detailed comparison between a general pre-trained language model and the retrained version on three English datasets for offensive, abusive language and hate speech detection tasks. In all datasets, HateBERT outperforms the corresponding general BERT model. We also discuss a battery of experiments comparing the portability of the fine-tuned models across the datasets, suggesting that portability is affected by compatibility of the annotated phenomena.

abusive language detection retraining bert abusive language الكشف عن اللغة المسيئة إعادة تدريب بيرت لغة مسيئة صناعة حمض الفوسفور المزيد..

Arabic Language as a Second Language: Challenges Facing Foreign Learners

5112 - Damascus University 2010 ورقة بحثية

This research deals with teaching Arabic as a second language. It tackles the different characteristics and nationalities of learners in addition to their objectives in relation to learning Arabic. This is taken into consideration when preparing t he required curricula from two perspectives; the linguistic and the functional one. This research sheds light on the role of technology that is utilized to facilitate the task of learning Arabic by speakers of other languages in relation to the pronunciation of letters, sounds, writing, grammatical conjugation, comprehension and reading. The research also sheds light on the most important challenges facing the Arabic Language since the twenty first century such as the cultural challenge and the revival of local and spoken dialects.

اللغة العربية Arabic Language لغة ثانية الدارسين الأجانب تحديات Second Language Foreign Learners Challenges المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Challenges in Designing Games with a Purpose for Abusive Language Annotation

التحديات في تصميم الألعاب بغرض التعريفي باللغة التعريفي

Ask ChatGPT about the research

Read More

suggested questions