في هذه الورقة نناقش العديد من التحديات المتعلقة بتطوير لعبة ثلاثية الأبعاد، تهدف هدفها إلى زيادة الوعي بالتبريد الإلكتروني أثناء جمع التوضيح اللغوي في اللغة الهجومية.من المفترض أن تستخدم اللعبة من قبل المراهقين، وبالتالي رفع عدد من القضايا التي يجب معالجتها أثناء التنمية.على سبيل المثال، يجب أن تكون جماليات اللعبة جذابة للاعبين الذين ينتمون إلى هذه الفئة العمرية، ولكن في الوقت نفسه يجب تنفيذ جميع الحلول الممكنة لتلبية متطلبات الخصوصية.أيضا، ينبغي إخفاء مهمة الشروح اللغوية مخفية، وتبني ما يسمى ميكانيكا اللعبة المتعامدة، دون التأثير على جودة البيانات التي تم جمعها.في حين أن بعض هذه التحديات يتم تناولها في تطوير اللعبة، نناقش بعض الآخرين في هذه الورقة ولكن لا يزال يفتقر إلى حل نهائي.
In this paper we discuss several challenges related to the development of a 3D game, whose goal is to raise awareness on cyberbullying while collecting linguistic annotation on offensive language. The game is meant to be used by teenagers, thus raising a number of issues that need to be tackled during development. For example, the game aesthetics should be appealing for players belonging to this age group, but at the same time all possible solutions should be implemented to meet privacy requirements. Also, the task of linguistic annotation should be possibly hidden, adopting so-called orthogonal game mechanics, without affecting the quality of collected data. While some of these challenges are being tackled in the game development, some others are discussed in this paper but still lack an ultimate solution.
References used
https://aclanthology.org/
Large language models (LM) generate remarkably fluent text and can be efficiently adapted across NLP tasks. Measuring and guaranteeing the quality of generated text in terms of safety is imperative for deploying LMs in the real world; to this end, pr
Abstract Despite the progress made in recent years in addressing natural language understanding (NLU) challenges, the majority of this progress remains to be concentrated on resource-rich languages like English. This work focuses on Persian language,
This paper presents several challenges faced when annotating Turkish treebanks in accordance with the Universal Dependencies (UD) guidelines and proposes solutions to address them. Most of these challenges stem from the lack of adequate support in th
We introduce HateBERT, a re-trained BERT model for abusive language detection in English. The model was trained on RAL-E, a large-scale dataset of Reddit comments in English from communities banned for being offensive, abusive, or hateful that we hav
This research deals with teaching Arabic as a second language. It
tackles the different characteristics and nationalities of learners in
addition to their objectives in relation to learning Arabic. This is taken
into consideration when preparing t