في هذه الورقة، نطور Sindhi معجم شخصي باستخدام دمج الموارد الإنجليزية القائمة: NRC Lexicon، قائمة كلمات الرأي، Sentiwordnet، Sindhi-English Dictionary، وجمع معدلات Sindhi.يتم تعيين درجة المشاعر الإيجابية أو السلبية لكل كلمة sindhi رأي.بعد ذلك، نحدد تغطية المعجم المقترح مع تحليل الذاتية.علاوة على ذلك، نحن الزحف من سقسقة المجال سقسقة من الأخبار والرياضة والتمويل.يتم تفجيح Crescus Corpus من قبل Annetators ذوي الخبرة باستخدام أداة توضيح النص Doccano.يتم تقييم المشاعر المشروحة Corpus من خلال توظيف آلة ناقلات الدعم (SVM)، والشبكات العصبية المتكررة (RNN)، والشبكة العصبية التنافسية (CNN).
In this paper, we develop Sindhi subjective lexicon using a merger of existing English resources: NRC lexicon, list of opinion words, SentiWordNet, Sindhi-English bilingual dictionary, and collection of Sindhi modifiers. The positive or negative sentiment score is assigned to each Sindhi opinion word. Afterwards, we determine the coverage of the proposed lexicon with subjectivity analysis. Moreover, we crawl multi-domain tweet corpus of news, sports, and finance. The crawled corpus is annotated by experienced annotators using the Doccano text annotation tool. The sentiment annotated corpus is evaluated by employing support vector machine (SVM), recurrent neural network (RNN) variants, and convolutional neural network (CNN).
References used
https://aclanthology.org/
Adapters are light-weight modules that allow parameter-efficient fine-tuning of pretrained models. Specialized language and task adapters have recently been proposed to facilitate cross-lingual transfer of multilingual pretrained models (Pfeiffer et
A bigger is better'' explosion in the number of parameters in deep neural networks has made it increasingly challenging to make state-of-the-art networks accessible in compute-restricted environments. Compression techniques have taken on renewed impo
To build automated simplification systems, corpora of complex sentences and their simplified versions is the first step to understand sentence complexity and enable the development of automatic text simplification systems. We present a lexical and sy
This paper describes the participation of team oneNLP (LTRC, IIIT-Hyderabad) for the WMT 2021 task, similar language translation. We experimented with transformer based Neural Machine Translation and explored the use of language similarity for Tamil-
The widespread presence of offensive language on social media motivated the development of systems capable of recognizing such content automatically. Apart from a few notable exceptions, most research on automatic offensive language identification ha