Research papers, master and doctoral theses about الرومانية تويتر

ROFF - A Romanian Twitter Dataset for Offensive Language

184 - Association for Computation Linguistics 2021 مقالة

This paper describes the annotation process of an offensive language data set for Romanian on social media. To facilitate comparable multi-lingual research on offensive language, the annotation guidelines follow some of the recent annotation efforts for other languages. The final corpus contains 5000 micro-blogging posts annotated by a large number of volunteer annotators. The inter-annotator agreement and the initial automatic discrimination results we present are in line with earlier annotation efforts.

romanian twitter dataset romanian twitter مجموعة بيانات Twitter الرومانية الرومانية تويتر صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد