تقدم هذه الورقة العديد من التحديات التي تواجهها عند إشراف Treebanks التركية وفقا للمبادئ التوجيهية للتبض الشامل (UD) وتقترح الحلول لمعالجتها.معظم هذه التحديات تنبع من الافتقار إلى الدعم الكافي في إطار UD إلى بدقة تمثل مورفيمز البادئة والاشتقامات المعقدة، مما يؤدي إلى فقدان كبير للمعلومات من أجل التركية.تؤثر هذه الخسارة سلبا على الأدوات التي تم تطويرها بناء على هذه Treebanks.نشأنا وناقشت هذه القضايا داخل المجتمع على بوابة UD الرسمية.تعرض هذه الورقة هذه القضايا ومقترحاتنا تمثل أكثر دقة معلومات مورفوسنكتاسية للتركية في حين تلتزم بمبادئ توجيهية للتكييف.يهدف هذا العمل إلى المساهمة في تمثيل اللغات التركية وغيرها من اللغات الشاقة في Treebanks القائمة على UD، والتي بدورها تساعد على تطوير مجموعات بيانات مشروحة بدقة لهذه اللغات.
This paper presents several challenges faced when annotating Turkish treebanks in accordance with the Universal Dependencies (UD) guidelines and proposes solutions to address them. Most of these challenges stem from the lack of adequate support in the UD framework to accurately represent null morphemes and complex derivations, which results in a significant loss of information for Turkish. This loss negatively impacts the tools that are developed based on these treebanks. We raised and discussed these issues within the community on the official UD portal. This paper presents these issues and our proposals to more accurately represent morphosyntactic information for Turkish while adhering to guidelines of UD. This work aims to contribute to the representation of Turkish and other agglutinative languages in UD-based treebanks, which in turn aids to develop more accurately annotated datasets for such languages.
References used
https://aclanthology.org/
In recent years, remote digital healthcare using online chats has gained momentum, especially in the Global South. Though prior work has studied interaction patterns in online (health) forums, such as TalkLife, Reddit and Facebook, there has been lim
This paper describes a system proposed for the IWPT 2021 Shared Task on Parsing into Enhanced Universal Dependencies (EUD). We propose a Graph Rewriting based system for computing Enhanced Universal Dependencies, given the Basic Universal Dependencies (UD).
Universal Conceptual Cognitive Annotation (UCCA) is a semantic annotation scheme that organizes texts into coarse predicate-argument structure, offering broad coverage of semantic phenomena. At the same time, there is still need for a finer-grained t
This paper details experiments we performed on the Universal Dependencies 2.7 corpora in order to investigate the dominant word order in the available languages. For this purpose, we used a graph rewriting tool, GREW, which allowed us to go beyond th
In this paper we discuss several challenges related to the development of a 3D game, whose goal is to raise awareness on cyberbullying while collecting linguistic annotation on offensive language. The game is meant to be used by teenagers, thus raisi