Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Overcoming the challenges in morphological annotation of Turkish in universal dependencies framework

التغلب على التحديات في شرح مورفولوجي للتركية في إطار التبعيات العالمي

1088 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

This paper presents several challenges faced when annotating Turkish treebanks in accordance with the Universal Dependencies (UD) guidelines and proposes solutions to address them. Most of these challenges stem from the lack of adequate support in the UD framework to accurately represent null morphemes and complex derivations, which results in a significant loss of information for Turkish. This loss negatively impacts the tools that are developed based on these treebanks. We raised and discussed these issues within the community on the official UD portal. This paper presents these issues and our proposals to more accurately represent morphosyntactic information for Turkish while adhering to guidelines of UD. This work aims to contribute to the representation of Turkish and other agglutinative languages in UD-based treebanks, which in turn aids to develop more accurately annotated datasets for such languages.

References used

https://aclanthology.org/

rate research

A Linguistic Annotation Framework to Study Interactions in Multilingual Healthcare Conversational Forums

1156 - Association for Computation Linguistics 2021 مقالة

In recent years, remote digital healthcare using online chats has gained momentum, especially in the Global South. Though prior work has studied interaction patterns in online (health) forums, such as TalkLife, Reddit and Facebook, there has been lim ited work in understanding interactions in small, close-knit community of instant messengers. In this paper, we propose a linguistic annotation framework to facilitate analysis of health-focused WhatsApp groups. The primary aim of the framework is to understand interpersonal relationships among peer supporters in order to help develop NLP solutions for remote patient care and reduce burden of overworked healthcare providers. Our framework consists of fine-grained peer support categorization and message-level sentiment tagging. Additionally, due to the prevalence of code-mixing in such groups, we incorporate word-level language annotations. We use the proposed framework to study two WhatsApp groups in Kenya for youth living with HIV, facilitated by a healthcare provider.

multilingual healthcare conversational healthcare conversational forums conversational forums تعدد اللغات الرعاية الصحية منتديات صبايا الاردن منتديات صناعة حمض الفوسفور المزيد..

Graph Rewriting for Enhanced Universal Dependencies

1062 - Association for Computation Linguistics 2021 مقالة

This paper describes a system proposed for the IWPT 2021 Shared Task on Parsing into Enhanced Universal Dependencies (EUD). We propose a Graph Rewriting based system for computing Enhanced Universal Dependencies, given the Basic Universal Dependencies (UD).

المهام المشتركة EUD basic universal dependencies التبعيات العالمية الأساسية صناعة حمض الفوسفور

Subcategorizing Adverbials in Universal Conceptual Cognitive Annotation

646 - Association for Computation Linguistics 2021 مقالة

Universal Conceptual Cognitive Annotation (UCCA) is a semantic annotation scheme that organizes texts into coarse predicate-argument structure, offering broad coverage of semantic phenomena. At the same time, there is still need for a finer-grained t reatment of many of the categories. The Adverbial category is of special interest, as it covers a wide range of fundamentally different meanings such as negation, causation, aspect, and event quantification. In this paper we introduce a refinement annotation scheme for UCCA's Adverbial category, showing that UCCA Adverbials can indeed be subcategorized into at least 7 semantic types, and doing so can help clarify and disambiguate the otherwise coarse-grained labels. We provide a preliminary set of annotation guidelines, as well as pilot annotation experiments with high inter-annotator agreement, confirming the validity of the scheme.

universal conceptual cognitive conceptual cognitive annotation conceptual cognitive المعرفي المفاهيمي العالمي الشرح المعرفي المفاهيمي المعرفي المفاهيمي صناعة حمض الفوسفور المزيد..

Investigating Dominant Word Order on Universal Dependencies with Graph Rewriting

614 - Association for Computation Linguistics 2021 مقالة

This paper details experiments we performed on the Universal Dependencies 2.7 corpora in order to investigate the dominant word order in the available languages. For this purpose, we used a graph rewriting tool, GREW, which allowed us to go beyond th e surface annotations and identify the implicit subjects. We first measured the distribution of the six different word orders (SVO, SOV, VSO, VOS, OVS, OSV) in the corpora and investigated when there was a significant difference in the corpora within a given language. Then, we compared the obtained results with information provided in the WALS database (Dryer and Haspelmath, 2013) and in ( ̈Ostling, 2015). Finally, we examined the impact of using a graph rewriting tool for this task. The tools and resources used for this research are all freely available.

dominant word order investigating dominant word universal dependencies كلمة مهيمنة أمر التحقيق في كلمة مهيمنة التبعيات العالمية صناعة حمض الفوسفور المزيد..

Challenges in Designing Games with a Purpose for Abusive Language Annotation

662 - Association for Computation Linguistics 2021 مقالة

In this paper we discuss several challenges related to the development of a 3D game, whose goal is to raise awareness on cyberbullying while collecting linguistic annotation on offensive language. The game is meant to be used by teenagers, thus raisi ng a number of issues that need to be tackled during development. For example, the game aesthetics should be appealing for players belonging to this age group, but at the same time all possible solutions should be implemented to meet privacy requirements. Also, the task of linguistic annotation should be possibly hidden, adopting so-called orthogonal game mechanics, without affecting the quality of collected data. While some of these challenges are being tackled in the game development, some others are discussed in this paper but still lack an ultimate solution.

abusive language annotation purpose for abusive التوضيحية لغة مسيئة الغرض من المسيئة صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Overcoming the challenges in morphological annotation of Turkish in universal dependencies framework

التغلب على التحديات في شرح مورفولوجي للتركية في إطار التبعيات العالمي

Ask ChatGPT about the research

Read More

suggested questions