يمكن أن تلعب الموارد الحسابية مثل سورانيا المشروح الدولى دورا مهما في تمكين المتحدثين لغات الأقليات الأصلية للمشاركة في الحكومة والتعليم ومجالات الحياة العامة في لغتهم العامة.ومع ذلك، فإن العديد من اللغات - بشكل رئيسي أولئك الذين لديهم سكان متكلمون أصليين صغار ودون تقاليد مكتوبة - ليس لديهم دعما رقميا.عقبة واحدة في إنشاء هذه الموارد هي أنه بالنسبة للعديد من اللغات، سيكون عدد قليل من المتحدثين قادرين على تسجيل النصوص - وهي مهمة تتطلب محو الأمية وبعض التدريب اللغوي - وأن وقت هؤلاء الخبراء عادة ما يكون في ارتفاع الطلب على أعمال تخطيط اللغة.تقوم هذه الورقة بتقييم ما إذا كانت غير مكبرات الصوت المدربة في لغة أصلية يمكن أن تؤدي إشعالا دلاليين باستخدام عروض توضيحي موحدة، مما يسمح بإنشاء مواد حسابية دون إيصال المزيد من الضغط على موارد المجتمع.
Computational resources such as semantically annotated corpora can play an important role in enabling speakers of indigenous minority languages to participate in government, education, and other domains of public life in their own language. However, many languages -- mainly those with small native speaker populations and without written traditions -- have little to no digital support. One hurdle in creating such resources is that for many languages, few speakers would be capable of annotating texts -- a task which requires literacy and some linguistic training -- and that these experts' time is typically in high demand for language planning work. This paper assesses whether typologically trained non-speakers of an indigenous language can feasibly perform semantic annotation using Uniform Meaning Representations, thus allowing for the creation of computational materials without putting further strain on community resources.
References used
https://aclanthology.org/
Semantic divergence in related languages is a key concern of historical linguistics. We cross-linguistically investigate the semantic divergence of cognate pairs in English and Romance languages, by means of word embeddings. To this end, we introduce
Universal Conceptual Cognitive Annotation (UCCA) is a semantic annotation scheme that organizes texts into coarse predicate-argument structure, offering broad coverage of semantic phenomena. At the same time, there is still need for a finer-grained t
Graph-based semantic parsing aims to represent textual meaning through directed graphs. As one of the most promising general-purpose meaning representations, these structures and their parsing have gained a significant interest momentum during recent
Crowdsourcing from non-experts is one of the most common approaches to collecting data and annotations in NLP. Even though it is such a fundamental tool in NLP, crowdsourcing use is largely guided by common practices and the personal experience of re
Biomaterials are synthetic or natural materials used for constructing artificial organs, fabricating prostheses, or replacing tissues. The last century saw the development of thousands of novel biomaterials and, as a result, an exponential increase i