Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Defining meaningful units. Challenges in sign segmentation and segment-meaning mapping (short paper)

تحديد وحدات ذات مغزى.التحديات في تجزئة التوقيع ورسم الخرائط ذات معنى القطاع (ورقة قصيرة)

510 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

This paper addresses the tasks of sign segmentation and segment-meaning mapping in the context of sign language (SL) recognition. It aims to give an overview of the linguistic properties of SL, such as coarticulation and simultaneity, which make these tasks complex. A better understanding of SL structure is the necessary ground for the design and development of SL recognition and segmentation methodologies, which are fundamental for machine translation of these languages. Based on this preliminary exploration, a proposal for mapping segments to meaning in the form of an agglomerate of lexical and non-lexical information is introduced.

References used

https://aclanthology.org/

rate research

AVASAG: A German Sign Language Translation System for Public Services (short paper)

624 - Association for Computation Linguistics 2021 مقالة

This paper presents an overview of AVASAG; an ongoing applied-research project developing a text-to-sign-language translation system for public services. We describe the scientific innovation points (geometry-based SL-description, 3D animation and video corpus, simplified annotation scheme, motion capture strategy) and the overall translation pipeline.

أسطورة التوقيع language translation system نظام ترجمة اللغة صناعة حمض الفوسفور

Online Evaluation of Text-to-sign Translation by Deaf End Users: Some Methodological Recommendations (short paper)

520 - Association for Computation Linguistics 2021 مقالة

We present a number of methodological recommendations concerning the online evaluation of avatars for text-to-sign translation, focusing on the structure, format and length of the questionnaire, as well as methods for eliciting and faithfully transcribing responses

deaf end users end users deaf end المستخدمين النهائيين الصم المستخدمين النهائيين نهاية الصم صناعة حمض الفوسفور المزيد..

Morphologically-Guided Segmentation For Translation of Agglutinative Low-Resource Languages

939 - Association for Computation Linguistics 2021 مقالة

Neural Machine Translation (NMT) for Low Resource Languages (LRL) is often limited by the lack of available training data, making it necessary to explore additional techniques to improve translation quality. We propose the use of the Prefix-Root-Post fix-Encoding (PRPE) subword segmentation algorithm to improve translation quality for LRLs, using two agglutinative languages as case studies: Quechua and Indonesian. During the course of our experiments, we reintroduce a parallel corpus for Quechua-Spanish translation that was previously unavailable for NMT. Our experiments show the importance of appropriate subword segmentation, which can go as far as improving translation quality over systems trained on much larger quantities of data. We show this by achieving state-of-the-art results for both languages, obtaining higher BLEU scores than large pre-trained models with much smaller amounts of data.

وسائل التواصل الاجتماعي التعليقات agglutinative low-resource languages improve translation quality بلغ ملغات الموارد المنخفضة تحسين جودة الترجمة صناعة حمض الفوسفور

Segment, Mask, and Predict: Augmenting Chinese Word Segmentation with Self-Supervision

1229 - Association for Computation Linguistics 2021 مقالة

Recent state-of-the-art (SOTA) effective neural network methods and fine-tuning methods based on pre-trained models (PTM) have been used in Chinese word segmentation (CWS), and they achieve great results. However, previous works focus on training the models with the fixed corpus at every iteration. The intermediate generated information is also valuable. Besides, the robustness of the previous neural methods is limited by the large-scale annotated data. There are a few noises in the annotated corpus. Limited efforts have been made by previous studies to deal with such problems. In this work, we propose a self-supervised CWS approach with a straightforward and effective architecture. First, we train a word segmentation model and use it to generate the segmentation results. Then, we use a revised masked language model (MLM) to evaluate the quality of the segmentation results based on the predictions of the MLM. Finally, we leverage the evaluations to aid the training of the segmenter by improved minimum risk training. Experimental results show that our approach outperforms previous methods on 9 different CWS datasets with single criterion training and multiple criteria training and achieves better robustness.

augmenting chinese word augmenting chinese زيادة الكلمة الصينية زيادة الصينية صناعة حمض الفوسفور

Dialect Identification in Nuanced Arabic Tweets Using Farasa Segmentation and AraBERT

628 - Association for Computation Linguistics 2021 مقالة

This paper presents our approach to address the EACL WANLP-2021 Shared Task 1: Nuanced Arabic Dialect Identification (NADI). The task is aimed at developing a system that identifies the geographical location(country/province) from where an Arabic twe et in the form of modern standard Arabic or dialect comes from. We solve the task in two parts. The first part involves pre-processing the provided dataset by cleaning, adding and segmenting various parts of the text. This is followed by carrying out experiments with different versions of two Transformer based models, AraBERT and AraELECTRA. Our final approach achieved macro F1-scores of 0.216, 0.235, 0.054, and 0.043 in the four subtasks, and we were ranked second in MSA identification subtasks and fourth in DA identification subtasks.

nuanced arabic tweets farasa segmentation nuanced العربية تغريدات تجزئة فاراسا صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Defining meaningful units. Challenges in sign segmentation and segment-meaning mapping (short paper)

تحديد وحدات ذات مغزى.التحديات في تجزئة التوقيع ورسم الخرائط ذات معنى القطاع (ورقة قصيرة)

Ask ChatGPT about the research

Read More

suggested questions