Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Linguistic Knowledge in Multilingual Grapheme-to-Phoneme Conversion

المعرفة اللغوية في التحويل متعدد اللغات

775 0 0 0.0 ( 0 )

Download Cite

Added by Association for Computation Linguistics مقالة

Publication date 2021

fields Artificial Intelligence

and research's language is English

Created by Shamra Editor

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

This paper documents the UBC Linguistics team's approach to the SIGMORPHON 2021 Grapheme-to-Phoneme Shared Task, concentrating on the low-resource setting. Our systems expand the baseline model with simple modifications informed by syllable structure and error analysis. In-depth investigation of test-set predictions shows that our best model rectifies a significant number of mistakes compared to the baseline prediction, besting all other submissions. Our results validate the view that careful error analysis in conjunction with linguistic knowledge can lead to more effective computational modeling.

References used

https://aclanthology.org/

rate research

CLUZH at SIGMORPHON 2021 Shared Task on Multilingual Grapheme-to-Phoneme Conversion: Variations on a Baseline

633 - Association for Computation Linguistics 2021 مقالة

This paper describes the submission by the team from the Department of Computational Linguistics, Zurich University, to the Multilingual Grapheme-to-Phoneme Conversion (G2P) Task 1 of the SIGMORPHON 2021 challenge in the low and medium settings. The submission is a variation of our 2020 G2P system, which serves as the baseline for this year's challenge. The system is a neural transducer that operates over explicit edit actions and is trained with imitation learning. For this challenge, we experimented with the following changes: a) emitting phoneme segments instead of single character phonemes, b) input character dropout, c) a mogrifier LSTM decoder (Melis et al., 2019), d) enriching the decoder input with the currently attended input character, e) parallel BiLSTM encoders, and f) an adaptive batch size scheduler. In the low setting, our best ensemble improved over the baseline, however, in the medium setting, the baseline was stronger on average, although for certain languages improvements could be observed.

فرقة cluzh at sigmorphon zurich university cluzh في سيغمورفون جامعة زيوريخ صناعة حمض الفوسفور

Multilingual AMR Parsing with Noisy Knowledge Distillation

846 - Association for Computation Linguistics 2021 مقالة

We study multilingual AMR parsing from the perspective of knowledge distillation, where the aim is to learn and improve a multilingual AMR parser by using an existing English parser as its teacher. We constrain our exploration in a strict multilingua l setting: there is but one model to parse all different languages including English. We identify that noisy input and precise output are the key to successful distillation. Together with extensive pre-training, we obtain an AMR parser whose performances surpass all previously published results on four different foreign languages, including German, Spanish, Italian, and Chinese, by large margins (up to 18.8 Smatch points on Chinese and on average 11.3 Smatch points). Our parser also achieves comparable performance on English to the latest state-of-the-art English-only parser.

multilingual amr parsing noisy knowledge distillation تحليل عمرو متعدد اللغات تقطير المعرفة صاخبة صناعة حمض الفوسفور

Improving the Performance of UDify with Linguistic Typology Knowledge

621 - Association for Computation Linguistics 2021 مقالة

UDify is the state-of-the-art language-agnostic dependency parser which is trained on a polyglot corpus of 75 languages. This multilingual modeling enables the model to generalize over unknown/lesser-known languages, thus leading to improved performa nce on low-resource languages. In this work we used linguistic typology knowledge available in URIEL database, to improve the cross-lingual transferring ability of UDify even further.

linguistic typology knowledge linguistic typology typology knowledge المعرفة المعرفة اللغوية الطباعة اللغوية معرفة المعرفة صناعة حمض الفوسفور المزيد..

End-to-end mBERT based Seq2seq Enhanced Dependency Parser with Linguistic Typology knowledge

708 - Association for Computation Linguistics 2021 مقالة

We describe the NUIG solution for IWPT 2021 Shared Task of Enhanced Dependency (ED) parsing in multiple languages. For this shared task, we propose and evaluate an End-to-end Seq2seq mBERT-based ED parser which predicts the ED-parse tree of a given i nput sentence as a relative head-position tag-sequence. Our proposed model is a multitasking neural-network which performs five key tasks simultaneously namely UPOS tagging, UFeat tagging, Lemmatization, Dependency-parsing and ED-parsing. Furthermore we utilise the linguistic typology available in the WALS database to improve the ability of our proposed end-to-end parser to transfer across languages. Results show that our proposed Seq2seq ED-parser performs on par with state-of-the-art ED-parser despite having a much simpler de- sign.

enhanced dependency parser enhanced dependency محاضر التبعية المحسن تعزيز الاعتماد صناعة حمض الفوسفور

Avengers, Ensemble! Benefits of ensembling in grapheme-to-phoneme prediction

845 - Association for Computation Linguistics 2021 مقالة

We describe three baseline beating systems for the high-resource English-only sub-task of the SIGMORPHON 2021 Shared Task 1: a small ensemble that Dialpad's speech recognition team uses internally, a well-known off-the-shelf model, and a larger ensem ble model comprising these and others. We additionally discuss the challenges related to the provided data, along with the processing steps we took.

المعرفة اللغوية avengers المنتقمون فرقة صناعة حمض الفوسفور

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Linguistic Knowledge in Multilingual Grapheme-to-Phoneme Conversion

المعرفة اللغوية في التحويل متعدد اللغات

Ask ChatGPT about the research

Read More

suggested questions