
Improved pronunciation prediction accuracy using morphology


Publication date: 2021
Research language: English
Created by Shamra Editor





Pronunciation lexicons and prediction models are a key component in several speech synthesis and recognition systems. We know that morphologically related words typically follow a fixed pattern of pronunciation which can be described by language-specific paradigms. In this work we explore how deep recurrent neural networks can be used to automatically learn and exploit this pattern to improve the pronunciation prediction quality of words related by morphological inflection. We propose two novel approaches for supplying morphological information, using the word's morphological class and its lemma, which are typically annotated in standard lexicons. We report improvements across a number of European languages with varying degrees of phonological and morphological complexity, and two language families, with greater improvements for languages where the pronunciation prediction task is inherently more challenging. We also observe that combining bidirectional LSTM networks with attention mechanisms is an effective neural approach for the computational problem considered, across languages. Our approach seems particularly beneficial in the low-resource setting, both by itself and in conjunction with transfer learning.
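One way to picture how a word's morphological class and lemma could be supplied to a sequence-to-sequence pronunciation model is to fold them into the input token sequence. The sketch below illustrates only that idea. The function name `build_input`, the angle-bracket tag format, and the `<sep>` token are hypothetical and are not taken from the paper, whose actual encoding may differ.

```python
from typing import List, Optional


def build_input(
    word: str,
    morph_class: Optional[str] = None,
    lemma: Optional[str] = None,
) -> List[str]:
    """Build an encoder input sequence for a grapheme-to-phoneme model.

    Assumed (hypothetical) layout:
        [<CLASS>] + graphemes of the word + [<sep> + graphemes of the lemma]
    """
    tokens: List[str] = []
    if morph_class is not None:
        # A single tag token carrying the morphological class, e.g. "<VBD>".
        tokens.append(f"<{morph_class}>")
    # One token per character of the surface word.
    tokens.extend(word)
    if lemma is not None:
        # A separator token followed by the characters of the lemma.
        tokens.append("<sep>")
        tokens.extend(lemma)
    return tokens


example = build_input("walked", morph_class="VBD", lemma="walk")
```

With this layout, a baseline model sees only the characters of `walked`, while the morphology-aware variants also see the class tag, the lemma, or both. The resulting token list would then be mapped to integer ids and fed to the encoder (for instance a bidirectional LSTM with attention, as the abstract mentions).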



References used
https://aclanthology.org/

Read More

Morphological tasks have gained decent popularity within the NLP community in recent years, with large multilingual datasets providing morphological analysis of words, either in or out of context. However, the lack of a clear linguistic definition for words destines the annotative work to be incomplete and mired in inconsistencies, especially cross-linguistically. In this work we expand morphological inflection of words to inflection of sentences to provide true universality disconnected from orthographic traditions of white-space usage. To allow annotation for sentence inflection we define a morphological annotation scheme by a fixed set of inflectional features. We present a small cross-linguistic dataset including semi-manually generated simple sentences in 4 typologically diverse languages annotated according to our suggested scheme, and show that the task of reinflection gets substantially more difficult but that the change of scope from words to well-defined sentences allows interface with contextualized language models.
This paper describes ongoing work aiming at adding pronunciation information to lexical semantic resources, with a focus on open wordnets. Our goal is not only to add a new modality to those semantic networks, but also to mark heteronyms listed in them with the pronunciation information associated with their different meanings. This work could contribute in the longer term to the disambiguation of multi-modal resources, which combine text and speech.
Point-of-interest (POI) type prediction is the task of inferring the type of a place from where a social media post was shared. Inferring a POI's type is useful for studies in computational social science including sociolinguistics, geosemiotics, and cultural geography, and has applications in geosocial networking technologies such as recommendation and visualization systems. Prior efforts in POI type prediction focus solely on text, without taking visual information into account. However, in reality, the variety of modalities, as well as their semiotic relationships with one another, shape communication and interactions in social media. This paper presents a study on POI type prediction using multimodal information from text and images available at posting time. For that purpose, we enrich a currently available data set for POI type prediction with the images that accompany the text messages. Our proposed method extracts relevant information from each modality to effectively capture interactions between text and image, achieving a macro F1 of 47.21 across 8 categories and significantly outperforming the state-of-the-art method for POI type prediction based on text-only methods. Finally, we provide a detailed analysis to shed light on cross-modal interactions and the limitations of our best performing model.
In this work we discuss several predictive methods for time series: decomposing a time series into its components (trend, seasonality, cycle, random), exponential smoothing, and ARIMA. We then discuss some combining methods and form a new combination for time series prediction, which combines exponential smoothing and ARIMA using a weighted average with MAPE weights. We apply all of the above methods to three seasonal time series: first, hourly temperature in Aleppo in August 2011; second, monthly milk production per cow in Australia from Jan 1962 to Dec 1975; third, quarterly electricity production in Australia from Mar 1956 to Sep 1994. We compare the results, which show that the suggested method is the best.
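The abstract above does not spell out how the MAPE weights are formed. A common choice, assumed here, is to weight each model in inverse proportion to its MAPE on a validation period, so that the more accurate model contributes more. The following is a minimal sketch under that assumption; the function names and the sample numbers are hypothetical, and the forecasts are plain lists standing in for the output of exponential smoothing and ARIMA models.

```python
from typing import List, Sequence


def mape(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Mean absolute percentage error, in percent. Actual values must be nonzero."""
    errors = [abs((a - p) / a) for a, p in zip(actual, predicted)]
    return 100.0 * sum(errors) / len(errors)


def inverse_mape_weights(mapes: Sequence[float]) -> List[float]:
    """Weights proportional to 1 / MAPE, normalized to sum to 1 (an assumption)."""
    inverse = [1.0 / m for m in mapes]
    total = sum(inverse)
    return [w / total for w in inverse]


def combine(forecasts: Sequence[Sequence[float]], weights: Sequence[float]) -> List[float]:
    """Weighted average of several forecasts, point by point."""
    horizon = len(forecasts[0])
    return [sum(w * f[i] for w, f in zip(weights, forecasts)) for i in range(horizon)]


# Hypothetical validation data and model forecasts for the same period.
actual = [100.0, 200.0]
smoothing_fit = [110.0, 180.0]   # stand-in for exponential smoothing output
arima_fit = [105.0, 190.0]       # stand-in for ARIMA output

weights = inverse_mape_weights([mape(actual, smoothing_fit), mape(actual, arima_fit)])

# Combine hypothetical out-of-sample forecasts from the two models.
combined = combine([[120.0], [90.0]], weights)
```

In this toy example the ARIMA stand-in has half the MAPE of the smoothing stand-in, so it receives twice the weight in the combined forecast.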
Research objectives: 1. A theoretical study of the importance and impact of sales forecasting accuracy on production, marketing, and distribution plans. 2. A literature review of data mining and of forecasting using time series and neural networks. 3. Using artificial neural networks to increase the accuracy of forecasting the monthly sales volume of the Al-Fanar company. 4. Testing whether neural networks outperform the moving-average and regression models in forecasting.
