Subscribe to the gold package and get unlimited access to Shamra Academy

Toward a HIAST Arabic Text to Speech System Using Semi Syllables Concatenation and Natural Prosody

نحو نظام لتركيب الكلام باللغة العربية من نصوص في المعهد العالي للعلوم التطبيقية و التكنولوجيا باستعمال الضم لأنصاف مقاطع صوتية و تنغيم طبيعي

2219 1 87 0 ( 0 )

Download Cite

Added by Higher Institute for Applied Sciences and Technology أطروحة دكتوراه

Publication date 2018

and research's language is العربية

Authors عفاف الشلبي( طالب ) - أميمة الدكاك( مشرف ) - ندى غنيم( مشرف )

Created by Shamra Editor

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

In the present work, we present our Arabic Semi-Syllable Synthesizer. The work consists of seven steps: (1) building a Semi-Syllable Speech Database for Arabic Semi-Syllable Synthesizer, (2) building the Natural Language Processing Module which comprises a Text Pre-processing Module and a Text to Phoneme conversion using Arabic Transcription from Orthographic to Phonemes, (3) followed by a Phoneme to Semi-Syllables Mapping using a Syllabification Expert System, (4) an Acoustic Word Stress Analysis for Continuous Arabic Speech based on the three prosodic parameters (fundamental frequency, intensity, duration) in order to detect stressed syllables.

Artificial intelligence review:

Upgrade your account to view the content

Research summary

تقدم هذه الأطروحة نظامًا لتركيب الكلام باللغة العربية باستخدام تقنية ضم أنصاف المقاطع الصوتية. يتألف النظام من سبع مراحل أساسية تشمل بناء قاموس صوتي لأنصاف المقاطع، ومعالجة اللغة الطبيعية، ونظام خبير لتقطيع الصوتيمات، ودراسة تحليلية صوتية للنبر، ونظام خبير لإسناد النبر، ومعالجة الإشارة الرقمية باستخدام تقنية TD-PSOLA، واختبار النظام على مدونات نصية وصوتية. أظهرت النتائج دقة عالية في التقطيع وجودة عالية للكلام المركب مقارنةً بمركبات الكلام الأخرى المتاحة للغة العربية. كما تضمنت الدراسة تحليلًا صوتيًا للنبر في الحديث المتواصل، حيث تبين أن النبر يرتبط بشكل كبير بموسط المدة والمطال، وتميزت قواعد إسناد النبر بثلاث درجات: النبر الأولي، والثانوي، والمعدوم. كما جرى اختبار وضوح وفهم وطبيعية الكلام المركب باستخدام مدونات نصية متنوعة، وأظهرت النتائج تحسنًا ملحوظًا في جودة الكلام المركب بعد إضافة النبر.

Critical review

دراسة نقدية: تعتبر هذه الأطروحة خطوة مهمة في مجال تركيب الكلام باللغة العربية، حيث تقدم نظامًا متكاملًا يعتمد على ضم أنصاف المقاطع الصوتية وتوليد التنغيم الطبيعي. ومع ذلك، يمكن الإشارة إلى بعض النقاط التي قد تحتاج إلى تحسين. أولاً، لم يتم تضمين مشكل آلي للنصوص غير المشكولة، مما قد يقلل من فعالية النظام في التطبيقات العملية. ثانيًا، على الرغم من أن النظام أظهر تحسنًا في جودة الكلام المركب، إلا أن هناك مجالًا لتحسين التنغيم على مستوى الجمل بشكل آلي. أخيرًا، يمكن أن يكون هناك توجه نحو دمج النظام مع تقنيات تعلم الآلة الحديثة لتحسين دقة النتائج وزيادة طبيعية الكلام المركب.

Questions related to the research

ما هي تقنية ضم أنصاف المقاطع الصوتية المستخدمة في هذه الأطروحة؟

تقنية ضم أنصاف المقاطع الصوتية هي طريقة تستخدم لدمج أنصاف المقاطع الصوتية لتوليد الكلام المركب، حيث يتم تقسيم الكلمات إلى أنصاف مقاطع صوتية ثم دمجها باستخدام تقنية TD-PSOLA لتوليد الكلام المركب بجودة عالية.
ما هي المراحل الأساسية التي يتألف منها نظام تركيب الكلام في هذه الأطروحة؟

يتألف النظام من سبع مراحل أساسية تشمل بناء قاموس صوتي لأنصاف المقاطع، ومعالجة اللغة الطبيعية، ونظام خبير لتقطيع الصوتيمات، ودراسة تحليلية صوتية للنبر، ونظام خبير لإسناد النبر، ومعالجة الإشارة الرقمية باستخدام تقنية TD-PSOLA، واختبار النظام على مدونات نصية وصوتية.
ما هي النتائج التي أظهرتها الدراسة التحليلية الصوتية للنبر في الحديث المتواصل؟

أظهرت الدراسة التحليلية الصوتية للنبر أن النبر يرتبط بشكل كبير بموسط المدة والمطال، وتميزت قواعد إسناد النبر بثلاث درجات: النبر الأولي، والثانوي، والمعدوم.
ما هي التحديات التي تواجه نظام تركيب الكلام المطور في هذه الأطروحة؟

من التحديات التي تواجه النظام عدم تضمين مشكل آلي للنصوص غير المشكولة، والحاجة إلى تحسين التنغيم على مستوى الجمل بشكل آلي، بالإضافة إلى إمكانية دمج النظام مع تقنيات تعلم الآلة الحديثة لتحسين دقة النتائج وزيادة طبيعية الكلام المركب.

Keywords

تركيب الكلام اللغة العربية ضم أنصاف المقاطع الصوتية النبر معالجة الإشارة الرقمية TD-PSOLA

References used

A. A. Almisreb A. F. Abidin, N. M. Tahir An acoustic investigation of Arabic vowels pronounced by Malay speakers [Article] // Journal of King Saud University – Computer and Information Sciences . - [s.l.] : Conference: 2016 IEEE 12th International Colloquium on Signal Processing & Its Applications (CSPA), 2016. - 28, 148–156

A. Al.Shalaby O. Dakkak, N.Alawa Automatic Prosody Generation for Arabic Text To Speech Systems [Article] // Damascus University Journal for the Basic Sciences. - 2013. - Vol. 29 - No. 1

A. Almisreb A. F. Abidin, N. Md .Tahir An acoustic investigation of Arabic vowels pronounced by Malay speakers [Article] // Journal of King Saud University - Computer and Information Sciences. - 2016. - Vol.(28) Issue(2), Pages 148-156

rate research

Automatic Prosody Generation for Arabic Text- To - Speech Systems

2074 - Damascus University 2011 ورقة بحثية

The main purpose of the present research is to support Arabic Text- to - Speech synthesizers, with natural prosody, based on linguistic analysis of texts to synthesize, and automatic prosody generation, using rules which are deduced from recorded s ignals analysis, of different types of sentences in Arabic. All the types of Arabic sentences (declarative and constructive) were enumerated with the help of an expert in Arabic linguistics . A textual corpus of about 2500 sentences covering most of these types was built and recorded both in natural prosody and without prosody. Later, these sentences were analyzed to extract prosody effect on the signal parameters, and to build prosody generation rules. In this paper, we present the results on negation sentences, applied on synthesized speech using the open source tool MBROLA. The results can be used with any parametric Arabic synthesizer. Future work will apply the rules on a new Arabic synthesizer based on semi-syllables units, which is under development in the Higher Institute for Applied Sciences and Technology.

تركيب الكلام من نصوص للغة العربية موسطات التنغيم قواعد لتوليد التغيم آلياً تحليل لغوي مدونة نصية مدونة كلامية تحليل الإشارة الكلامية Arabic Text To Speech Prosodic Parameters Automatic Prosody Generation Rules Linguistic analysis Text Corpus Speech Corpus Speech Signal Analysis المزيد..

Towards a system for constructing Arabic Ontology based on natural text

2589 - Damascus University 2011 ورقة بحثية

This paper presents ArOntoLearn, a Framework for Arabic Ontology learning from textual resources. Supporting Arabic language and using domain knowledge in the learning process are the main features of our framework. Besides it represents the learne d ontology in Probabilistic Ontology Model (POM), which can be translated into any knowledge representation formalism, and implements data-driven change discovery. Therefore it updates the POM according to the corpus changes only, and allows user to trace the evolution of the ontology with respect to the changes in the underlying corpus. Our framework analyses Arabic textual resources, and matches them to Arabic Lexico-syntactic patterns in order to learn new Concepts and Relations. Supporting Arabic language is not that easy task, because current linguistic analysis tools are not efficient enough to process unvocalized Arabic corpuses that rarely contain appropriate punctuation. So we tried to build a flexible and freely configured framework whereas any linguistic analysis tool can be replaced by more sophisticated one whenever it is available.

Ontology تعلم الأنطولوجية المعالجة الآلية للغة العربية استحصال المعرفة نموذج الأنطولوجية الاحتمالي Ontology Learning Knowledge Acquisition Arabic Natural Language Processing المزيد..

The IDC System for Sentiment Classification and Sarcasm Detection in Arabic

1012 - Association for Computation Linguistics 2021 مقالة

Sentiment classification and sarcasm detection attract a lot of attention by the NLP research community. However, solving these two problems in Arabic and on the basis of social network data (i.e., Twitter) is still of lower interest. In this paper w e present designated solutions for sentiment classification and sarcasm detection tasks that were introduced as part of a shared task by Abu Farha et al. (2021). We adjust the existing state-of-the-art transformer pretrained models for our needs. In addition, we use a variety of machine-learning techniques such as down-sampling, augmentation, bagging, and usage of meta-features to improve the models performance. We achieve an F1-score of 0.75 over the sentiment classification problem where the F1-score is calculated over the positive and negative classes (the neutral class is not taken into account). We achieve an F1-score of 0.66 over the sarcasm detection problem where the F1-score is calculated over the sarcastic class only. In both cases, the above reported results are evaluated over the ArSarcasm-v2--an extended dataset of the ArSarcasm (Farha and Magdy, 2020) that was introduced as part of the shared task. This reflects an improvement to the state-of-the-art results in both tasks.

idc system نظام IDC تصنيف المعنويات صناعة حمض الفوسفور

Linguistic performances of Arabic language teachers and its relation to their Attitudes toward Teaching

2259 - Tishreen University 2017 ورقة بحثية

The study aimed at investigating linguistic performances of the teachers of Arabic language and their relation to their attitudes towards teaching. The sample of the study consisted of 40 Arabic teachers from the public schools in the Northeastern Badia Directorate of Education. To achieve the purpose of study, analytical descriptive approach was used. The instruments of the study were a note card, and a measure of trends towards the teaching. The results of the study showed that the linguistic performances of Arabic teachers and their attitudes toward teaching were medium which indicates a strong correlation between their linguistic performances and their attitudes toward teaching.

الأداء اللغوي الاتجاهات نحو تدريس اللغة العربية Language Performance Attitudes towards Teaching Arabic Language

Arabic words Ira'b using expert system

2417 - Damascus University 2018 ورقة بحثية

An expert system was developed to consider words' grammar case in Arabic phrases without diacritics. First, the system gets words' morphology and tags using Microsoft tool (ATK), then it depends on Arabic grammar to get words' grammar case in nominal phrases. The system gave a very good results as they compared with Arabic language expert.

Nlp Expert System Arabic morphology Arabic grammar

comments

Fetching comments

Information Technology Institute ITI

Additional details More universities

Toward a HIAST Arabic Text to Speech System Using Semi Syllables Concatenation and Natural Prosody

نحو نظام لتركيب الكلام باللغة العربية من نصوص في المعهد العالي للعلوم التطبيقية و التكنولوجيا باستعمال الضم لأنصاف مقاطع صوتية و تنغيم طبيعي

Ask ChatGPT about the research

Read More