Do you want to publish a course? Click here

Generation and Analysis of Vowel Polygons for Syrian Dialects Using a Created Speech Database

توليد و تحليل المضلعات الصوتية للهجات سورية اعتماداً على قاعدة بيانات صوتية مُنجَزة

1623   0   16   0 ( 0 )
 Publication date 2015
and research's language is العربية
 Created by Shamra Editor




Ask ChatGPT about the research

Speech databases form the main foundation in the construction of automatic utterance, speaker recognition and speech recognition systems in different languages and dialects. The elements of the speech database are audio files recorded for people's voices in the required language or dialect. The more the speech database is enriched with comprehensive elements the more it contributes to produce systems that communicate with the excellent performed machine. According to the lack of speech databases for the Syrian dialects, the research did one. The created database contained sixteen voluntaries from different Syrian dialects. Voluntaries' voices were recorded in different recording conditions that is for studying the effect of variety of dialects, gender and the conditions of recording on the vowel polygons. This research invested the created speech database in the field of generating and analyzing of vowel polygons, as the vowel polygon is a geometric polygon where its vertices represent the values of formant frequencies, and the area of the polygon represents the output acoustic space.


Artificial intelligence review:
Research summary
تتناول هذه الدراسة إنشاء قاعدة بيانات صوتية للهجات السورية وتحليل المضلعات الصوتية الناتجة عنها. تم تسجيل أصوات 16 متحدثًا من مختلف المناطق السورية بلهجات متنوعة وظروف تسجيل مختلفة. يهدف البحث إلى دراسة تأثير تنوع اللهجات والجنس وظروف التسجيل على مساحات المضلعات الصوتية. تم استخدام خوارزمية MFCC لاستخلاص ترددات النغمات الصوتية وتحليلها. النتائج أظهرت تباينًا في مساحات المضلعات الصوتية بين التسجيل الاحترافي والتسجيل العادي، وكذلك بين الذكور والإناث. توصي الدراسة بتوسيع قاعدة البيانات لتشمل فئات عمرية مختلفة ودراسة تأثير العمر على المجال الصوتي.
Critical review
دراسة نقدية: تُعتبر هذه الدراسة خطوة مهمة نحو فهم الخصائص الصوتية للهجات السورية، إلا أنها تفتقر إلى شمولية أكبر من حيث عدد المتحدثين وتنوع الأعمار. كما أن الاعتماد على تسجيلات في ظروف مختلفة قد يؤثر على دقة النتائج. يُفضل أن يتم استخدام تقنيات تسجيل موحدة لضمان تجانس البيانات. بالإضافة إلى ذلك، يمكن أن تكون الدراسة أكثر فائدة إذا تضمنت تحليلًا أعمق لتأثير العوامل الاجتماعية والثقافية على اللهجات.
Questions related to the research
  1. ما الهدف الرئيسي من إنشاء قاعدة البيانات الصوتية للهجات السورية؟

    الهدف الرئيسي هو دراسة تأثير تنوع اللهجات والجنس وظروف التسجيل على مساحات المضلعات الصوتية وتطوير نظم حاسوبية للتعرف على الكلام والنطق الآلي للهجات السورية.

  2. ما هي خوارزمية MFCC المستخدمة في الدراسة؟

    خوارزمية MFCC (Mel Frequency Cepstral Coefficients) هي خوارزمية تُستخدم لاستخلاص السمات الصوتية من الإشارات الصوتية، وهي تُستخدم بشكل واسع في تحليل ومعالجة الصوتيات.

  3. ما هي النتائج الرئيسية التي توصلت إليها الدراسة؟

    النتائج أظهرت تباينًا في مساحات المضلعات الصوتية بين التسجيل الاحترافي والتسجيل العادي، وكذلك بين الذكور والإناث، حيث كانت مساحات المضلعات الصوتية الناتجة عن التسجيل العادي أكبر من تلك الناتجة عن التسجيل الاحترافي.

  4. ما هي التوصيات التي قدمتها الدراسة لتحسين قاعدة البيانات الصوتية؟

    توصي الدراسة بتوسيع قاعدة البيانات الصوتية لتشمل تسجيلات لأشخاص من فئات عمرية مختلفة ودراسة تأثير العمر على المجال الصوتي، وكذلك بناء قاعدة بيانات للأصوات الهاتفية للمتحدثين السوريين.


References used
STANEK, M., SIGMUND, M. Speaker Dependent Changes in Formants Based on Normalization of Vowel Triangle. In Proc. 23rd International Conference RADIOELEKTRONIKA. Pardubice. Czech Republic, 2013, pp. 337-341
ALGHAMDI, M. Analysis, Synthesis and Perception of Voicing in Arabic. Al- ToubahBookshop, Riyadh. 2004, P. 50
KENSTOWICZ, M. Parametric variation and accent in the Arabic dialects, 1983, CLS19: 205-213
rate research

Read More

In this research, a new comparison criterion was proposed to study properties of the audio signal for each of the varieties of smokers and non-smoking persons. For this purpose, a database for smokers has been created. The smoker database contains 12 Syrian native speakers, six of them were smokers and the others were non-smokers. The smokers had been smoking for more than 10 years. All speakers were men and their ages ranging between 35 and 42 years old. They live in rural towns and speak the same dialect. Syrian vowels can be classified into long vowels and short ones. The long vowels are /AA/, /UU/, /II/ pronounced as ([ ي, و, ا ]) and the short vowels are /A/, /U/, /I/ pronounced as ([ كسرة, ضمة, فتحة ]). In this study, the Speakers have to pronounce the following sentence /I love Syria/ pronounced as ([ أَنَاْ أَحَبُّ سُوْرِيْة ]), and it was spoken during three hours. This sentence is rich with vowels. For each speaker, a long vowel triangle in ten planes and a short vowel triangle in ten planes as well were generated and analyzed. A new criterion was suggested to determine the most suitable vowel triangle for smoker distinction. This criterion depends on calculating the different distances among all centers of vowel triangles in each plane and determining the minimal distance called d. For each plane, the most suitable vowel triangle had been set as AIU35 short vowel triangle and AAIIUU45 long vowel triangle.
The importance of research lies in the need to keep pace with the technological development of computer systems and technologies Modern methods, especially geographic information systems, in collecting, storing, analyzing and exiting Spatial inform ation and linking it to metadata, modeling and scenarios Planners and decision-makers to assist them in planning and finding appropriate solutions for various problems.
This paper presents a method integrating database with Jgroup based on Hibernate, which is one of Object Relational Mapping tools. We compare between the performance of Jgroup integrated with Hibernate and the performance of RMI integrated with Hibernate. The results show that Jgroup/Hibernate outperforms RMI/Hibernate when the number of clients increases.
The research aims to study how to add new components to Multisim database. Or how to model a component using the programing language C++ , to use this new component later in designing and making electronic circuits and devices. Multisim has built- in models for most types of devices, , the study aims to lay the foundations and method for modeling of electronic items which is not located within the Multisim program database, (or present with different values) , and that we need while using this program in the modeling and simulation process for a given circuit. Code modeling method has been proposed to reach this goal; this method relies on the behavior of the device or the modeled component. The study shows how to create a Code model for a specific capacitor that has different values to those existing within the database and add to it.
Most of the contracting companies suffer from poor coordination between the projects that they implement at the same time, and in Shauria, the situation is getting worse due to the absence of the application of modern methodologies in project managem ent, which causes a great waste of time and effort, especially for organizing and retrieving dozens of documents related to planning and follow-up, and consequently an increase in time, cost and poor quality.
comments
Fetching comments Fetching comments
Sign in to be able to follow your search criteria
mircosoft-partner

هل ترغب بارسال اشعارات عن اخر التحديثات في شمرا-اكاديميا