New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Analysis study about (MFCC and Endpoint) algorithms and the extent of their impact in voice recognition rates

دراسة تحليلية لخوارزميتي ( MFCC و ENDPOINT) و مدى تأثيرهما في نسب التعرف على الصوت

3811 7 187 0 ( 0 )

Download Cite

Added by Tishreen University ورقة بحثية

Publication date 2016

and research's language is العربية

Authors دعد يوسف الكعدي( باحث )

Created by Shamra Editor

Neural network المتكلم الكلام السمات الشبكات العصبية speaker speech feature

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

Voice recognition includes two basic parts: speech and speaker recognition. These recognition processes consider as the most important processes of modern technologies, many systems has been developed that differ in the methods used to extract features and classification ways to support recognition systems of this type. The study was conducted in this research on the previous subject, where the system is designed to recognize the speaker and his voice orders and focus on several complementary algorithms to carry out the research. we conducted an analytical study on MFCC algorithm used in the extraction of features, and it has been studying two parameters the number of filters in the filters bank and the number of features that taken from each frame and the impact of these two parameters in the recognition rate and the relationship of these two parameters on each other. It was the use of feed forwarding back propagation neural networks performance analysis as characteristics and we analyze the performance of the network to gain access to the best features and components to the process of achieving recognition. And it has been studying Endpoint algorithm that used to remove periods of silence and its impact on voice recognition rates.

Artificial intelligence review:

Upgrade your account to view the content

Research summary

تتناول هذه الدراسة تحليل خوارزميتي MFCC وEndpoint ومدى تأثيرهما على نسب التعرف على الصوت. تتضمن عملية التعرف على الصوت قسمين رئيسيين: التعرف على الكلام والتعرف على المتكلم. تم تصميم نظام للتعرف على المتكلم وأوامره الصوتية باستخدام عدة خوارزميات مكملة. تم إجراء دراسة تحليلية لخوارزمية MFCC التي تستخدم لاستخراج السمات الصوتية، مع التركيز على تأثير عدد المرشحات في بنك المرشحات وعدد السمات المأخوذة من كل إطار على نسب التعرف. كما تم استخدام الشبكات العصبية ذات التغذية الأمامية والانتشار الخلفي للخطأ (FFBPNN) كمصنف، وتم تحليل أداء الشبكة للوصول إلى أفضل خصائص ومكونات لتحقيق عملية التعرف. بالإضافة إلى ذلك، تمت دراسة خوارزمية Endpoint المستخدمة لإزالة فترات الصمت وتأثيرها على نسب التعرف على الصوت. توصلت الدراسة إلى أن زيادة عدد المرشحات في بنك المرشحات وعدد السمات المأخوذة من كل إطار يؤدي إلى تحسين نسب التعرف حتى حد معين، وبعد ذلك تثبت النسب. تم الحصول على أعلى نسبة تعرف قدرها 90.74% عند التعرف على الأوامر الصوتية و87.50% عند التعرف على المتكلم. توصي الدراسة باستخدام بنك مرشحات مكون من عدد مرشحات بين 24-35 واختيار عدد سمات أصغر من عدد المرشحات بقليل لتحقيق أفضل نتائج في التعرف على الصوت.

Critical review

دراسة نقدية: تعتبر هذه الدراسة خطوة مهمة في مجال التعرف على الصوت، حيث تقدم تحليلاً دقيقاً لخوارزميتي MFCC وEndpoint وتأثيرهما على نسب التعرف. ومع ذلك، يمكن تحسين الدراسة من خلال توسيع قاعدة البيانات المستخدمة لتشمل مجموعة أكبر من المتكلمين والأوامر الصوتية، مما يزيد من دقة النتائج وموثوقيتها. كما يمكن دراسة تأثير خوارزميات أخرى لإزالة فترات الصمت ومقارنتها بخوارزمية Endpoint المستخدمة في هذه الدراسة. بالإضافة إلى ذلك، يمكن تحسين الدراسة من خلال تحليل تأثير الضوضاء البيئية على نسب التعرف وتقديم حلول للتعامل معها. بشكل عام، تعتبر الدراسة قيمة وتقدم نتائج مفيدة، ولكن يمكن تحسينها من خلال توسيع نطاق البحث وتحليل المزيد من العوامل المؤثرة على نسب التعرف.

Questions related to the research

ما هي الخوارزميات التي تم تحليلها في الدراسة؟

تم تحليل خوارزميتي MFCC وEndpoint في الدراسة.
ما هي أعلى نسبة تعرف تم تحقيقها في الدراسة؟

تم تحقيق أعلى نسبة تعرف قدرها 90.74% عند التعرف على الأوامر الصوتية و87.50% عند التعرف على المتكلم.
ما هو تأثير زيادة عدد المرشحات في بنك المرشحات على نسب التعرف؟

زيادة عدد المرشحات في بنك المرشحات تؤدي إلى تحسين نسب التعرف حتى حد معين، وبعد ذلك تثبت النسب.
ما هي التوصيات التي قدمتها الدراسة لتحسين نسب التعرف؟

توصي الدراسة باستخدام بنك مرشحات مكون من عدد مرشحات بين 24-35 واختيار عدد سمات أصغر من عدد المرشحات بقليل لتحقيق أفضل نتائج في التعرف على الصوت.

Keywords

التعرف على الصوت التعرف على الكلام التعرف على المتكلم خوارزمية MFCC خوارزمية Endpoint الشبكات العصبية

References used

CARROLL, T.;COLANGELO, R.;STROTT, T."Bird Call Identifier –Identifying Songs of Bird Species through Digital Signal Processing Techniques". 2010,118

Xue, X."Joint Speech and Speaker Recognition Using Neural Networks".NOVIA-University of applied science. 2013,60

(CHOUDHARY, A.;KSHIRSAGAR,R."Process Speech Recognition System using Artificial Intelligence Technique".(IJSCE) ,ISSN: 2231-2307, Volume-2, Issue-5, 2012,PP(239-242

rate research

A Study of The Effectiveness and Sound Quality in Audio Compression Algorithms

1924 - Tishreen University 2016 ورقة بحثية

The sound is an essential component of multimedia, and due to the needto be used in many life applications such as television broadcasting andcommunication programs, so it was necessary for the existence of audio signal processing techniquessuch as compressing, improving, and noisereduction. Data compression process aims to reduce the bit rate used, by doing encoding information using fewer bits than the original representation for transmitting and storing. By this process,the unnecessary information is determined and removed, that means it gives the compressed information for useable compression, which we need as a fundamental, not the minutest details. This research aims to study how to process sound and musical signal. It's a process that consists of a wide range of applications like coding and digital compression for the effective transport and storage on mobile phones and portable music players, modeling and reproduction of the sound of musical instruments and music halls and the harmonics of digital music, editing digital music, and classification of music content, and other things.

تحويل التجب المتقطع التعديل النبضي المرمز معدل أخذ العينات خوارزمية MPEG (Pulse code modulation (PCM Sample rate MPEG (Moving Pictures Experts Groups) Algorithm (Discrete Cosine Transform (DCT المزيد..

A comparative study of compression algorithms and their impact on data communication in networks

1119 - Tishreen University 2018 ورقة بحثية

Due to the large increase in the use of data communication and information exchange services of different types in different environments, the standard and the programming had to be a language of characterization is ideal for scalability and develo pment that serve the growing needs in the best form and in the shortest possible time and was the most widely used language and the most widely used XML language. he adoption of graphics architecture sometimes created a problem affecting the performance of information transmission networks due to the large volume of data exchanged as well as the need for large storage capacity at both ends of the transmission and reception. Effective ways of reducing the amount of data exchanged through the network had to be found. There have been many scientific researches and practical experiments on finding effective ways to reduce the actual size of the data and by adopting different parameters that affect the process of compressing the files so as to achieve better results by reducing the volumes of files exchanged with attention to times of compression and decompression of files. In this research, we focused on the study and comparison of some compression algorithms for files and their effect on data communication in networks.

XML Extensible Markup Language Compression Ratio Factor CRF SOAP protocol Compression Time CT Compression Ratio CR Gun Zip

Study the impact of changing the number of users on the performance of the protocols of voice over the Internet H.323 and SIP

1813 - Aِl-Baath University 2017 ورقة بحثية

In this paper, we assess the Voice Over Internet Protocol performance by comparing the performance of two protocols used in VOIP such as SIP and H.323. Moreover, we evaluate the quality indicators such as delay and packets loss. For this purpose OPNET simulator is used as suitable simulation technology.

Delay بروتوكول نقل الصوت VOIP بروتوكول تهيئة الجلسة SIP بروتوكول H323 تأخير زمني ضياع الرزم المكالمات الفعالة Voice Over Internet VOIP SIP H.323 packets loss active calls المزيد..

An analytical study of high-tech building materials and their impact on the sustainability of the building

3041 - Aِl-Baath University 2017 ورقة بحثية

The concept of sustainability in architecture from the perspective of architectural thought focuses on creating a successful relationship between the building and the user and the environment through sustainable design principles and the preserva tion of these principles, whether physical Ooualemanoa to maintain and build on it, the advanced materials research a high priority, where he will witness the construction further sector of evolution, and issues related to smart and sustainable buildings become more important. That was supported us to analys high-tech building materials to know their impact on the sustainability of buildings

الاستدامة Buildings sustainable مباني مباني ذكية smart buildings

The extent of commitment to the International Accounting Standards by accountants in Syria and Lebanon at recognition and measurement of tangible fixed assets (Survey of sample study)

1713 - Damascus University 2014 ورقة بحثية

The International Accounting Standards have gained a wide international approval where they attempted to unify accounting practices on an international level to help investors and others in the process of decision- making on a unified basis. Numero us studies in Arab countries have proved the importance of adopting and implementing these standards. Therefore, this research questions the extent of implementing the International Accounting Standards in two Arab countries: Syria and Lebanon, as regards to recognition and measurement of tangible fixed assets. This was done using a questionnaire distributed to two samples of accountants in both countries. The result that was reached verifies that accountants in both countries do not fully implement the International Accounting Standard Number 16 (property, plant and equipment). However, their accounting practice does largely approach this standard but in different sections. This makes any comparison between the opportunities available for investors in both countries lacking as it cannot be based upon any unified grounds. The research also examined the most important points which are not implemented by accountants in both countries, with regard to this standard.

international accounting standards Syria سورية لبنان معايير المحاسبة الدولية الموجودات الثابتة المادية Lebanon tangible fixed assets المزيد..

comments

Fetching comments

Cordoba Private University

Additional details More universities

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Analysis study about (MFCC and Endpoint) algorithms and the extent of their impact in voice recognition rates

دراسة تحليلية لخوارزميتي ( MFCC و ENDPOINT) و مدى تأثيرهما في نسب التعرف على الصوت

Ask ChatGPT about the research

Read More