Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

طريقة النص المستقل لتحديد هوية المتحدث باستخدام صوته

1326 1 11 0 ( 0 )

Download Cite

Added by Aِl-Baath University ورقة بحثية

Publication date 2016

fields Informatics Engineering

and research's language is العربية

Authors حسان محمد أحمد( باحث )

Created by Shamra Editor

Gaussian mixture model بصمة الصوت نبرة الصوت تحديد هوية الشخص التحقق من هوية الشخص سمات الصوت سبستروم الإشارة الصوتية النموذج الصوتي النموذج الخلفي العام نموذج خليط غاوس voice identification verification voice features voice signal cepstrum

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

In this paper, the text-independent method of person voice identification based on the features extraction from speech signal that characterize the linear prediction of the behavior of the autocorrelation function of the voice signal cepstrum are considered and developed.

Artificial intelligence review:

Upgrade your account to view the content

Research summary

تتناول هذه الورقة البحثية طريقة النص المستقل لتحديد هوية المتحدث باستخدام صوته، حيث تعتمد على استخراج الميزات من الإشارة الصوتية التي تميز التنبؤ الخطي لسلوك دالة الترابط الذاتي لسبستروم الإشارة الصوتية. يتم بناء نموذج صوتي للشخص على أساس متجه الميزات باستخدام نموذج خليط غاوس (GMM) الأكثر معقولية. يتم تنفيذ عملية تحديد الهوية عن طريق اختيار النموذج الذي يمتلك أعلى احتمال لاحق لاستعادته بواسطة الإشارة الصوتية المدخلة. أظهرت الطريقة المدروسة دقة عالية وكافية لتحديد هوية المتحدث باستخدام الصوت بشكل مستقل عن النص، مقارنة بالنتائج العالمية في هذا المجال. تعتمد الطريقة على متطلبات منخفضة لجودة الإشارة الصوتية وتبعية معتدلة لشروط تسجيل الإشارة الصوتية. تم اختبار الطريقة باستخدام بيانات NIST SRE للأعوام 2004، 2006، 2008، وأظهرت نتائج إيجابية في دقة تحديد الهوية.

Critical review

دراسة نقدية: تعتبر هذه الورقة البحثية خطوة مهمة في مجال تحديد هوية المتحدث باستخدام الصوت، إلا أن هناك بعض النقاط التي يمكن تحسينها. أولاً، على الرغم من أن الطريقة تعتمد على متطلبات منخفضة لجودة الإشارة الصوتية، إلا أن هناك حاجة لمزيد من الاختبارات في بيئات مختلفة وظروف تسجيل متنوعة للتأكد من فعالية الطريقة في جميع الحالات. ثانياً، الورقة تركز بشكل كبير على الجانب التقني دون التطرق بشكل كافٍ إلى التطبيقات العملية والتحديات التي قد تواجهها في الاستخدام الفعلي. ثالثاً، يمكن تحسين الورقة بإضافة مقارنة مفصلة مع تقنيات أخرى مشابهة لتوضيح الفروق والميزات بشكل أوضح. وأخيراً، قد يكون من المفيد تقديم تحليل أعمق للأخطاء التي تحدث أثناء عملية تحديد الهوية وكيفية تقليلها.

Questions related to the research

ما هي الطريقة المستخدمة لتحديد هوية المتحدث في هذه الورقة؟

الطريقة المستخدمة هي طريقة النص المستقل لتحديد هوية المتحدث باستخدام صوته، وتعتمد على استخراج الميزات من الإشارة الصوتية وبناء نموذج صوتي باستخدام نموذج خليط غاوس (GMM).
ما هي الميزات التي تعتمد عليها الطريقة المقترحة في تحديد هوية المتحدث؟

تعتمد الطريقة على الميزات المستخرجة من التنبؤ الخطي لسلوك دالة الترابط الذاتي لسبستروم الإشارة الصوتية.
ما هي البيانات المستخدمة لاختبار الطريقة المقترحة؟

تم استخدام بيانات NIST SRE للأعوام 2004، 2006، 2008 لاختبار الطريقة المقترحة.
ما هي النتائج التي توصلت إليها الدراسة بشأن دقة الطريقة المقترحة؟

أظهرت الدراسة أن الطريقة المقترحة تتمتع بدقة عالية وكافية لتحديد هوية المتحدث باستخدام الصوت بشكل مستقل عن النص، مقارنة بالنتائج العالمية في هذا المجال.

Keywords

تحديد هوية المتحدث النص المستقل سبستروم الإشارة الصوتية نموذج خليط غاوس التنبؤ الخطي دالة الترابط الذاتي

References used

REYNOLDS, D, 1994 Experimental evaluation of features for robust speaker identification. IEEE Trans. On Speech and Audio Processing. Vol. 2. No. 4, 639–643

BIMBOT, F, A, 2004 tutorial on text-independent speaker verification. EURASIP J. on Applied Signal Processing. No. 4, 430–451

REYNOLDS, D; ROSE, R, 1995 Robust text-independent speaker identification using Gaussian mixture speaker models. IEEE Trans. On Speech and Audio Processing. No. 3, 72–83

rate research

1461 - Damascus University 2012 ورقة بحثية

The analysis of time series data is one of the most important statistical topics, usually focuses on forecasting the future behavior of the series at a certain time for certain purposes.

ACF PACF Autoregressive model AR Moving Average model Autoregressive-Moving Average model

Evaluation of Vertical Handover Performance between WiFi and WiMax Networks using Media Independent Handover

1749 - Tishreen University 2016 ورقة بحثية

The current researches are moving towards more development in order to provide the growing the needs of users such as support real-time applications, quality of service, particularly; the high data rate transfer and other. That prompts the network service providers to integrate many properties for different networks resource, and support providing the service "anywhere and anytime". Hence, the importance of this research, which aims to study the vertical handover as very important and necessary step to provide the mobility of mobile nodes between the different networks by using Media Independent Handover (MIH) IEEE802.21standard which is developed in January 2009. In this paper, the performance of vertical handover between these two networks is evaluated taking into account many parameters such as packet loss, handover latency, and throughput, using NS2 simulator (Network Simulator version2) which includes a support for MIH technology by the National Institute of Standard and Technology (NIST).

التسليم المستقل عن الوسط التسليم الشاقولي الشبكات اللاسلكية المحلية الشبكات عريضة الحزمة EEE802.21Standard MIH (Vertical Handover (VHO WiFi WiMax المزيد..

تحديد هوية بكتيريا Lactococcus Lactis في جبن الغنم السوري باستخدام تقنيتي الـ FTIR و PCR

1027 - Damascus University 2015 رسالة ماجستير

أجريت الدراسة في مخابر كلية الزراعة قسم علوم الاغذية ومخابر الميكروبيولوجيا والمناعيات بقسم البيولوجيا الجزيئية والتقانة الحيوية بهيئة الطاقة الذرية.

علوم الاغذية جبن الغنم السوري Lactococcus Lactis

BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text

760 - Association for Computation Linguistics 2021 مقالة

With the growing popularity of smart speakers, such as Amazon Alexa, speech is becoming one of the most important modes of human-computer interaction. Automatic speech recognition (ASR) is arguably the most critical component of such systems, as erro rs in speech recognition propagate to the downstream components and drastically degrade the user experience. A simple and effective way to improve the speech recognition accuracy is to apply automatic post-processor to the recognition result. However, training a post-processor requires parallel corpora created by human annotators, which are expensive and not scalable. To alleviate this problem, we propose Back TranScription (BTS), a denoising-based method that can create such corpora without human labor. Using a raw corpus, BTS corrupts the text using Text-to-Speech (TTS) and Speech-to-Text (STT) systems. Then, a post-processing model can be trained to reconstruct the original text given the corrupted input. Quantitative and qualitative evaluations show that a post-processor trained using our approach is highly effective in fixing non-trivial speech recognition errors such as mishandling foreign words. We present the generated parallel corpus and post-processing platform to make our results publicly available.

النماذج المدربة مسبقا amazon alexa back transcription الأمازون اليكسا النسخ الخلفي صناعة حمض الفوسفور

Devil's Advocate: Novel Boosting Ensemble Method from Psychological Findings for Text Classification

429 - Association for Computation Linguistics 2021 مقالة

We present a new form of ensemble method--Devil's Advocate, which uses a deliberately dissenting model to force other submodels within the ensemble to better collaborate. Our method consists of two different training settings: one follows the convent ional training process (Norm), and the other is trained by artificially generated labels (DevAdv). After training the models, Norm models are fine-tuned through an additional loss function, which uses the DevAdv model as a constraint. In making a final decision, the proposed ensemble model sums the scores of Norm models and then subtracts the score of the DevAdv model. The DevAdv model improves the overall performance of the other models within the ensemble. In addition to our ensemble framework being based on psychological background, it also shows comparable or improved performance on 5 text classification tasks when compared to conventional ensemble methods.

devil advocate boosting ensemble method psychological findings دافع الشيطان طريقة تعزيز الفرقة النتائج النفسية صناعة حمض الفوسفور المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

طريقة النص المستقل لتحديد هوية المتحدث باستخدام صوته

Ask ChatGPT about the research

In this paper, the text-independent method of person voice identification based on the features extraction from speech signal that characterize the linear prediction of the behavior of the autocorrelation function of the voice signal cepstrum are considered and developed.

Read More

suggested questions