Research papers, master and doctoral theses about خطاب آلي متعدد اللغات

A ResNet-50-Based Convolutional Neural Network Model for Language ID Identification from Speech Recordings

190 - Association for Computation Linguistics 2021 مقالة

This paper describes the model built for the SIGTYP 2021 Shared Task aimed at identifying 18 typologically different languages from speech recordings. Mel-frequency cepstral coefficients derived from audio files are transformed into spectrograms, whi ch are then fed into a ResNet-50-based CNN architecture. The final model achieved validation and test accuracies of 0.73 and 0.53, respectively.

خطاب آلي متعدد اللغات neural network model convolutional neural التنافيل الشبكة العصبية نموذج الشبكة العصبية التنافيل العصبي صناعة حمض الفوسفور المزيد..

Language ID Prediction from Speech Using Self-Attentive Pooling

382 - Association for Computation Linguistics 2021 مقالة

This memo describes NTR-TSU submission for SIGTYP 2021 Shared Task on predicting language IDs from speech. Spoken Language Identification (LID) is an important step in a multilingual Automated Speech Recognition (ASR) system pipeline. For many low-re source and endangered languages, only single-speaker recordings may be available, demanding a need for domain and speaker-invariant language ID systems. In this memo, we show that a convolutional neural network with a Self-Attentive Pooling layer shows promising results for the language identification task.

automated speech recognition prediction from speech multilingual automated speech اعتراف الكلام الآلي التنبؤ من الكلام خطاب آلي متعدد اللغات صناعة حمض الفوسفور المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد