Advanced search powered by artificial intelligence

New community

Subscribe to the gold package and get unlimited access to Shamra Academy

A Study of The Effectiveness and Sound Quality in Audio Compression Algorithms

دراسة فعالية و جودة الصوت في خوارزميات ضغط الصوت

2686 2 90 0 ( 0 )

Download Cite

Added by Tishreen University ورقة بحثية

Publication date 2016

fields Communication Engineering

and research's language is العربية

Authors فاتن فاروسي( باحث )

Created by Shamra Editor

تحويل التجب المتقطع التعديل النبضي المرمز معدل أخذ العينات خوارزمية MPEG (Pulse code modulation (PCM Sample rate MPEG (Moving Pictures Experts Groups) Algorithm (Discrete Cosine Transform (DCT

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

The sound is an essential component of multimedia, and due to the needto be used in many life applications such as television broadcasting andcommunication programs, so it was necessary for the existence of audio signal processing techniquessuch as compressing, improving, and noisereduction. Data compression process aims to reduce the bit rate used, by doing encoding information using fewer bits than the original representation for transmitting and storing. By this process,the unnecessary information is determined and removed, that means it gives the compressed information for useable compression, which we need as a fundamental, not the minutest details. This research aims to study how to process sound and musical signal. It's a process that consists of a wide range of applications like coding and digital compression for the effective transport and storage on mobile phones and portable music players, modeling and reproduction of the sound of musical instruments and music halls and the harmonics of digital music, editing digital music, and classification of music content, and other things.

Artificial intelligence review:

Upgrade your account to view the content

Research summary

تتناول هذه الدراسة فعالية وجودة الصوت في خوارزميات ضغط الصوت المختلفة، حيث يعتبر الصوت جزءًا أساسيًا من الوسائط المتعددة ويستخدم في العديد من التطبيقات مثل البث التلفزيوني وبرامج التواصل. تهدف عملية ضغط البيانات إلى تقليل معدل البتات المستخدمة عن طريق ترميز المعلومات باستخدام عدد أقل من البتات من التمثيل الأصلي، مما يؤدي إلى إزالة المعلومات غير الضرورية. يهدف البحث إلى دراسة كيفية معالجة الصوت والإشارة الموسيقية من خلال تطبيقات متعددة مثل الترميز والضغط الرقمي والنقل الفعال والتخزين على الهواتف المحمولة ومشغلات الموسيقى المحمولة. كما يتناول البحث نمذجة واستنساخ صوت الآلات الموسيقية وقاعات الموسيقى وتوافقيات الموسيقى الرقمية وتحرير الموسيقى الرقمية وتصنيف محتوى الموسيقى. تتضمن الدراسة أيضًا تحليل وتحسين جودة الصوت باستخدام خوارزميات مختلفة مثل MPEG وتحويل التجب المتقطع (DCT). تم استخدام برنامج MATLAB في تحليل الإشارات الصوتية والموسيقية وتقديم أمثلة عملية على ذلك. توصلت الدراسة إلى أن خوارزمية MP3 هي الأكثر فعالية في ضغط الصوت مع الحفاظ على جودة عالية، بينما تعتبر خوارزمية WAV الأفضل للإنتاج الصوتي والتسجيل الخام.

Critical review

تُعد هذه الدراسة شاملة ومفصلة في تناولها لموضوع ضغط الصوت وتحليل جودة الصوت باستخدام خوارزميات مختلفة. ومع ذلك، يمكن أن تكون الدراسة أكثر فعالية إذا تم تضمين مقارنات عملية بين الخوارزميات المختلفة من خلال تجارب ميدانية واقعية بدلاً من الاعتماد فقط على التحليل النظري والبرمجي. كما أن الدراسة قد تفتقر إلى تحليل تأثير ضغط الصوت على تجربة المستخدم النهائية، وهو جانب مهم يجب مراعاته عند تقييم فعالية وجودة خوارزميات الضغط. بالإضافة إلى ذلك، يمكن أن تكون الدراسة أكثر شمولية إذا تم تضمين المزيد من الخوارزميات الحديثة والتقنيات المتقدمة في مجال ضغط الصوت.

Questions related to the research

ما هي الأهداف الرئيسية لهذه الدراسة؟

تهدف الدراسة إلى تحليل فعالية وجودة الصوت في خوارزميات ضغط الصوت المختلفة، ودراسة كيفية معالجة الصوت والإشارة الموسيقية من خلال تطبيقات متعددة مثل الترميز والضغط الرقمي والنقل الفعال والتخزين على الهواتف المحمولة ومشغلات الموسيقى المحمولة.
ما هي الخوارزمية التي تعتبر الأكثر فعالية في ضغط الصوت وفقًا للدراسة؟

تعتبر خوارزمية MP3 الأكثر فعالية في ضغط الصوت مع الحفاظ على جودة عالية وفقًا للدراسة.
ما هي الأدوات البرمجية التي تم استخدامها في تحليل الإشارات الصوتية والموسيقية في هذه الدراسة؟

تم استخدام برنامج MATLAB في تحليل الإشارات الصوتية والموسيقية وتقديم أمثلة عملية على ذلك.
ما هي بعض التطبيقات التي تتناولها الدراسة في مجال معالجة الصوت والإشارة الموسيقية؟

تتضمن التطبيقات التي تتناولها الدراسة الترميز والضغط الرقمي والنقل الفعال والتخزين على الهواتف المحمولة ومشغلات الموسيقى المحمولة، ونمذجة واستنساخ صوت الآلات الموسيقية وقاعات الموسيقى وتوافقيات الموسيقى الرقمية وتحرير الموسيقى الرقمية وتصنيف محتوى الموسيقى.

Keywords

ضغط الصوت جودة الصوت خوارزميات الضغط MPEG تحويل التجب المتقطع (DCT) نمذجة الصوت تحليل الإشارة الصوتية MATLAB

References used

MUSMANN, H, G. "Genesis of the MP3 Audio Coding Standard". Transactions on Consumer Electronics. IEEE, 2006

DEHERY, Y, F. "MUSICAM Source Coding". AES 10th International Conference. Kensington, London, England. 1991

GLOVER, I; GRANT, P. "Digital Communications". University of Bradford, University of Edinburgh. Prentice Hall

rate research

Sound Data Compression Method Using Genetic Algorithm

1927 - Damascus University 2005 ورقة بحثية

The principal objective of this research is an adoption of the Genetic Algorithm (GA) for studying it firstly, and to stop over the operations which are introduced from the genetic algorithm.The candidate field for applying the operations of the g enetic algorithm is the sound data compression field. This research uses the operations of the genetic algorithm for the enhancement of the performance of one of the popular compression method. Vector Quantization (VQ) method is selected in this work. After studying this method, new proposed algorithm for mixing the (GA) with this method was constructed and then the required programs for testing this algorithm was written. A good enhancement was recorded for the performance of the (VQ) method when mixed with the (GA). The proposed algorithm was tested by applying it on some sound data files. Some fidelity measures are calculated to evaluate the performance of the new proposed algorithm.

GENETIC ALGORITHM ضغط بيانات الصوت الخوارزمية الوراثية (الجينية) Sound Data Compression

On Generative Spoken Language Modeling from Raw Audio

825 - Association for Computation Linguistics 2021 مقالة

Abstract We introduce Generative Spoken Language Modeling, the task of learning the acoustic and linguistic characteristics of a language from raw audio (no text, no labels), and a set of metrics to automatically evaluate the learned representations at acoustic and linguistic levels for both encoding and generation. We set up baseline systems consisting of a discrete speech encoder (returning pseudo-text units), a generative language model (trained on pseudo- text), and a speech decoder (generating a waveform from pseudo-text) all trained without supervision and validate the proposed metrics with human evaluation. Across 3 speech encoders (CPC, wav2vec 2.0, HuBERT), we find that the number of discrete units (50, 100, or 200) matters in a task-dependent and encoder- dependent way, and that some combinations approach text-based systems.1

spoken language modeling generative spoken language raw audio نمذجة اللغة المنطوقة اللغة المنطوقة التوليدية الصوت الخام صناعة حمض الفوسفور المزيد..

Analysis study about (MFCC and Endpoint) algorithms and the extent of their impact in voice recognition rates

4507 - Tishreen University 2016 ورقة بحثية

Voice recognition includes two basic parts: speech and speaker recognition. These recognition processes consider as the most important processes of modern technologies, many systems has been developed that differ in the methods used to extract feat ures and classification ways to support recognition systems of this type. The study was conducted in this research on the previous subject, where the system is designed to recognize the speaker and his voice orders and focus on several complementary algorithms to carry out the research. we conducted an analytical study on MFCC algorithm used in the extraction of features, and it has been studying two parameters the number of filters in the filters bank and the number of features that taken from each frame and the impact of these two parameters in the recognition rate and the relationship of these two parameters on each other. It was the use of feed forwarding back propagation neural networks performance analysis as characteristics and we analyze the performance of the network to gain access to the best features and components to the process of achieving recognition. And it has been studying Endpoint algorithm that used to remove periods of silence and its impact on voice recognition rates.

Neural network المتكلم الكلام السمات الشبكات العصبية speaker speech feature المزيد..

Propagation of structural Waves ( First Sound ) in Quantum Fermi Plasmas

1572 - Tishreen University 2015 ورقة بحثية

In this study, an Interaction Function between quasi particles has been introduced into an energy formula in Kinetic Equation of Quantum Plasmas. Such new type can be used to study quasi Particles of Quantum Fermi Plasmas, since it contains quantum term that is correlated with Bohm Potential, when the mean inter-particle distance is of the same order as the de-Broglie thermal wavelength. An interaction function between quasi particles has been expressed using spherical Functions in three-dimensional space with Landau’s spread coefficients for ℓ = 0 , 1 , 2 . Using such representation led to obtaining the dispersion relation of the structural waves and its energy Spectrum in balanced local condition. The use of Landau parameters in this study isconsidered new comparing to other studies in this field. It allows us to get more generic and more precise dispersion relations with new previously unknown spectrum energy in Quantum Fermi Plasmas.

نظرية سائل فرمي أشباه الجسيمات تابع التأثير المتبادل بارامترات لانداو بلازما فرمي الكمية جهد بوم Fermi Liquid Theory quasi Particles Interaction Function Landau Parameters Quantum Fermi Plasmas Bohm Potential المزيد..

Using DWT to Include Digital Watermark in Audio

1700 - Aِl-Baath University 2016 ورقة بحثية

In this paper, we propose a new method to embed digital watermarking in audio files, using Discrete Wavelet Transform (DWT) and the way to extract the watermark data. The method efficiency is measured using Peak Signal –to-Noise Ratio (PSNR) , No rmalized Correlation Coefficient (NC). The advantage of our method is the robustness against several attacks and compression.

العلامة المائية الرقمية الصوت الرقمي تحويل الموجة المتقطع Digital Watermarking Digital audio DWT

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

A Study of The Effectiveness and Sound Quality in Audio Compression Algorithms

دراسة فعالية و جودة الصوت في خوارزميات ضغط الصوت

Ask ChatGPT about the research

Read More

suggested questions