New community

Subscribe to the gold package and get unlimited access to Shamra Academy

Big Data with Machine Learning

علم البيانات الضخمة مع تقنيات التعلم الآلي

2861 6 226 0 ( 0 )

Download Cite

Added by Damascus University حلقة بحث

Publication date 2018

and research's language is العربية

Authors م. محمد فادي تقي الدين( مشرف ) - رنيم يد( طالب ) - رفيف طه( طالب ) - عبدالله أبو عمشة( طالب ) - أسما نحاس( طالب )

Created by asma nahas

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

In recent years, time-critical processing or real-time processing and analytics of bid data have received a significant amount of attentions. There are many areas/domains where real-time processing of data and making timely decision can save thousands of human lives, minimizing the risks of human lives and resources, enhance the quality of human lives, enhance the chance of profitability, efficient resources management etc. This paper has presented such type of real-time big data analytic applications and a classification of those applications. In addition, it presents the time requirements of each type of these applications along with its significant benefits. Also, a general overview of big data to describe a background knowledge on this scope.

Artificial intelligence review:

Upgrade your account to view the content

Research summary

تتناول هذه الورقة البحثية موضوع البيانات الكبيرة وتعلم الآلة، حيث تركز على كيفية معالجة وتحليل البيانات الكبيرة في الوقت الحقيقي. تبدأ الورقة بمقدمة توضح النمو السريع في حجم البيانات على مستوى العالم وأهمية تحليل هذه البيانات لاتخاذ قرارات فعالة. تستعرض الورقة خصائص البيانات الكبيرة مثل الحجم، السرعة، التنوع، الإيذائية، والقيمة. كما تناقش التحديات المرتبطة بمعالجة البيانات الكبيرة مثل جودة البيانات، الاكتشاف، التخزين، التحليلات، الأمن، ونقص الموهبة. تقدم الورقة نظرة عامة على تقنيات معالجة البيانات الكبيرة مثل Hadoop، وتشرح كيفية استخدام HDFS وYARN وMapReduce في معالجة البيانات. كما تستعرض الورقة خطوات التعلم الآلي وكيفية تطبيقها على البيانات الكبيرة، وتوضح العلاقة بين البيانات الكبيرة وتعلم الآلة. تختتم الورقة بتطبيقات البيانات الكبيرة في مجالات مختلفة مثل الرعاية الصحية، التصنيع، وسائل الإعلام والترفيه، إنترنت الأشياء، والحكومة، وتوضح كيف يمكن لهذه التطبيقات تحسين الكفاءة والإنتاجية وتقليل التكاليف.

Critical review

دراسة نقدية: تعتبر هذه الورقة البحثية شاملة ومفصلة في تناولها لموضوع البيانات الكبيرة وتعلم الآلة، حيث تقدم معلومات قيمة حول خصائص البيانات الكبيرة والتحديات المرتبطة بها. ومع ذلك، يمكن تحسين الورقة من خلال تقديم أمثلة عملية أكثر توضيحاً لكيفية تطبيق التقنيات المذكورة في سياقات حقيقية. كما أن الورقة تفتقر إلى مناقشة بعض الجوانب الأخلاقية المتعلقة باستخدام البيانات الكبيرة، مثل الخصوصية والأمان. بالإضافة إلى ذلك، يمكن تعزيز الورقة بإضافة دراسات حالة إضافية توضح الفوائد الفعلية لتطبيقات البيانات الكبيرة في مختلف المجالات.

Questions related to the research

ما هي الخصائص الخمس التي تحدد البيانات الكبيرة؟

الخصائص الخمس هي الحجم، السرعة، التنوع، الإيذائية، والقيمة.
ما هي التحديات الرئيسية في معالجة البيانات الكبيرة؟

التحديات الرئيسية تشمل جودة البيانات، الاكتشاف، التخزين، التحليلات، الأمن، ونقص الموهبة.
كيف يساعد Hadoop في معالجة البيانات الكبيرة؟

Hadoop يساعد من خلال نظام الملفات الموزعة HDFS لتخزين البيانات الكبيرة ومعالجتها بالتوازي باستخدام YARN وMapReduce.
ما هي بعض التطبيقات العملية للبيانات الكبيرة في مختلف المجالات؟

تشمل التطبيقات العملية في مجالات الرعاية الصحية، التصنيع، وسائل الإعلام والترفيه، إنترنت الأشياء، والحكومة.

Keywords

البيانات الكبيرة تعلم الآلة Hadoop تحليل البيانات التعلم الآلي التطبيقات العملية التحديات الخصائص

References used

https://www.edureka.co/blog/big-data-tutorial

https://www.coursera.org/learn/big-data-machine-learning

https://www.edureka.co/blog/big-data-applications-revolutionizing-various-domains/

https://docs.microsoft.com/en-us/sql/analysis-services/data-mining/discretizationmethods-data-mining?view=sql-analysis-services-2017

Salvador Garcı´a, Julia´n Luengo, Jose´ Antonio Sa´ ez, Victoria Lo´ pez, and Francisco Herrera A Survey of Discretization Techniques: Taxonomy and Empirical Analysis in Supervised Learning

Sergio Ramírez-Gallego, Salvador García Data discretization: taxonomy and big data challenge

Usama Fayyad,Keki Irani Multi interval discretization of continuous attributes for classification learning

https://www.edureka.co/blog/hadoop-tutorial/

rate research

Improving Decision Support Systems in Education Systems Using Data Mining and Machine Learning Techniques

1393 - Tishreen University 2021 ورقة بحثية

Educational data mining aims to study the available data in the educational field and extract the hidden knowledge from it in order to benefit from this knowledge in enhancing the education process and making successful decisions that will improve th e student’s academic performance. This study proposes the use of data mining techniques to improve student performance prediction. Three classification algorithms (Naïve Bayes,J48, Support Vector Machine) were applied to the student performance database, and then a new classifier was designed to combine the results of those individual classifiers using Voting Method. The WEKA tool was used, which supports a lot of data mining algorithms and methods. The results show that the ensemble classifier has the highest accuracy for predicting students' levels compared to other classifiers, as it has achieved a recognition accuracy of 74.8084%. The simple k-means clustering algorithm was useful in grouping similar students into separate groups, thus understanding the characteristics of each group, which helps to lead and direct each group separately.

العنقدة التنقيب عن البيانات خوارزميات العنقدة Ensemble Voting Method WEKA ويكا المزيد..

Data model Vs Simulation model in big data

1167 - Damascus University 2019 مقالة

حظي مؤخرا اختصاص البيانات الضخمة باهتمام كبير في مجالات متنوعة منها (الطب , العلوم , الادارة, السياسة , ......) و يهتم هذا الاختصاص بدراسة مجموعة البيانات الضخمة والتي تعجز الادوات والطرق الشائعة على معالجتها و ادارتها و تنظيمها خلال فترة زمنية مقبو لة و بناء نموذج للتعامل مع هذه المعطيات والتنبؤ باغراض مطلوبة منها. ولاجراء هذه الدراسات ظهرت طرق عدة منها النماذج التي تعتمد على مجموعة من البيانات و نماذج تعتمد على المحاكاة و في هذه المقالة تم توضيح الفرق بين النموذجين و تطبيق نهج جديد يعتمد على التكامل بين النموذجين لاعطاء نموذح افضل لمعالجة مسالة البيوت البلاستيكة

To Block or not to Block: Experiments with Machine Learning for News Comment Moderation

347 - Association for Computation Linguistics 2021 مقالة

Today, news media organizations regularly engage with readers by enabling them to comment on news articles. This creates the need for comment moderation and removal of disallowed comments -- a time-consuming task often performed by human moderators. In this paper we approach the problem of automatic news comment moderation as classification of comments into blocked and not blocked categories. We construct a novel dataset of annotated English comments, experiment with cross-lingual transfer of comment labels and evaluate several machine learning models on datasets of Croatian and Estonian news comments. Team name: SuperAdmin; Challenge: Detection of blocked comments; Tools/models: CroSloEn BERT, FinEst BERT, 24Sata comment dataset, Ekspress comment dataset.

أخبار سلوفينية كوربوس comment moderation comment حاجز تعليق الاعتدال تعليق صناعة حمض الفوسفور المزيد..

Develop a Software Library to Test Big Data Applications in a Simulated Hadoop Environment

1939 - Aِl-Baath University 2017 ورقة بحثية

Through our study, the HadoopOperationTesting software library has been developed to provide Big Data applications labs with a mechanism to test their applications in a simulated environment for the Hadoop environment with a similar mechanism to test traditional applications using the JUnit library.

اختبار Tools أدوات تطوير البيانات الضخمة مكتبة برمجية للاختبار Big Data Testing API for Testing Developing المزيد..

Artificial intelligence techniques for the development of statistical Automatic learning

1514 - جامعة البصرى 2020 مقالة

The word "massive data" spread in 2017 and became the most common in the industry of advanced technology, it uses automated learning that allows computers to analyze past data and predict future data widely in familiar places. Non-automated learning professionals can use it too. To study the analytical method of statistical Automatic learning, it is necessary to identify the concept of artificial intelligence and its main classification and analytical techniques included and represent in automatic learning and deep learning. Automatic learning has developed thanks to some breakthroughs in artificial intelligence. It is an awareness of the efficient teaching of computers in addition to the invention of the Internet. Neural networks have an important role to play in teaching computers, such as humans, where they use data they can access to make decisions. There are many algorithms for learning about automatic learning. In our study, we demonstrate the methods and applications of automated statistical analysis, such as regression analysis, decision tree, middle method k and association analysis.

تكنولوجيا التعليم الذكاء الاصطناعي الشبكات العصبية التعلم الآلي التعليم الالكتروني التحليل الإحصائي

comments

Fetching comments

Tartous University

Additional details More universities

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

Big Data with Machine Learning

علم البيانات الضخمة مع تقنيات التعلم الآلي

Ask ChatGPT about the research

Read More