New community

Subscribe to the gold package and get unlimited access to Shamra Academy

A New Algorithm for Data Clustering and Enhancing K-Means Algorithm

خوارزمية جديدة لعنقدة البيانات و تحسين خوارزمية الK-Means

3217 6 69 0 ( 0 )

Download Cite

Added by Aِl-Baath University ورقة بحثية

Publication date 2016

and research's language is العربية

Authors وائل علي( باحث ) - ريما القمحة( باحث )

Created by Shamra Editor

العنقدة Clustering Centroid المركز Data Mining البيانات الكائنات النقاط المنعزلة مربع الخطأ Objects Isolated Points Square Error

visit our facebook page

‎Shamra Academia - شمرا أكاديميا‎

Ask ChatGPT about the research

Abstract in Arabic Abstract in English

This paper introduces a new algorithm to solve some problems that data clustering algorithms such as K-Means suffer from. This new algorithm by itself is able to cluster data without the need of other clustering algorithms.

Artificial intelligence review:

Upgrade your account to view the content

Research summary

تقدم هذه الورقة البحثية خوارزمية جديدة تهدف إلى حل بعض المشاكل التي تعاني منها خوارزميات عنقدة البيانات مثل خوارزمية الـK-Means. تعتمد الخوارزمية الجديدة على حساب قيمة مناسبة للثابت α، الذي يمثل نصف قطر الجوار حيث يجب أن تتواجد الكائنات (النقاط) التي تنتمي للعنقود. يمكن استخدام الخوارزمية الجديدة لتحديد العدد المناسب من العناقيد وتحديد المراكز الابتدائية بشكل آلي، مما يقلل من الخطأ وعدد التكرارات في خوارزمية الـK-Means. تم تقديم مقارنة بين أداء خوارزمية الـK-Means الأصلية وخوارزمية الـK-Means بعد إدخال خرج الخوارزمية الجديدة، حيث أظهرت النتائج تقليل الخطأ وعدد التكرارات.

Critical review

دراسة نقدية: تعتبر هذه الورقة البحثية مساهمة قيمة في مجال عنقدة البيانات، حيث تقدم خوارزمية جديدة تحل بعض المشاكل الشائعة في خوارزمية الـK-Means. ومع ذلك، يمكن تحسين الورقة من خلال تقديم تحليل أعمق للأداء الزمني للخوارزمية الجديدة مقارنة بالخوارزميات الأخرى. كما يمكن تضمين تجارب إضافية على مجموعات بيانات متنوعة للتحقق من فعالية الخوارزمية في سياقات مختلفة. بالإضافة إلى ذلك، يمكن توضيح كيفية تحديد الثابت α بشكل أكثر تفصيلاً لتسهيل تطبيق الخوارزمية من قبل الباحثين الآخرين.

Questions related to the research

ما هي المشكلة الرئيسية التي تحاول الخوارزمية الجديدة حلها في خوارزمية الـK-Means؟

تحاول الخوارزمية الجديدة حل مشاكل التحديد المسبق لعدد العناقيد والتحديد العشوائي للمراكز الابتدائية في خوارزمية الـK-Means.
كيف يتم تحديد نصف قطر الجوار α في الخوارزمية الجديدة؟

يتم تحديد نصف قطر الجوار α من خلال حساب مصفوفة الأبعاد D بين كل نقطتين مختلفتين واستخدام العلاقة الرياضية المناسبة لحساب α.
ما هي الفائدة الرئيسية من استخدام الخوارزمية الجديدة مع خوارزمية الـK-Means؟

الفائدة الرئيسية هي تقليل الخطأ وعدد التكرارات في خوارزمية الـK-Means من خلال تحديد العدد المناسب من العناقيد والمراكز الابتدائية بشكل آلي.
هل تم اختبار الخوارزمية الجديدة على مجموعات بيانات متنوعة؟

نعم، تم اختبار الخوارزمية الجديدة على مجموعات بيانات مختلفة وأظهرت النتائج فعالية الخوارزمية في تقليل الخطأ وعدد التكرارات.

Keywords

عنقدة البيانات خوارزمية K-Means تنقيب البيانات النقاط المنعزلة مربع الخطأ المراكز الابتدائية

References used

HAN, J, AND KAMBER, M. 2006- Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, New Delhi, (2nd ed), 772p

RAUF, A, SHEEBA, S, KHUSOR, S, AND JAVED, H.2012- Enhanced K-Mean Clustering Algorithm to Reduce Number of Iterations and Time Complexity, Middle-East Journal of Scientific Research, Pakistan

ALARBEA, A, SENTHEKUMAR, H, AND BADER, A. 2013- Enhancing K-Means Algorithm with Initial Cluster Centers Derived from Data Partitioning along the Data Axis with PCA, Journal of Advances in Computer Networks, Vol. 1, No. 2, June 2013

rate research

Improve K-Means Algorithm

6719 - Aِl-Baath University 2014 ورقة بحثية

The algorithm classifies objects to a predefined number of clusters, which is given by the user (assume k clusters). The idea is to choose random cluster centers, one for each cluster. These centers are preferred to be as far as possible from each ot her. Starting points affect the clustering process and results. Here the Centroid initialization plays an important role in determining the cluster assignment in effective way. Also, the convergence behavior of clustering is based on the initial centroid values assigned. This research focuses on the assignment of cluster centroid selection so as to improve the clustering performance by K-Means clustering algorithm. This research uses Initial Cluster Centers Derived from Data Partitioning along the Data Axis with the Highest Variance to assign for cluster centroid.

fuzzy system facial expression خوارزمية التقسيم العنقدة Clustering Centroid K-Means المركز المزيد..

Modifying Mountain Clustering Algorithm and Using It to Enhance the Performance of Fuzzy C-Means Algorithm

1581 - Aِl-Baath University 2017 ورقة بحثية

In this paper, we introduce a modification to fuzzy mountain data clustering algorithm. We were able to make this algorithm working automatically, through finding a way to divide the space, to determine the values of the input parameters, and the stop condition automatically, instead of getting them by the user.

cost function مصفوفة العضوية دالة الكثافة خوارزمية عنقدة ضبابية دالة الكلفة وسطاء الدخل Membership Matrix Mountain Function Fuzzy Clustering Algorithm Input Parameters المزيد..

Modifying Mountain Clustering Algorithm and Using It to Enhance the Performance of Fuzzy C-Means Algorithm

1744 - Aِl-Baath University 2017 ورقة بحثية

A New Hybrid Digital Signature Algorithm with high Security and Performance

1044 - Aِl-Baath University 2016 ورقة بحثية

The majority of recent digital signature algorithms depend, in their structure, on complicated mathematical concepts that require a long time and a significant computational effort to be executed. As a trial to reduce these problems, some researchers have proposed digital signature algorithms which depend on simple arithmetic functions and operations that are executed quickly, but that was at the expense of the security of algorithms.

التوقيع الرقمي Digital Signature التعمية بالمفتاح العام خوارزمية التوقيع الرقمي خوارزمية تبادل المفاتيح public key encryption RSA digital signature algorithm Diffie-Hellman key exchange algorithm المزيد..

An Algorithm for Continuously Edge Coloring a Set of Graphs

1689 - Aِl-Baath University 2016 ورقة بحثية

As it’s known, The Graph k-Colorability Problem (GCP) is a wellknown NP-Hard Problem. This problem consists in finding the k minimum number of colors to paint the vertices of a graph in such a way that any two adjoined vertices, which are connecte d by an edge, have always different colors. In another words how can we color the edges of a graph in such a way that any two edges joined by a vertex have always different colors? In this paper we introduce a new effective algorithm for coloring the edges of the graph. Our proposed algorithm enables us to achieve a Continuously Edge Coloring (CEC) for a set of known graphs.

البيان Graph مسألة تلوين البيان التلوين الضلعي خوارزمية تلوين بيان التلوين الضلعي المستمر Graph Coloring Problem Edge Coloring Graph Graph Coloring Algorithm Continuously Edge Coloring المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد

A New Algorithm for Data Clustering and Enhancing K-Means Algorithm

خوارزمية جديدة لعنقدة البيانات و تحسين خوارزمية الK-Means

Ask ChatGPT about the research

This paper introduces a new algorithm to solve some problems that data clustering algorithms such as K-Means suffer from. This new algorithm by itself is able to cluster data without the need of other clustering algorithms.

Read More

suggested questions