Research papers, master and doctoral theses about Data Mining

Comparative Study between R and Rapidminer as Data mining Tools

3099 - Aِl-Baath University 2014 ورقة بحثية

The ability of data mining to provide predictive information derived from huge databases became an effective tool in the hands of companies and individuals، allowing them to focus on areas that are important to them from the massive data generated by the march of their daily lives. Along with the increasing importance of this science there was a rapidly increasing in the tools that produced to implement the theory concepts as fast as possible. So it will be hard to take a decision on which of these tools is the best to perform the desired task. This study provides a comparison between the two most commonly used data mining tools according to opinion polls، namely: Rapidminer and R programming language in order to help researchers and developers to choose the best suited tool for them between the two. Adopted the comparison on seven criteria: platform، algorithms، input/output formats، visualization، user’s evaluation، infrastructure and potential development، and performance by applying a set of classification algorithms on a number of data sets and using two techniques to split data set: cross validation and hold-out to make sure of the results. The Results show that R supports the largest number of algorithms، input/output formats، and visualization. While Rapidminer superiority in terms of ease of use and support for a greater number of platforms. In terms of performance the accuracy of classification models that were built using the R packages were higher. That was not true in some cases imposed by the nature of the data because we did not added any pre-processing stage. Finally the preference option in any tool is depending on the extent of the user experience and purpose that the tool is used for

Information retrieval fuzzy system Clustering Data Mining R Rapidminer Tools التنقيب في البيانات استرجاع البيانات أدوات خوارزمية التصنيف خوارزميات التجميع المزيد..

Impact of Implementation of Data mining on the management of banking operations "Field Study in commercial banks of Jordan"

2365 - Damascus University 2012 ورقة بحثية

This study aimed to indicate the level of interest in the application of concepts and data mining tools in the management of banking operations areas and the interest components of the environment and the application of the concepts of data mining tools in the management of banking operations in commercial banks of Jordan. To achieve these goals, the researcher used the descriptive analytical approach based on the questionnaire distributed to members of the community study. The researcher found that the percentage of interest among members of the community study on the application of the concepts of data mining operations the management of banking, was high in general, where the arithmetic mean is generally equal to (4.005). And that the order of fields that may be seen when you search in the application of the concepts of data mining and addressed by this study, have been of importance and level of interest by the members of the population of the study, as follows: the working environment of knowledge with information technology "has obtained the highest average, was the average the arithmetic of this axis is equal to (4.02), followed by the center of "opportunities to enhance knowledge systems with the development environment systems research and retrieval of data.

Data Mining التنقيب عن البيانات إدارة العمليات المصرفية management of banking operations

Employing Data Mining Algorithms in Traffic Accidents Analyzing

5974 - Tishreen University 2015 ورقة بحثية

In this paper we introduce a comparison for some of data mining algorithm for traffic accidents analysis. We start by describing available data for entry by analyzing the structure of statistical reports in Lattakia traffic directorate, and proceed to data mining stage which enables us to smart study of factors that play roles in traffic accident and find its inter-relations and importance for causing traffic accident. That comes after building data warehouse upon the database we built to store the data we gathered. In this research we list a some of models was tested which is a sample of a many cases we checked to have the research results.

Classification العنقدة Clustering Data Mining التنقيب في البيانات التصنيف حوادث المرور مستودعات البيانات قواعد بيانات معرفية قواعد الاقتران Traffic Accidents Data Warehouse Knowledge Database Association Rules المزيد..

Improving Efficiency Apriori Algorithm by Reduction of candidate itemsets

2595 - Aِl-Baath University 2017 ورقة بحثية

Association Rules is an important field in Data Mining, which is used to discover useful knowledge from a massive databases. Association Rules have been used to extract the information from the database transactions, and Apriori Algorithm is a pra ctical application for Association Rules and it is used to find frequent itemsets from database transactions. In this paper, we present a new improving on Apriori Algorithm by reduction generating of candidate itemsets and this leads to improving efficiency Apriori Algorithm.

Data Mining التنقيب في البيانات Association Rules صفقات قواعد الارتباط خوارزمية الأسبقية مجموعات بنود البيانات المرشحة مجموعات بنود البيانات المتكررة Apriori Algorithm Transactions Candidate Itemsets Frequent Itemsets المزيد..

Using Data Mining To Help Machine In Handwriting Characters Recognition

1469 - Aِl-Baath University 2016 ورقة بحثية

In this research, we offered a new and simple way of Handwriting Characters Recognition. This way extracts positions of the black points from binary images (black, white) according to certain coordinates which are used in the stages of training an d testing. The extracted positions are stored in a database according to appropriate structure for predictive data mining. We used training data to build a predictive model which helps in Recognition testing data depending on the data stored in the database. We have conducted a number of tests on different samples of handwriting character images. We got accurate results, within the required conditions.

Machine learning Data Mining التنقيب في البيانات التعرف البصري على المحارف تعليم الآلة طرق التنقيب في البيانات خوارزمية التنقيب Optical Characters Recognition Data Mining Function Data Mining Algorithms المزيد..

Evaluation of Estimating Number of Clusters Methods in Case of Agglomerative Hierarchical Clustering

1949 - Aِl-Baath University 2016 ورقة بحثية

The main goal of data mining process is to extract information and discover knowledge from huge databases, where the clustering is one of the most important functionalities which can be done in this area. There are many of clustering algorithms an d methods, but determining or estimating the number of clusters which should be extracted from a dataset is one of the most important issues most of these methods encounter it. This research focuses on the problem of estimating number of clusters in the case of agglomerative hierarchical clustering. We present an evaluation of three of the most common methods used in estimating number of clusters.

العنقدة Clustering Artificial intelligence Data Mining التنقيب في البيانات الذكاء الصنعي تقدير عدد العناقيد Estimatimg number of clusters المزيد..

Enhancing Universities’ Libraries Activities Using Data Mining and Business Intelligence Techniques

2086 - Aِl-Baath University 2017 ورقة بحثية

Through this study we will explain the application of data mining and business intelligence using the data existed in the library of the Arab International University. This data has been linked to the data of the students on the academic system of the university. The study will also answer questions that affect the work of the educational institution in general and the library in particular, propose solutions to improve the work of the library and its services, enhance library working methods, and specify indicators related to the role of information resources in the educational operation.

Information retrieval Data Mining Data Warehouse نظم دعم القرار Decision Support Systems استرجاع المعلومات التنقيب في المكتبات مستودع المعطيات التنقيب في المعطيات القياسات المكتبية ذكاء الأعمال Bibliomining Bibliometric Business Intelligence المزيد..

A New Algorithm for Data Clustering and Enhancing K-Means Algorithm

2909 - Aِl-Baath University 2016 ورقة بحثية

This paper introduces a new algorithm to solve some problems that data clustering algorithms such as K-Means suffer from. This new algorithm by itself is able to cluster data without the need of other clustering algorithms.

العنقدة Clustering Centroid المركز Data Mining البيانات الكائنات النقاط المنعزلة مربع الخطأ Objects Isolated Points Square Error المزيد..

Using Data Mining Techniques to Analyze Students’ Performance

3187 - Aِl-Baath University 2016 ورقة بحثية

The objective of this paper is presenting an educational data mining case study, by applying data mining techniques.

Data Mining التنقيب في البيانات التنقيب في بيانات المؤسسات التعليمية برنامج ذكاء الأعمال الخاص بشركة مايكروسوفت خوارزمية مايكروسوفت لأشجار القرار خوارزمية مايكروسوفت للتجميع و الكشف عن الحالات الشاذة خوارزمية مايكروسوفت لقواعد الارتباط Educational Data Mining SQL Server Business Intelligence Development Studio Microsoft Decision Trees Microsoft Clustering Outlier Detection Microsoft Association Rules المزيد..

Design An Intelligent Decision Support System Prototype for Human Resources Management

2072 - Syrian Virtual University 2014 رسالة ماجستير

The Research Aims: Syrian organizations keep large amounts of information and data about their personnel in their IT systems. This information, however, is often left unutilized or may be analyzed through statistical methods. In this study, DM is considered a solution for analyzing HR data and explore knowledge from data stored in some Syrian organization through two major stages: Stage A: Using results of Semi-Annual performance evaluation process to build prototype showed in (Fig. 6) to accomplish two tasks: 1. Building a models to predict appropriate job function for an employee through majority principle and using high accuracy result to increase the number of training data and make it self-learning model. 2. Choose most important attributes that used in classify methods to use it in personnel selection and recruitment. Stage B: Using data of Time & Attendance to analysis personnel activity through clustering methods and building many meaningful groups.

Artificial intelligence Data Mining نظم دعم القرار Knowledge Discovery الذكاء الصنعي التنقيب في المعطيات استكشاف المعرفة إدارة الموارد البشرية Human Resource Management Decision Support System المزيد..

يمكنك البدء بجني المال وتحقيق ربح مادي من أبحاثك العلمية، المزيد