SBKMMA: Sorting Based K Means and Median Based Clustering Algorithm Using Multi Machine Technique for Big Data
Keywords:
KMeans, BigData, Clustering, Sorting.Abstract
Big data analytics examines large amounts of data to uncover hidden patterns, correlations and other insights. Clustering is the task of dividing the population or data points into a number of groups such that data points in the same groups are more similar to other data points in the same group than those in other groups. KMeans is a traditional partition algorithm which is simple and popularly used. This algorithm has disadvantages such as to identify K clusters, initial allocation etc. In this paper we mainly focus on the initial centroids and improving the efficiency by reducing the number of iterations. Sorting based KMeans algorithm and Sorting based KMedian algorithm are enhanced form of KMeans algorithm where the data are sorted and uses KMeans algorithm. The proposed algorithm focuses on the initial centroid selection with the help of sorting. Here the centroids are default assigned to the objects in the beginning after sorting.
References
M. K.Kakhani, S. Kakhani and S. R.Biradar, Research issues in big data analytics, International Journal of Application or Innovation in Engineering & Management, 2(8) (2015), pp.228-232.
Nirali Honest and Atul Patel A SURVEY OF BIG DATA ANALYTICS, International Journal of Information Sciences and Techniques (IJIST) Vol.6, No.1/2, March 2016
Mahima Jane and Dr. E. George Dharma Prakash Raj “SBKMA : Sorting based K- Means Clustering Algorithm using Multi Machine Technique for Big Data “ in the International Journal of Control Theory and Applications Volume 8 2015pp 2105- 2110
Mahima Jane and Dr. E. George Dharma Prakash Raj “SBKMEDA : Sorting based K- Median Clustering Algorithm using Multi Machine Technique for Big Data “ accepted for Advances in Intelligent Systems and Computing.
. Anand M. Baswade, Prakash S. Nalwade2,”Selection of initial centroids for K-Means Algorithm” IJCSMC, Vol. 2, Issue. 7, July 2013, pg.161 –164
. Aleta C. Fabregas, Bobby D. Gerardo, Bartolome T. Tanguilig III,"Enhanced Initial Centroids for K-means Algorithm", International Journal of Information Technology and Computer Science(IJITCS), Vol.9, No.1, pp.26-33, 2017. DOI: 10.5815/ijitcs.2017.01.04
Daljit Kaur and Kiran Jyot, “Enhancement in the Performance of K-means Algorithm”, International Journal of Computer Science and Communication Engineering, Volume 2 Issue 1, 2013.
K.Rajalakshmi,, Dr.S.S.Dhenakaran,N.Roobin “Comparative Analysis of K-Means Algorithm in Disease Prediction”, International Journal of Science, Engineering and Technology Research (IJSETR), Volume 4, Issue 7, July 2015
Downloads
Published
How to Cite
Issue
Section
License
Authors who submit papers with this journal agree to the following terms.