SBKMMA: Sorting Based K Means and Median Based Clustering Algorithm Using Multi Machine Technique for Big Data

E. Mahima Jane, Dr. E. George Dharma Prakash Raj


Big data analytics examines large amounts of data to uncover hidden patterns, correlations and other insights. Clustering is the task of dividing the population or data points into a number of groups such that data points in the same groups are more similar to other data points in the same group than those in other groups. KMeans is a traditional partition algorithm which is simple and popularly used. This algorithm has disadvantages such as to identify K clusters, initial allocation etc. In this paper we mainly focus on the initial centroids and improving the efficiency by reducing the number of iterations. Sorting based KMeans algorithm and Sorting based KMedian algorithm are enhanced form of KMeans algorithm where the data are sorted and uses KMeans algorithm. The proposed algorithm focuses on the initial centroid selection with the help of sorting. Here the centroids are default assigned to the objects in the beginning after sorting. 


KMeans;BigData; Clustering; Sorting.

Full Text:



M. K.Kakhani, S. Kakhani and S. R.Biradar, Research issues in big data analytics, International Journal of Application or Innovation in Engineering & Management, 2(8) (2015), pp.228-232.

Nirali Honest and Atul Patel A SURVEY OF BIG DATA ANALYTICS, International Journal of Information Sciences and Techniques (IJIST) Vol.6, No.1/2, March 2016

Mahima Jane and Dr. E. George Dharma Prakash Raj “SBKMA : Sorting based K- Means Clustering Algorithm using Multi Machine Technique for Big Data “ in the International Journal of Control Theory and Applications Volume 8 2015pp 2105- 2110

Mahima Jane and Dr. E. George Dharma Prakash Raj “SBKMEDA : Sorting based K- Median Clustering Algorithm using Multi Machine Technique for Big Data “ accepted for Advances in Intelligent Systems and Computing.

. Anand M. Baswade, Prakash S. Nalwade2,”Selection of initial centroids for K-Means Algorithm” IJCSMC, Vol. 2, Issue. 7, July 2013, pg.161 –164

. Aleta C. Fabregas, Bobby D. Gerardo, Bartolome T. Tanguilig III,"Enhanced Initial Centroids for K-means Algorithm", International Journal of Information Technology and Computer Science(IJITCS), Vol.9, No.1, pp.26-33, 2017. DOI: 10.5815/ijitcs.2017.01.04

Daljit Kaur and Kiran Jyot, “Enhancement in the Performance of K-means Algorithm”, International Journal of Computer Science and Communication Engineering, Volume 2 Issue 1, 2013.

K.Rajalakshmi,, Dr.S.S.Dhenakaran,N.Roobin “Comparative Analysis of K-Means Algorithm in Disease Prediction”, International Journal of Science, Engineering and Technology Research (IJSETR), Volume 4, Issue 7, July 2015


  • There are currently no refbacks.





About IJC | Privacy PolicyTerms & Conditions | Contact Us | DisclaimerFAQs 

IJC is published by (GSSRR).