Abstract
Recent concerns regarding privacy breach issues have motivated the development of data mining methods, which preserve the privacy of individual data item. A cluster is gathering of information in such a way that the objects with similar properties are grouped into similar clusters and objects with dissimilar properties are placed into different clusters. The K-Means clustering algorithm is a broadly utilized plan to solve the clustering problem. In this paper, a comparative study of three clustering algorithms—K-means, Hierarchical and Cobweb across two different datasets is being performed. To form Clusters WEKA API has been used. The comparison is made with the variant of standard K-means technique that is Modified K-means technique. The Modified K-means technique has been developed to give better results as compared to existing K-means, Hierarchical and Cobweb techniques. This work also includes encryption and decryption of the formed clusters using AES algorithm to provide privacy to the data while transferring over networks. Experimental result proves that the performance of Modified K-means algorithm is better as compared to the existing K-Means and better than the hierarchical and Cobweb when tested on two datasets. K-Means and Hierarchical clustering is forming less number of clusters. In contrast, Cobweb is forming many clusters, which create memory issues. Therefore, Modified K-means forms an appropriate number of clusters in an organized manner and also takes minimum amount of time.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Rui Li, Denise de Vries, John Roddick,” Bands Of Privacy Preserving Objectives: Classification of PPDM Strategies”, 2011 CRPIT.
G. Jagannathan, K. Pillaipakkamnatt, and R.N. Wright, “A New Privacy-Preserving Distributed K-means Clustering Algorithm,” in Proceedings of the Sixth SIAM International Conference on Data Mining, 2006.
A.K. Jain, M.N. Murty and P.J. Flynn, “Data Clustering: A Review”, ACM Computing Surveys, Vol. 31, No. 3, September 1999.
Neha B. Jinwala, Gordhan B. Jethava, Privacy “Preserving Using Distributed K-means Clustering for Arbitrarily Partitioned Data”, 2014 IJEDR.
P. Bunn and R. Ostrovsky, “Secure Two-Party K-means Clustering,” in Proceedings of the 14th ACM Conference on Computer and Communications Security. ACM New York, NY, USA, 2007.
Geetha Jagannathan and Rebecca N. Wright, “Privacy-Preserving Distributed K-means Clustering Over Arbitrarily Partitioned Data,” in Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, New York, NY, USA, 2005, pp. 593–599, ACM.
JyotiYadav, Monika Sharma, “A Review of K-mean Algorithm”, International Journal of Engineering Trends and Technology (IJETT)–Volume 4 Issue 7-July 2013.
George Karypis, eui-hong(sam)Han, Vipin Kumar, “Chameleon: Hierarchical Clustering Using Dynamic Modeling” 2009 IEEE.
Lavi Tyagi, Munesh Chandra Trivedi, “Hybrid K-mean and Refinement Based on Ant for Color Image Clustering” published in the springer proceedings(AISC), International Conference on Information and Communication Technology for Sustainable development (ICT4SD 2015).
Teng-Kai Yu, D.T. Lee, Shih-Ming Chang, “Multi-Party k-Means Clustering with Privacy Consideration”, IEEE DOI 10.1109/ISPA.2010.8.
Deepak S. Turaga, Michail Vlachos, Olivier Verscheure, “On K-Means Cluster Preservation using Quantization Schemes”, 2009 IEEE.
Dongxi Li, Elisa Bertin, Xun Yi, “Privacy of Outsourced K-Means Clustering”, ASIA CCS’14.
ZekeriyaErkin, Thijs Veugen1, Tomas Toft, Reginald L. Lagendijk1, “Privacy-Preserving User Clustering In a Social Network”, 2009 IEEE.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Khan, S., Dembla, D. (2018). Implementation of Modified K-means Approach for Privacy Preserving in Data Mining. In: Bhatia, S., Mishra, K., Tiwari, S., Singh, V. (eds) Advances in Computer and Computational Sciences. Advances in Intelligent Systems and Computing, vol 554. Springer, Singapore. https://doi.org/10.1007/978-981-10-3773-3_58
Download citation
DOI: https://doi.org/10.1007/978-981-10-3773-3_58
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3772-6
Online ISBN: 978-981-10-3773-3
eBook Packages: EngineeringEngineering (R0)