Skip to main content

Implementation of Modified K-means Approach for Privacy Preserving in Data Mining

  • Conference paper
  • First Online:
Advances in Computer and Computational Sciences

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 554))

  • 1334 Accesses

Abstract

Recent concerns regarding privacy breach issues have motivated the development of data mining methods, which preserve the privacy of individual data item. A cluster is gathering of information in such a way that the objects with similar properties are grouped into similar clusters and objects with dissimilar properties are placed into different clusters. The K-Means clustering algorithm is a broadly utilized plan to solve the clustering problem. In this paper, a comparative study of three clustering algorithms—K-means, Hierarchical and Cobweb across two different datasets is being performed. To form Clusters WEKA API has been used. The comparison is made with the variant of standard K-means technique that is Modified K-means technique. The Modified K-means technique has been developed to give better results as compared to existing K-means, Hierarchical and Cobweb techniques. This work also includes encryption and decryption of the formed clusters using AES algorithm to provide privacy to the data while transferring over networks. Experimental result proves that the performance of Modified K-means algorithm is better as compared to the existing K-Means and better than the hierarchical and Cobweb when tested on two datasets. K-Means and Hierarchical clustering is forming less number of clusters. In contrast, Cobweb is forming many clusters, which create memory issues. Therefore, Modified K-means forms an appropriate number of clusters in an organized manner and also takes minimum amount of time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Rui Li, Denise de Vries, John Roddick,” Bands Of Privacy Preserving Objectives: Classification of PPDM Strategies”, 2011 CRPIT.

    Google Scholar 

  2. G. Jagannathan, K. Pillaipakkamnatt, and R.N. Wright, “A New Privacy-Preserving Distributed K-means Clustering Algorithm,” in Proceedings of the Sixth SIAM International Conference on Data Mining, 2006.

    Google Scholar 

  3. A.K. Jain, M.N. Murty and P.J. Flynn, “Data Clustering: A Review”, ACM Computing Surveys, Vol. 31, No. 3, September 1999.

    Google Scholar 

  4. Neha B. Jinwala, Gordhan B. Jethava, Privacy “Preserving Using Distributed K-means Clustering for Arbitrarily Partitioned Data”, 2014 IJEDR.

    Google Scholar 

  5. P. Bunn and R. Ostrovsky, “Secure Two-Party K-means Clustering,” in Proceedings of the 14th ACM Conference on Computer and Communications Security. ACM New York, NY, USA, 2007.

    Google Scholar 

  6. Geetha Jagannathan and Rebecca N. Wright, “Privacy-Preserving Distributed K-means Clustering Over Arbitrarily Partitioned Data,” in Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, New York, NY, USA, 2005, pp. 593–599, ACM.

    Google Scholar 

  7. JyotiYadav, Monika Sharma, “A Review of K-mean Algorithm”, International Journal of Engineering Trends and Technology (IJETT)–Volume 4 Issue 7-July 2013.

    Google Scholar 

  8. George Karypis, eui-hong(sam)Han, Vipin Kumar, “Chameleon: Hierarchical Clustering Using Dynamic Modeling” 2009 IEEE.

    Google Scholar 

  9. Lavi Tyagi, Munesh Chandra Trivedi, “Hybrid K-mean and Refinement Based on Ant for Color Image Clustering” published in the springer proceedings(AISC), International Conference on Information and Communication Technology for Sustainable development (ICT4SD 2015).

    Google Scholar 

  10. Teng-Kai Yu, D.T. Lee, Shih-Ming Chang, “Multi-Party k-Means Clustering with Privacy Consideration”, IEEE DOI 10.1109/ISPA.2010.8.

  11. Deepak S. Turaga, Michail Vlachos, Olivier Verscheure, “On K-Means Cluster Preservation using Quantization Schemes”, 2009 IEEE.

    Google Scholar 

  12. Dongxi Li, Elisa Bertin, Xun Yi, “Privacy of Outsourced K-Means Clustering”, ASIA CCS’14.

    Google Scholar 

  13. ZekeriyaErkin, Thijs Veugen1, Tomas Toft, Reginald L. Lagendijk1, “Privacy-Preserving User Clustering In a Social Network”, 2009 IEEE.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shifa Khan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Khan, S., Dembla, D. (2018). Implementation of Modified K-means Approach for Privacy Preserving in Data Mining. In: Bhatia, S., Mishra, K., Tiwari, S., Singh, V. (eds) Advances in Computer and Computational Sciences. Advances in Intelligent Systems and Computing, vol 554. Springer, Singapore. https://doi.org/10.1007/978-981-10-3773-3_58

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-3773-3_58

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-3772-6

  • Online ISBN: 978-981-10-3773-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics