K-means Clustering Algorithm for Categorical Attributes

Gupta, S K; Rao, K Sambasiva; Bhatnagar, Vasudha

doi:10.1007/3-540-48298-9_22

S K Gupta,
K Sambasiva Rao &
Vasudha Bhatnagar

Conference paper
First Online: 01 January 2002

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1676))

Abstract

Efficient partitioning of large data sets into homogeneous clusters is a fundamental problem in data mining. Hierarchical clustering methods are poorly suited to this task because of their high computational complexity. K-means-based algorithms are promising because of their efficiency. However, their use is often limited to numeric data, and the quality of the clusters they produce depends on how the clusters are initialized and on the order in which data elements are processed in each iteration. We present a method that is based on the K-means philosophy but removes the numeric data limitation.
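
The abstract does not describe the authors' algorithm, so the following is only a minimal sketch of one common way to carry the K-means loop over to categorical records, and should not be read as the method of this paper. It assumes a simple matching dissimilarity (the count of attributes on which two records differ) in place of Euclidean distance, and the per-attribute most frequent value in place of the mean, in the style usually called k-modes. The names `mismatch`, `column_modes`, and `cluster_categorical` are hypothetical. The optional `init` argument makes the dependence on initialization explicit.

```python
from collections import Counter
import random


def mismatch(a, b):
    """Simple matching dissimilarity: number of attributes on which a and b differ."""
    return sum(x != y for x, y in zip(a, b))


def column_modes(rows):
    """Most frequent value of each attribute across the given records."""
    return tuple(Counter(col).most_common(1)[0][0] for col in zip(*rows))


def cluster_categorical(records, k, init=None, seed=0, max_iter=100):
    """K-means-style loop for categorical tuples (illustrative assumption,
    not the paper's algorithm): assign each record to the nearest center,
    then recompute each center as the per-attribute mode of its members."""
    rng = random.Random(seed)
    # Initial centers: caller-supplied, or k records drawn at random.
    centers = list(init) if init is not None else rng.sample(records, k)
    labels = None
    for _ in range(max_iter):
        new_labels = [
            min(range(k), key=lambda j: mismatch(r, centers[j])) for r in records
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for j in range(k):
            members = [r for r, lab in zip(records, labels) if lab == j]
            if members:  # an empty cluster keeps its previous center
                centers[j] = column_modes(members)
    return labels, centers


records = [
    ("red", "small", "round"),
    ("red", "small", "oval"),
    ("red", "medium", "round"),
    ("blue", "large", "square"),
    ("blue", "large", "round"),
    ("green", "large", "square"),
]
labels, centers = cluster_categorical(records, k=2, init=[records[0], records[3]])
```

Running the sketch with different `init` or `seed` values on less cleanly separated data can give different partitions, which is the sensitivity to initialization that the abstract mentions.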

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Indian Institute of Technology, Hauz Khas, New Delhi, 110 016, India
S K Gupta
Department of Computer Science and Engineering, Indian Institute of Technology, Hauz Khas, New Delhi, 110 016, India
K Sambasiva Rao
Department of Computer Science, Motilal Nehru College, Delhi University, Delhi, India
Vasudha Bhatnagar

Editor information

Editors and Affiliations

School of Computer and Information Science, University of South Australia, The Levels, Adelaide, Australia, 05
Mukesh Mohania
IFS, Technical University of Vienna, Resselgasse 3, A-1040, Vienna, Austria
A Min Tjoa

Cite this paper

Gupta, S.K., Rao, K.S., Bhatnagar, V. (1999). K-means Clustering Algorithm for Categorical Attributes. In: Mohania, M., Tjoa, A.M. (eds) DataWarehousing and Knowledge Discovery. DaWaK 1999. Lecture Notes in Computer Science, vol 1676. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48298-9_22

DOI: https://doi.org/10.1007/3-540-48298-9_22
Published: 01 March 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66458-1
Online ISBN: 978-3-540-48298-7
eBook Packages: Springer Book Archive
