research-article

Data-missing k-means based on intra-cluster and inter-cluster distances

Authors:
Jiaji Qiu

College of Mathematics and Computer Science, Zhejiang Normal University, China

College of Mathematics and Computer Science, Zhejiang Normal University, China

0000-0001-7817-8712
View Profile

,
Huiying Xu

College of Mathematics and Computer Science, Zhejiang Normal University, China

College of Mathematics and Computer Science, Zhejiang Normal University, China

0000-0002-6704-0301
View Profile

,
Xinzhong Zhu

College of Mathematics and Computer Science, Zhejiang Normal University, China and Beijing Geekplus Technology Co., Ltd., China

College of Mathematics and Computer Science, Zhejiang Normal University, China and Beijing Geekplus Technology Co., Ltd., China

0000-0002-0033-5260
View Profile

EITCE '22: Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer EngineeringOctober 2022Pages 1554–1558https://doi.org/10.1145/3573428.3573701

Published:15 March 2023Publication History

EITCE '22: Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering

Pages 1554–1558

ABSTRACT

This paper proposes a method that reduces the intra-cluster distance and increases the inter-cluster distance in the k-means problem with missing data. Filling in missing data, calculating intra-cluster distances between clusters, and clustering problems are integrated into one function, and solved through loop iterations. Finally, the method is applied to 4 UCI datasets, and the results show that the method has good effect.

References

Jain, A. K. 2010. Data clustering: 50 years beyond K-means. Pattern recognition letters, 31(8), 651-666.Google Scholar
Feng, J., Zhang, Y., Yue, G., Liu, X., Su, H., & Zhang, P. F. 2018. Atherosclerotic Plaque Pathological Analysis by Unsupervised $ K $-Means Clustering. IEEE Access, 6, 21530-21535.Google ScholarCross Ref
Munir, M. U., Javed, M. Y., & Khan, S. A. 2012. A hierarchical k-means clustering based fingerprint quality classification. Neurocomputing, 85, 62-67.Google ScholarDigital Library
Peng, K., Leung, V. C., & Huang, Q. 2018. Clustering approach based on mini batch kmeans for intrusion detection system over big data. IEEE Access, 6, 11897-11906.Google ScholarCross Ref
Lin, X., & Li, C. T. 2016. Large-scale image clustering based on camera fingerprints. IEEE Transactions on Information Forensics and Security, 12(4), 793-808.Google Scholar
Wang, S., Li, M., Hu, N., Zhu, E., Hu, J., Liu, X., & Yin, J. 2019. K-means clustering with incomplete data. IEEE Access, 7, 69162-69171.Google ScholarCross Ref
Wu, S., & Chow, T. W. 2004. Clustering of the self-organizing map using a clustering validity index based on inter-cluster and intra-cluster density. Pattern Recognition, 37(2), 175-188.Google ScholarCross Ref
García-Laencina, P. J., Sancho-Gómez, J. L., Figueiras-Vidal, A. R., & Verleysen, M. 2009. K nearest neighbours with mutual information for simultaneous classification and missing data imputation. Neurocomputing, 72(7-9), 1483-1493.Google ScholarDigital Library
Aste, M., Boninsegna, M., Freno, A., & Trentin, E. 2015. Techniques for dealing with incomplete data: a tutorial and survey. Pattern Analysis and Applications, 18(1), 1-29.Google ScholarDigital Library
Dempster, A. P., Laird, N. M., & Rubin, D. B. 1977. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society: Series B (Methodological), 39(1), 1-22.Google ScholarCross Ref

Recommendations

Inter cluster distance management model with optimal centroid estimation for K-means clustering algorithm

Clustering techniques are used to group up the transactions based on the relevancy. Cluster analysis is one of the primary data analysis method. The clustering process can be done in two ways such that Hierarchical clusters and partition clustering. ...
Read More
Effect of cluster size distribution on clustering: a comparative study of k-means and fuzzy c-means clustering
Abstract
Data distribution has a significant impact on clustering results. This study focuses on the effect of cluster size distribution on clustering, namely the uniform effect of k-means and fuzzy c-means (FCM) clustering. We first provide some related ...
Read More
Ant clustering algorithm with K-harmonic means clustering

Clustering is an unsupervised learning procedure and there is no a prior knowledge of data distribution. It organizes a set of objects/data into similar groups called clusters, and the objects within one cluster are highly similar and dissimilar with ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

EITCE '22: Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering
October 2022
1999 pages
ISBN:9781450397148
DOI:10.1145/3573428

Copyright © 2022 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 15 March 2023
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
clustering
incomplete data
k-means
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
Overall Acceptance Rate508of972submissions,52%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 26
  Total Downloads
- Downloads (Last 12 months)23
- Downloads (Last 6 weeks)2
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

Data-missing k-means based on intra-cluster and inter-cluster distances

EITCE '22: Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering

ABSTRACT

References

Cited By

Recommendations

Inter cluster distance management model with optimal centroid estimation for K-means clustering algorithm

Effect of cluster size distribution on clustering: a comparative study of k-means and fuzzy c-means clustering

Ant clustering algorithm with K-harmonic means clustering

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

HTML Format

Caption

Data-missing k-means based on intra-cluster and inter-cluster distances

EITCE '22: Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering

ABSTRACT

References

Cited By

Recommendations

Inter cluster distance management model with optimal centroid estimation for K-means clustering algorithm

Effect of cluster size distribution on clustering: a comparative study of k-means and fuzzy c-means clustering

Ant clustering algorithm with K-harmonic means clustering

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

HTML Format

Share this Publication link

Share on Social Media