Efficient Constrained K-center Clustering with Background Knowledge

Authors

  • Longkun Guo School of Mathematics and Statistics, Fuzhou University, Fuzhou 350116, China Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250316, China
  • Chaoqi Jia Key Laboratory of Computing Power Network and Information Security, Ministry of Education, Shandong Computer Science Center, Qilu University of Technology (Shandong Academy of Sciences), Jinan 250316, China
  • Kewen Liao HilstLab, Peter Faber Business School, Australian Catholic University, Sydney 2060, Australia
  • Zhigang Lu College of Science and Engineering, James Cook University, Townsville 4810, Australia
  • Minhui Xue CSIRO's Data61, Sydney 2015, Australia

DOI:

https://doi.org/10.1609/aaai.v38i18.30058

Keywords:

SO: Combinatorial Optimization, ML: Clustering

Abstract

Center-based clustering has attracted significant research interest from both theory and practice. In many practical applications, input data often contain background knowledge that can be used to improve clustering results. In this work, we build on widely adopted k-center clustering and model its input background knowledge as must-link (ML) and cannot-link (CL) constraint sets. However, most clustering problems including k-center are inherently NP-hard, while the more complex constrained variants are known to suffer severer approximation and computation barriers that significantly limit their applicability. By employing a suite of techniques including reverse dominating sets, linear programming (LP) integral polyhedron, and LP duality, we arrive at the first efficient approximation algorithm for constrained k-center with the best possible ratio of 2. We also construct competitive baseline algorithms and empirically evaluate our approximation algorithm against them on a variety of real datasets. The results validate our theoretical findings and demonstrate the great advantages of our algorithm in terms of clustering cost, clustering quality, and running time.

Published

2024-03-24

How to Cite

Guo, L., Jia, C., Liao, K., Lu, Z., & Xue, M. (2024). Efficient Constrained K-center Clustering with Background Knowledge. Proceedings of the AAAI Conference on Artificial Intelligence, 38(18), 20709-20717. https://doi.org/10.1609/aaai.v38i18.30058

Issue

Section

AAAI Technical Track on Search and Optimization