Skip to main content
Log in

Approximation algorithms for the capacitated correlation clustering problem with penalties

  • Published:
Journal of Combinatorial Optimization Aims and scope Submit manuscript

Abstract

This paper considers the capacitated correlation clustering problem with penalties (CCorCwP), which is a new generalization of the correlation clustering problem. In this problem, we are given a complete graph, each edge is either positive or negative. Moreover, there is an upper bound on the number of vertices in each cluster, and each vertex has a penalty cost. The goal is to penalize some vertices and select a clustering of the remain vertices, so as to minimize the sum of the number of positive cut edges, the number of negative non-cut edges and the penalty costs. In this paper we present an integer programming, linear programming relaxation and two polynomial time algorithms for the CCorCwP. Given parameter \(\delta \in (0,4/9]\), the first algorithm is a \(\left( 8/(4-5\delta ), 8/\delta \right) \)-bi-criteria approximation algorithm for the CCorCPwP, which means that the number of vertices in each cluster does not exceed \(8/(4-5\delta )\) times the upper bound, and the output objective function value of the algorithm does not exceed \(8/\delta \) times the optimal value. The second one is based on above bi-criteria approximation, and we prove that the second algorithm can achieve a constant approximation ratio for some special instances of the CCorCwP.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Aboud A, Rabani Y (2008) Correlation clustering with penalties and approximating the reordering buffer management problem. Doctoral dissertation, Computer Science Department, Technion

  • Ahmadi S, Khuller S, Saha B (2019) Min-max correlation clustering via multicut. In: Proceedings of the 20th international conference on integer programming and combinatorial optimization, pp 13–26

  • Ahn KJ, Cormode G, Guha S, Mcgregor A, Wirth A (2015) Correlation clustering in data streams. In: Proceedings of the 32nd international conference on machine learning, pp 2237–2246

  • Ailon N, Avigdor-Elgrabli N, Liberty E, Zuylen AV (2012) Improved approximation algorithms for bipartite correlation clustering. SIAM J Comput 41(5):1110–1121

    Article  MathSciNet  MATH  Google Scholar 

  • Bansal N, Blum A, Chawla S (2004) Correlation clustering. Mach Learn 56(1–3):89–113

    Article  MathSciNet  MATH  Google Scholar 

  • Bressan M, Cesa-Bianchi N, Paudice A, Vitale F (2019) Correlation clustering with adaptive similarity queries. In: Proceedings of the 32nd annual conference on neural information processing systems, pp 12510–12519

  • Castro J, Nasini S, Saldanha-Da-Gama F (2017) A cutting-plane approach for large-scale capacitated multi-period facility location using a specialized interior-point method. Math Program 163(1–2):411–444

    Article  MathSciNet  MATH  Google Scholar 

  • Charikar M, Guruswami V, Wirth A (2005) Clustering with qualitative information. J Comput Syst Sci 3(71):360–383

    Article  MathSciNet  MATH  Google Scholar 

  • Chawla S, Makarychev K, Schramm T, Yaroslavtsev G (2015) Near optimal LP rounding algorithm for correlation clustering on complete and complete \(k\)-partite graphs. In: Proceedings of the 47th ACM symposium on theory of computing, pp 219–228

  • Chen X, Hu X, Jia X, Li M, Tang Z, Wang C (2018) Mechanism design for two-opposite-facility location games with penalties on distance. In: Proceedings of the 10th international symposium on algorithmic game theory, pp 256–260

  • Cohen-Addad V (2020) Approximation schemes for capacitated clustering in doubling metrics. In: Proceedings of the 30th annual ACM-SIAM symposium on discrete algorithms, pp 2241–2259

  • Filippi C, Guastaroba G, Speranza MG (2021) On single-source capacitated facility location with cost and fairness objectives. Eur J Oper Res 289(3):959–974

    Article  MathSciNet  MATH  Google Scholar 

  • Jafarov J, Kalhan S, Makarychev K, Makarychev Y (2020) Correlation clustering with asymmetric classification errors. In: Proceedings of the 37th international conference on machine learning, pp 4641–4650

  • Ji S, Cheng Y, Tan J, Zhao Z (2021) An improved approximation algorithm for capacitated correlation clustering problem. In: Proceedings of the 15th annual international conference on combinatorial optimization and applications, pp 35–45

  • Ji S, Xu D, Du D, Wu C (2019) Approximation algorithms for the fault-tolerant facility location problem with penalties. Discret Appl Math 264:62–75

    Article  MathSciNet  MATH  Google Scholar 

  • Lange JH, Karrenbauer A, Andres B (2018) Partial optimality and fast lower bounds for weighted correlation clustering. In: Proceedings of the 35th international conference on international conference on machine learning, pp 2892–2901

  • Li P, Puleo GJ, Milenkovic O (2019) Motif and hypergraph correlation clustering. IEEE Trans Inf Theory 66(5):3065–3078

    Article  MathSciNet  MATH  Google Scholar 

  • Puleo GJ, Milenkovic O (2015) Correlation clustering with constrained cluster sizes and extended weights bounds. SIAM J Optim 25(3):1857–1872

    Article  MathSciNet  MATH  Google Scholar 

  • Thiel E, Chehreghani MH, Dubhashi D (2019) A non-convex optimization approach to correlation clustering. In: Proceedings of the 33rd AAAI conference on artificial intelligence, pp 5159–5166

  • Xu Y, Möhring RH, Xu D, Zhang Y, Zou Y (2020) A constant FPT approximation algorithm for hard-capacitated \(k\)-means. Optim Eng 21(2):709–722

    Article  MathSciNet  MATH  Google Scholar 

  • Zhang D, Hao C, Wu C, Xu D, Zhang Z (2019) Local search approximation algorithms for the \(k\)-means problem with penalties. J Comb Optim 37(2):439–453

    Article  MathSciNet  MATH  Google Scholar 

Download references

Acknowledgements

The first author is supported by National Natural Science Foundation of China (No. 12101594) and the Project funded by China Postdoctoral Science Foundation (No. 2021M693337). The third author is supported by National Natural Science Foundation of China (No. 11871081). The fourth author is supported by National Natural Science Foundation of China (No. 11801310).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gaidi Li.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

A preliminary version of this paper appeared in Proceedings of the 15th International Conference on Algorithmic Applications in Management, pp. 15–26, 2021.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ji, S., Li, G., Zhang, D. et al. Approximation algorithms for the capacitated correlation clustering problem with penalties. J Comb Optim 45, 12 (2023). https://doi.org/10.1007/s10878-022-00930-6

Download citation

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10878-022-00930-6

Keywords

Navigation