Optimal Decision Trees For Interpretable Clustering with Constraints (Extended Version)

Shati, Pouya; Cohen, Eldan; McIlraith, Sheila

arXiv:2301.12671 (cs)

[Submitted on 30 Jan 2023 (v1), last revised 16 May 2023 (this version, v2)]

Abstract: Constrained clustering is a semi-supervised task that uses a limited amount of labelled data, formulated as constraints, to incorporate domain-specific knowledge and significantly improve clustering accuracy. Previous work has considered exact optimization formulations that guarantee an optimal clustering while satisfying all constraints; however, these approaches lack interpretability. Recently, decision trees have been used to produce inherently interpretable clustering solutions; however, existing approaches neither support clustering constraints nor provide strong theoretical guarantees on solution quality. In this work, we present a novel SAT-based framework for interpretable clustering that supports clustering constraints and provides strong theoretical guarantees on solution quality. We also present new insight into the trade-off between interpretability and the satisfaction of such user-provided constraints. Our framework is the first approach for interpretable and constrained clustering. Experiments with a range of real-world and synthetic datasets demonstrate that our approach can produce high-quality and interpretable constrained clustering solutions.
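
To make the setting in the abstract concrete, the following minimal Python sketch shows a decision tree used as a clustering (each leaf is a cluster) and a check of pairwise constraints against it. It is not the authors' SAT-based framework and performs no optimization. The must-link and cannot-link constraint types, the names `Leaf`, `Split`, `assign`, and `violated`, and the toy data are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple, Union


@dataclass
class Leaf:
    cluster: int  # every point that reaches this leaf gets this cluster id


@dataclass
class Split:
    feature: int      # index of the feature tested at this node
    threshold: float  # go left when x[feature] <= threshold
    left: "Node"
    right: "Node"


Node = Union[Leaf, Split]
Pair = Tuple[int, int]


def assign(tree: Node, x: Sequence[float]) -> int:
    """Follow axis-aligned splits from the root until a leaf is reached."""
    node = tree
    while isinstance(node, Split):
        node = node.left if x[node.feature] <= node.threshold else node.right
    return node.cluster


def violated(labels: List[int], must_link: List[Pair],
             cannot_link: List[Pair]) -> List[Tuple[str, int, int]]:
    """Return the constraints (assumed pairwise) that the labelling breaks."""
    bad = [("ML", i, j) for i, j in must_link if labels[i] != labels[j]]
    bad += [("CL", i, j) for i, j in cannot_link if labels[i] == labels[j]]
    return bad


# Hypothetical toy data: four 2-D points and two pairwise constraints.
points = [(1.0, 1.0), (1.5, 4.0), (4.0, 1.2), (4.5, 4.2)]
must_link = [(0, 1)]    # points 0 and 1 should share a cluster
cannot_link = [(2, 3)]  # points 2 and 3 should be separated

# A one-split tree is easy to read but cannot separate points 2 and 3.
shallow = Split(0, 2.0, Leaf(0), Leaf(1))
shallow_labels = [assign(shallow, p) for p in points]
shallow_violations = violated(shallow_labels, must_link, cannot_link)

# One extra split on the right branch satisfies both constraints.
deeper = Split(0, 2.0, Leaf(0), Split(1, 3.0, Leaf(1), Leaf(2)))
deeper_labels = [assign(deeper, p) for p in points]
deeper_violations = violated(deeper_labels, must_link, cannot_link)

print(shallow_labels, shallow_violations)
print(deeper_labels, deeper_violations)
```

In this toy case the shallower tree violates the cannot-link pair while the larger tree satisfies both constraints, which is one simple way to picture the trade-off between interpretability and constraint satisfaction that the abstract mentions.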

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2301.12671 [cs.LG]
	(or arXiv:2301.12671v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2301.12671

Submission history

From: Pouya Shati
[v1] Mon, 30 Jan 2023 05:34:49 UTC (55 KB)
[v2] Tue, 16 May 2023 14:24:36 UTC (48 KB)
