Best Arm Identification for Stochastic Rising Bandits

Mussi, Marco; Montenegro, Alessandro; Trovó, Francesco; Restelli, Marcello; Metelli, Alberto Maria

Computer Science > Machine Learning

arXiv:2302.07510 (cs)

[Submitted on 15 Feb 2023 (v1), last revised 1 Jun 2023 (this version, v2)]

Title:Best Arm Identification for Stochastic Rising Bandits

Authors:Marco Mussi, Alessandro Montenegro, Francesco Trovó, Marcello Restelli, Alberto Maria Metelli

View PDF

Abstract:Stochastic Rising Bandits (SRBs) model sequential decision-making problems in which the expected rewards of the available options increase every time they are selected. This setting captures a wide range of scenarios in which the available options are learning entities whose performance improves (in expectation) over time. While previous works addressed the regret minimization problem, this paper, focuses on the fixed-budget Best Arm Identification (BAI) problem for SRBs. In this scenario, given a fixed budget of rounds, we are asked to provide a recommendation about the best option at the end of the identification process. We propose two algorithms to tackle the above-mentioned setting, namely R-UCBE, which resorts to a UCB-like approach, and R-SR, which employs a successive reject procedure. Then, we prove that, with a sufficiently large budget, they provide guarantees on the probability of properly identifying the optimal option at the end of the learning process. Furthermore, we derive a lower bound on the error probability, matched by our R-SR (up to logarithmic factors), and illustrate how the need for a sufficiently large budget is unavoidable in the SRB setting. Finally, we numerically validate the proposed algorithms in both synthetic and real-world environments and compare them with the currently available BAI strategies.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2302.07510 [cs.LG]
	(or arXiv:2302.07510v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2302.07510

Submission history

From: Marco Mussi [view email]
[v1] Wed, 15 Feb 2023 08:01:37 UTC (281 KB)
[v2] Thu, 1 Jun 2023 12:22:29 UTC (815 KB)

Computer Science > Machine Learning

Title:Best Arm Identification for Stochastic Rising Bandits

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Best Arm Identification for Stochastic Rising Bandits

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators