Robust Best-arm Identification in Linear Bandits

Wang, Wei; Vakili, Sattar; Bogunovic, Ilija

Computer Science > Machine Learning

arXiv:2311.04731 (cs)

[Submitted on 8 Nov 2023]

Title:Robust Best-arm Identification in Linear Bandits

Authors:Wei Wang, Sattar Vakili, Ilija Bogunovic

View PDF

Abstract:We study the robust best-arm identification problem (RBAI) in the case of linear rewards. The primary objective is to identify a near-optimal robust arm, which involves selecting arms at every round and assessing their robustness by exploring potential adversarial actions. This approach is particularly relevant when utilizing a simulator and seeking to identify a robust solution for real-world transfer. To this end, we present an instance-dependent lower bound for the robust best-arm identification problem with linear rewards. Furthermore, we propose both static and adaptive bandit algorithms that achieve sample complexity that matches the lower bound. In synthetic experiments, our algorithms effectively identify the best robust arm and perform similarly to the oracle strategy. As an application, we examine diabetes care and the process of learning insulin dose recommendations that are robust with respect to inaccuracies in standard calculators. Our algorithms prove to be effective in identifying robust dosage values across various age ranges of patients.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2311.04731 [cs.LG]
	(or arXiv:2311.04731v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.04731

Submission history

From: Wei Wang [view email]
[v1] Wed, 8 Nov 2023 14:58:11 UTC (455 KB)

Computer Science > Machine Learning

Title:Robust Best-arm Identification in Linear Bandits

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Robust Best-arm Identification in Linear Bandits

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators