Client Selection for Federated Policy Optimization with Environment Heterogeneity

Xie, Zhijie; Song, S. H.

arXiv:2305.10978 (cs)

[Submitted on 18 May 2023 (v1), last revised 20 Feb 2024 (this version, v5)]

Abstract: The development of Policy Iteration (PI) has inspired many recent algorithms for Reinforcement Learning (RL), including several policy gradient methods that combine theoretical soundness with empirical success on a variety of tasks. The theory of PI is rich in the context of centralized learning, but its study in the federated setting is still in its infancy. This paper investigates the federated version of Approximate PI (API) and derives its error bound, taking into account the approximation error introduced by environment heterogeneity. We prove that a proper client selection scheme can reduce this error bound. Based on this result, we propose a client selection algorithm to alleviate the additional approximation error caused by environment heterogeneity. Experimental results show that the proposed algorithm outperforms other biased and unbiased client selection methods on the federated mountain car problem and the MuJoCo Hopper problem by effectively selecting clients with a lower level of heterogeneity from the population distribution.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2305.10978 [cs.LG]
	(or arXiv:2305.10978v5 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2305.10978

Submission history

From: Zhijie Xie
[v1] Thu, 18 May 2023 13:48:20 UTC (1,299 KB)
[v2] Sun, 21 May 2023 03:24:02 UTC (2,235 KB)
[v3] Wed, 24 May 2023 15:13:37 UTC (2,265 KB)
[v4] Thu, 15 Feb 2024 13:33:22 UTC (4,325 KB)
[v5] Tue, 20 Feb 2024 10:47:47 UTC (4,545 KB)
