Deep Bayesian Reinforcement Learning for Spacecraft Proximity Maneuvers and Docking

Du, Desong; Qi, Naiming; Liu, Yanfang; Pan, Wei

arXiv:2311.03680 (cs)

[Submitted on 7 Nov 2023]

Abstract: In the pursuit of autonomous spacecraft proximity maneuvers and docking (PMD), we introduce a novel Bayesian actor-critic reinforcement learning algorithm to learn a control policy with a stability guarantee. The PMD task is formulated as a Markov decision process that reflects the relative dynamic model, the docking cone, and the cost function. Drawing on Lyapunov theory, we frame temporal difference learning as a constrained Gaussian process regression problem. This approach allows the state-value function to be expressed as a Lyapunov function, leveraging Gaussian processes and deep kernel learning. We develop a novel Bayesian quadrature policy optimization procedure to analytically compute the policy gradient while integrating Lyapunov-based stability constraints. This integration is pivotal in satisfying the rigorous safety demands of spaceflight missions. The proposed algorithm has been experimentally evaluated on a spacecraft air-bearing testbed and shows promising performance.

Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2311.03680 [cs.RO]
	(or arXiv:2311.03680v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2311.03680

Submission history

From: Desong Du
[v1] Tue, 7 Nov 2023 03:12:58 UTC (9,458 KB)
