透過您的圖書館登入
IP:3.137.178.133
  • 學位論文

基於深度強化學習之輪型足球機器人的守門員策略

Deep Reinforcement Learning Based Goalkeeper Strategy of Wheeled Soccer Robot

指導教授 : 李世安
本文將於2025/02/27開放下載。若您希望在開放下載時收到通知,可將文章加入收藏

摘要


隨著機器學習的發展,深度學習已經廣泛的應用在工業製造中的影像辨識中,而強化學習也已經有在各個遊戲平台中,表現出優於人類的能力。本論文實現一個基於深度強化學習之輪型足球機器人的守門員策略,已達到防守敵方攻擊者的射門進攻策略,並提出一個有效的訓練方法,使足球守門員能夠以更加有效率的方式學習策略。本論文所採用的深度強化學習演算法為柔性行動者評論家,藉由設計此演算法的神經網路架構和以Gazebo模擬器做為環境平台,使機器守門員能更在模擬的環境中探索與學習,並能有效得完成指定的任務。訓練方法部分,本論文設計良好的獎勵函數和利用漸進式學習的方式。藉由慢慢增加訓練環境的困難度,使訓練能夠更加穩定,以及降低訓練所需要花費的時間。根據本論文所設計之方法,以分區域的方法進行測試其策略的有效性,皆表現出優良的防守率,以證明本論文所設計的方法的強健性。

並列摘要


With the development of machine learning, deep learning has been widely applied in industrial manufactory, and reinforcement learning also has a good performance in lots of games. This paper presents a method based on deep reinforcement learning on goalkeeper strategy of wheeled soccer robot which makes the robot goalkeeper can defend the attacker shoots the soccer toward the goal and presents an effective training method that can make robot goalkeeper has a high effectivity in learning the strategy. We adopt the Soft Actor-Critic (SAC) which is one of a deep reinforcement learning algorithm. With the Gazebo simulator and SAC, the robot goalkeeper can explore the environment and learn the strategy in the simulated environment, and enable the robot goalkeeper stable and effectively complete the specified tasks by designing neural network architecture. In the section of the training method, the idea of this paper creates a useful reward function and using progressive learning. By increasing the complexity of the environment step by step, the training can be more stable, and reduce the training time. According to the method presented in the paper, by dividing the field into several areas to test the result, the robot goalkeeper has a good performance in its defending rate that proofs the robusticity of the present method.

參考文獻


[1] RoboCup, URL: https://www.robocup.org/
[2] FIRA, URL: http://www.fira.net
[3] WRS, URL: https://worldrobotsummit.org/
[4] Shakey, URL: http://www.ai.sri.com/shakey/
[5] Kiva robot, URL: https://robohub.org/meet-the-drone-that-already-delivers-your-packages-kiva-robot-teardown/

延伸閱讀