Abstract
This paper presents a new adaptive segmentation method for high-dimensional continuous state spaces based on a vector quantization algorithm such as Linde–Buzo–Gray (LBG). The objective of adaptive state space partitioning is to improve the efficiency of learning reward values by accumulating state transition vectors in a single-agent environment. We constructed a single-agent model with a continuous state space and a discrete action space using the Q-learning function. Moreover, a study of the resulting state space partition reveals a Voronoi tessellation. In addition, the experimental results show that the proposed method not only partitions the continuous state space appropriately into Voronoi regions according to the number of actions, but also achieves good performance on reward-based learning tasks compared with other approaches, such as a square lattice partition of a discretized state space.
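The core idea described above can be sketched as follows: a set of codebook vectors (as produced by a vector quantizer such as LBG) induces a Voronoi partition of the continuous state space, and tabular Q-learning is then run over the resulting regions. This is a minimal illustrative sketch, not the paper's implementation; all names and parameter values here are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative sizes (assumptions, not taken from the paper)
n_regions, n_actions, state_dim = 8, 4, 2

# Each codebook vector is the center of one Voronoi region;
# each region owns one row of the Q-table.
codebook = rng.uniform(0.0, 1.0, size=(n_regions, state_dim))
Q = np.zeros((n_regions, n_actions))

alpha, gamma = 0.1, 0.9  # learning rate and discount factor (illustrative)

def region(state):
    """Map a continuous state to the Voronoi region of its nearest center."""
    return int(np.argmin(np.linalg.norm(codebook - state, axis=1)))

def q_update(state, action, reward, next_state):
    """Standard tabular Q-learning update over the discretized states."""
    s, s2 = region(state), region(next_state)
    Q[s, action] += alpha * (reward + gamma * Q[s2].max() - Q[s, action])

# One illustrative transition with reward 1.0
s, s2 = np.array([0.2, 0.7]), np.array([0.25, 0.65])
q_update(s, action=1, reward=1.0, next_state=s2)
print(Q[region(s), 1])  # updated Q-value for that region/action pair
```

In an adaptive scheme such as the one proposed, the codebook itself would also be refined online from accumulated state transition vectors, moving region boundaries where finer distinctions between actions are needed.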
Fuchida, T., Aung, K.T. A proposition of adaptive state space partition in reinforcement learning with Voronoi tessellation. Artif Life Robotics 18, 172–177 (2013). https://doi.org/10.1007/s10015-013-0125-x