Skip to main content
Log in

A proposition of adaptive state space partition in reinforcement learning with Voronoi tessellation

  • Special Feature: Original Article
  • Published:
Artificial Life and Robotics Aims and scope Submit manuscript

Abstract

This paper presents a new adaptive segmentation of continuous state space based on vector quantization algorithm such as Linde–Buzo–Gray for high-dimensional continuous state spaces. The objective of adaptive state space partitioning is to develop the efficiency of learning reward values with an accumulation of state transition vector in a single-agent environment. We constructed our single-agent model in continuous state and discrete actions spaces using Q-learning function. Moreover, the study of the resulting state space partition reveals a Voronoi tessellation. In addition, the experimental results show that this proposed method can partition the continuous state space appropriately into Voronoi regions according to not only the number of actions, but also achieve a good performance of reward-based learning tasks compared with other approaches such as square partition lattice on discrete state space.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Sutton RS, Barto AG (1998) Reinforcement learning an introduction. MIT Press, Cambridge

    Google Scholar 

  2. Watkins CJCH, Dayan P (1992) Technical notes: Q-learning. Mach Learn 8:279–292

    MATH  Google Scholar 

  3. Patane G, Russo M (2001) The enhanced LBG algorithm. In: Proceeding of neural networks, pp 1219–1237

  4. Shen F, Hasegawa O (2006) An adaptive incremental LBG for vector quantization. Neural Netw 19:694–704

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Takayasu Fuchida.

About this article

Cite this article

Fuchida, T., Aung, K.T. A proposition of adaptive state space partition in reinforcement learning with Voronoi tessellation. Artif Life Robotics 18, 172–177 (2013). https://doi.org/10.1007/s10015-013-0125-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10015-013-0125-x

Keywords

Navigation