Temporal Difference Approach to Playing Give-Away Checkers

Mańdziuk, Jacek; Osman, Daniel

doi:10.1007/978-3-540-24844-6_141

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3070)

Abstract

In this paper we examine the application of temporal difference methods to learning a linear state-value function approximation for the game of give-away checkers. Empirical results show that the TD(λ) algorithm can be used successfully to improve the quality of the playing policy in this domain. Training games against both strong and random opponents were considered. The results show that learning only from negative game outcomes improved the learning player's performance against strong opponents.
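
As an illustration of the kind of update the abstract refers to, the following is a minimal sketch of TD(λ) with a linear value function V(s) = w · φ(s) and accumulating eligibility traces. It is not the authors' implementation: the feature vectors, the parameter values (alpha, lam, gamma), the function names, and the only_losses switch, which is one possible reading of "learning only on negative game outcomes", are all assumptions made for this example.

```python
from typing import List, Sequence


def linear_value(weights: Sequence[float], features: Sequence[float]) -> float:
    """Linear state value: dot product of the weights and the feature vector."""
    return sum(w * x for w, x in zip(weights, features))


def td_lambda_episode(
    weights: Sequence[float],
    states: Sequence[Sequence[float]],  # feature vectors of the positions visited
    outcome: float,                     # final game result, e.g. +1 win, -1 loss
    alpha: float = 0.01,                # learning rate (assumed value)
    lam: float = 0.9,                   # trace decay lambda (assumed value)
    gamma: float = 1.0,                 # discount factor (assumed value)
) -> List[float]:
    """Run online TD(lambda) over one finished game and return the new weights."""
    w = list(weights)
    trace = [0.0] * len(w)
    for t, x in enumerate(states):
        v = linear_value(w, x)
        if t + 1 < len(states):
            # Non-terminal step: bootstrap from the value of the next position.
            target = gamma * linear_value(w, states[t + 1])
        else:
            # Last position: the target is the game outcome itself.
            target = outcome
        delta = target - v
        # For a linear value function the gradient w.r.t. w is the feature vector.
        trace = [gamma * lam * e + xi for e, xi in zip(trace, x)]
        w = [wi + alpha * delta * e for wi, e in zip(w, trace)]
    return w


def learn_from_game(
    weights: Sequence[float],
    states: Sequence[Sequence[float]],
    outcome: float,
    only_losses: bool = False,  # hypothetical switch, not taken from the paper
    **td_params: float,
) -> List[float]:
    """Optionally skip the update unless the learning player lost the game."""
    if only_losses and outcome >= 0:
        return list(weights)
    return td_lambda_episode(weights, states, outcome, **td_params)
```

With lam = 0 only the last visited position receives credit for the outcome, while larger values of lam spread the final temporal difference error back over earlier positions through the eligibility trace.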

Author information

Authors and Affiliations

Faculty of Mathematics and Information Science, Warsaw University of Technology, Plac Politechniki 1, 00-661, Warsaw, Poland
Jacek Mańdziuk & Daniel Osman

Editor information

Editors and Affiliations

Department of Artificial Intelligence, Academy of Humanities and Economics, Poland
Leszek Rutkowski
German Research Center of Artificial Intelligence (DFKI), Germany
Jörg H. Siekmann
Institute of Automatics, AGH University of Science and Technology, Al. Mickiewicza 30, PL-30-059, Kraków, Poland
Ryszard Tadeusiewicz
Department of Electrical Engineering and Computer Sciences, Berkeley Initiative in Soft Computing (BISC), University of California, Berkeley, CA 94720-1776
Lotfi A. Zadeh

About this paper

Cite this paper

Mańdziuk, J., Osman, D. (2004). Temporal Difference Approach to Playing Give-Away Checkers. In: Rutkowski, L., Siekmann, J.H., Tadeusiewicz, R., Zadeh, L.A. (eds) Artificial Intelligence and Soft Computing - ICAISC 2004. Lecture Notes in Computer Science, vol 3070. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24844-6_141

DOI: https://doi.org/10.1007/978-3-540-24844-6_141
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22123-4
Online ISBN: 978-3-540-24844-6
eBook Packages: Springer Book Archive
