The shortest path problem is considered to be one of the classical and most important combinatorial optimization problems. Given a directed graph and a length α ij for each arc (i, j), the problem is to find a path of minimum length that leads from any node i to a node t, called the destination node. So, for each node i, we need to optimally identify a successor node u(i) so as to reach the destination at the minimum sum of arc lengths over all paths that start at i and terminate at t. Of particular relevance is, in the area of distributed computation, the problem of data routing within a computer communication network. In such a case, the cost associated with a particular link (i, j) is related to an average delay. The stochastic shortest path problem is a generalization whereby for each node i we must select a probability distribution over all possible successor nodes j out of a given set of probability distributions p ij (u), parameterized by a control u ∈ U(i). Clearly, the path...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bertsekas, D. P.: Dynamic programming and optimal control, Athena Sci., 1995.
Bertsekas, D. P., and Tsitsiklis, J. N.: Neurodynamic programming, Athena Sci., 1997.
Eaton, J. H., and Zadeh, L. A.: ‘Optimal pursuit strategies in discrete state probabilistic systems’, Trans. ASME Ser. D.J. Basic Engin.84 (1962), 23–29.
Littman, M. L.: ‘Algorithms for sequential decision making’, PhD Thesis Brown Univ. (1996).
Psaraftis, H. N., and Tsitsiklis, J. N.: ‘Dynamic shortest paths in acyclic networks with markovian arc costs’, Oper. Res.41 (1993), 91–101.
Puterman, M. L.: Markov decision processes-Discrete stochastoc dynamic programming, Wiley, 1994.
Sutton, R. S.: ‘Learning to predict by the method of temporal differences’, Machine Learning3 (1988), 9–44.
Tsitsiklis, J. N.: ‘Asynchronous stochastic aggregation and Q-learning’, Machine Learning16 (1994), 185–202.
Watkins, C. J.: ‘Learning from delayed rewards’, PhD Thesis Cambridge Univ. (1989).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Kluwer Academic Publishers
About this entry
Cite this entry
Androulakis, I.P. (2001). Dynamic Programming: Stochastic Shortest Path Problems . In: Floudas, C.A., Pardalos, P.M. (eds) Encyclopedia of Optimization. Springer, Boston, MA. https://doi.org/10.1007/0-306-48332-7_113
Download citation
DOI: https://doi.org/10.1007/0-306-48332-7_113
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-7923-6932-5
Online ISBN: 978-0-306-48332-5
eBook Packages: Springer Book Archive