Dynamic Programming: Stochastic Shortest Path Problems

Androulakis, Ioannis P.

doi:10.1007/0-306-48332-7_113

The shortest path problem is one of the classical and most important combinatorial optimization problems. Given a directed graph and a length α_ij for each arc (i, j), the problem is to find, for every node i, a path of minimum length leading from i to a designated node t, called the destination node. Thus, for each node i, we must identify a successor node u(i) such that the destination is reached at the minimum sum of arc lengths over all paths that start at i and terminate at t. A particularly relevant application, in the area of distributed computation, is data routing within a computer communication network, where the cost associated with a link (i, j) is related to an average delay. The stochastic shortest path problem is a generalization in which, at each node i, we must select a probability distribution over all possible successor nodes j from a given set of probability distributions p_ij(u), parameterized by a control u ∈ U(i). Clearly, the path...
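To make the formulation concrete, the following is a minimal sketch of value iteration for a small stochastic shortest path instance. It is not taken from the entry itself: the recursion J(i) = min over u ∈ U(i) of Σ_j p_ij(u) (α_ij + J(j)), with J(t) = 0, is the standard dynamic programming formulation and is assumed here. The toy graph, the control labels, and the function names are hypothetical, and the sketch further assumes that the destination is reached with probability 1 under the controls considered.

```python
from typing import Dict, List, Tuple

# Hypothetical data layout: model[i][u] lists the outcomes of control u at node i
# as (successor j, probability p_ij(u), arc length a_ij).
Outcomes = List[Tuple[str, float, float]]
Model = Dict[str, Dict[str, Outcomes]]


def expected_cost(outcomes: Outcomes, J: Dict[str, float]) -> float:
    """Expected arc length plus expected cost-to-go for one control."""
    return sum(p * (a + J[j]) for j, p, a in outcomes)


def value_iteration(model: Model, destination: str,
                    tol: float = 1e-10, max_sweeps: int = 10_000):
    """Gauss-Seidel value iteration; assumes the destination is cost-free and absorbing."""
    J = {i: 0.0 for i in model}
    J[destination] = 0.0
    for _ in range(max_sweeps):
        delta = 0.0
        for i, controls in model.items():
            if i == destination:
                continue
            # Bellman update: best expected cost over the controls available at i.
            best = min(expected_cost(o, J) for o in controls.values())
            delta = max(delta, abs(best - J[i]))
            J[i] = best
        if delta < tol:
            break
    # Greedy control at each node with respect to the converged costs.
    policy = {
        i: min(controls, key=lambda u: expected_cost(controls[u], J))
        for i, controls in model.items()
        if i != destination
    }
    return J, policy


# Toy instance (assumed for illustration): from A, either take a long direct arc
# to t, or try a short arc to B that fails 10% of the time and leaves us at A.
model: Model = {
    "A": {
        "direct": [("t", 1.0, 4.0)],
        "via_B": [("B", 0.9, 1.0), ("A", 0.1, 1.0)],
    },
    "B": {"go": [("t", 1.0, 2.0)]},
}

J, policy = value_iteration(model, destination="t")
```

On this toy instance the control "via_B" satisfies J(A) = 2.8 + 0.1 J(A), giving an expected cost of 28/9 ≈ 3.11, which beats the deterministic direct arc of length 4. The selected successor is thus random, and only the distribution over successors is chosen.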

Author information

Authors and Affiliations

Corp. Strategic Res. ExxonMobil Res. & Engin., Annandale, New Jersey, 08801, USA
Ioannis P. Androulakis

Editor information

Editors and Affiliations

Dept. Chemical Engin., Princeton Univ., Princeton, NJ, 08544-5263, USA
Christodoulos A. Floudas
Center for Applied Optim. Dept. Industrial and Systems Engin., Univ. Florida, Gainesville, FL, 32611, USA
Panos M. Pardalos

Cite this entry

Androulakis, I.P. (2001). Dynamic Programming: Stochastic Shortest Path Problems. In: Floudas, C.A., Pardalos, P.M. (eds) Encyclopedia of Optimization. Springer, Boston, MA. https://doi.org/10.1007/0-306-48332-7_113

DOI: https://doi.org/10.1007/0-306-48332-7_113
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-7923-6932-5
Online ISBN: 978-0-306-48332-5
eBook Packages: Springer Book Archive
