Skip to main content

Dynamic Programming: Stochastic Shortest Path Problems

  • Reference work entry
Encyclopedia of Optimization
  • 253 Accesses

The shortest path problem is considered to be one of the classical and most important combinatorial optimization problems. Given a directed graph and a length α ij for each arc (i, j), the problem is to find a path of minimum length that leads from any node i to a node t, called the destination node. So, for each node i, we need to optimally identify a successor node u(i) so as to reach the destination at the minimum sum of arc lengths over all paths that start at i and terminate at t. Of particular relevance is, in the area of distributed computation, the problem of data routing within a computer communication network. In such a case, the cost associated with a particular link (i, j) is related to an average delay. The stochastic shortest path problem is a generalization whereby for each node i we must select a probability distribution over all possible successor nodes j out of a given set of probability distributions p ij (u), parameterized by a control u ∈ U(i). Clearly, the path...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 1,699.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bertsekas, D. P.: Dynamic programming and optimal control, Athena Sci., 1995.

    Google Scholar 

  2. Bertsekas, D. P., and Tsitsiklis, J. N.: Neurodynamic programming, Athena Sci., 1997.

    Google Scholar 

  3. Eaton, J. H., and Zadeh, L. A.: ‘Optimal pursuit strategies in discrete state probabilistic systems’, Trans. ASME Ser. D.J. Basic Engin.84 (1962), 23–29.

    MathSciNet  Google Scholar 

  4. Littman, M. L.: ‘Algorithms for sequential decision making’, PhD Thesis Brown Univ. (1996).

    Google Scholar 

  5. Psaraftis, H. N., and Tsitsiklis, J. N.: ‘Dynamic shortest paths in acyclic networks with markovian arc costs’, Oper. Res.41 (1993), 91–101.

    MathSciNet  MATH  Google Scholar 

  6. Puterman, M. L.: Markov decision processes-Discrete stochastoc dynamic programming, Wiley, 1994.

    Google Scholar 

  7. Sutton, R. S.: ‘Learning to predict by the method of temporal differences’, Machine Learning3 (1988), 9–44.

    Google Scholar 

  8. Tsitsiklis, J. N.: ‘Asynchronous stochastic aggregation and Q-learning’, Machine Learning16 (1994), 185–202.

    MATH  Google Scholar 

  9. Watkins, C. J.: ‘Learning from delayed rewards’, PhD Thesis Cambridge Univ. (1989).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Kluwer Academic Publishers

About this entry

Cite this entry

Androulakis, I.P. (2001). Dynamic Programming: Stochastic Shortest Path Problems . In: Floudas, C.A., Pardalos, P.M. (eds) Encyclopedia of Optimization. Springer, Boston, MA. https://doi.org/10.1007/0-306-48332-7_113

Download citation

  • DOI: https://doi.org/10.1007/0-306-48332-7_113

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-7923-6932-5

  • Online ISBN: 978-0-306-48332-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics