Skip to main content

Approximate Dynamic Programming (ADP)

  • Reference work entry
  • First Online:
Encyclopedia of Systems and Control
  • 211 Accesses

Abstract

Approximate dynamic programming (ADP or RLADP) includes a wide variety of general methods to solve for optimal decision and control in the face of complexity, nonlinearity, stochasticity, and/or partial observability. This entry first reviews methods and a few key applications across decision and control engineering (e.g., vehicle and logistics control), computer science (e.g., AlphaGo), operations research, and connections to economics, neuropsychology, and animal behavior. Then it summarizes a sixfold mathematical taxonomy of methods in use today, with pointers to the future.

Paul J. Werbos has retired.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 1,699.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 1,999.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Bibliography

  • Lewis FL, Liu D (eds), (2013) Reinforcement learning and approximate dynamic programming for feedback control, vol 17. Wiley (IEEE Series), New York

    Google Scholar 

  • Werbos PJ (2005) Backwards differentiation in AD and neural nets: past links and new opportunities. In: Bucker M, Corliss G, Hovland P, Naumann, Norris B (eds) Automatic differentiation: applications, theory and implementations. Springer, New York

    Google Scholar 

  • Werbos PJ (2014) Werbos, from ADP to the brain: foundations, roadmap, challenges and research priorities. In: Proceedings of the international joint conference on neural networks 2014. IEEE, New Jersey. https://arxiv.org/abs/1404.0554

  • Werbos PJ, Davis JJ, (2016) Regular cycles of forward and backward signal propagation in prefrontal cortex and in consciousness. Front Syst Neurosci 10:97. https://doi.org/10.3389/fnsys.2016.00097

    Article  Google Scholar 

  • White DA, Sofge DA (eds) (1992) Handbook of intelligent control: neural, fuzzy, and adaptive approaches. Van Nostrand Reinhold, New York

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Paul J. Werbos .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this entry

Check for updates. Verify currency and authenticity via CrossMark

Cite this entry

Werbos, P.J. (2021). Approximate Dynamic Programming (ADP). In: Baillieul, J., Samad, T. (eds) Encyclopedia of Systems and Control. Springer, Cham. https://doi.org/10.1007/978-3-030-44184-5_100096

Download citation

Publish with us

Policies and ethics