Abstract
This article describes results on the existence of optimal and nearly optimal policies for Markov Decision Processes (MDPs) with total expected discounted rewards. The problem of optimizing total expected discounted rewards for MDPs is also known as discounted dynamic programming.
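
For orientation, the criterion named in the abstract can be written in standard textbook form. The notation below is an assumption chosen for illustration, not taken from the article: $x_t$ and $a_t$ denote the state and action at step $t$, $r$ the one-step reward function, $\beta \in [0,1)$ the discount factor, and $\pi$ a policy.

```latex
% Total expected discounted reward of policy \pi from initial state x
% (assumed notation; the article may use different symbols)
v^{\pi}(x) = \mathbb{E}_{x}^{\pi} \sum_{t=0}^{\infty} \beta^{t}\, r(x_t, a_t),
\qquad
V(x) = \sup_{\pi} v^{\pi}(x).

% \pi is optimal if v^{\pi}(x) = V(x) for all x;
% \pi is \varepsilon-optimal (nearly optimal) if
% v^{\pi}(x) \ge V(x) - \varepsilon for all x.
```

In this notation, an existence result states conditions under which the supremum defining $V$ is attained by some policy, or approached within any $\varepsilon > 0$.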
| Original language | English |
|---|---|
| Title of host publication | Wiley Encyclopedia of Operations Research and Management Science |
| Publisher | Wiley |
| Pages | 1-8 |
| Number of pages | 8 |
| ISBN (Electronic) | 9780470400531 |
| ISBN (Print) | 9780470400630 |
| DOIs | |
| State | Published - Jan 1 2010 |
Keywords
- discounted rewards
- dynamic programming
- Markov decision process
- optimal policy
- reward function
Fingerprint
Dive into the research topics of 'Total Expected Discounted Reward MDPs: Existence of Optimal Policies'. Together they form a unique fingerprint.