Abstract
This paper addresses constrained Markov decision processes, with expected discounted total cost criteria, which are controlled by non-randomized policies. A dynamic programming approach is used to construct optimal policies. The convergence of the series of finite horizon value functions to the infinite horizon value function is also shown. A simple example illustrating an application is presented.
| Original language | English |
|---|---|
| Pages (from-to) | 165-179 |
| Number of pages | 15 |
| Journal | Mathematical Methods of Operations Research |
| Volume | 66 |
| Issue number | 1 |
| DOIs | |
| State | Published - Aug 2007 |
Keywords
- Constrained Markov
- Decision processes
- Dynamic programming
- Non-randomized policies
Fingerprint
Dive into the research topics of 'Non-randomized policies for constrained Markov decision processes'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver