Abstract
This note considers an average-cost Markov Decision Process (MDP) with finite state and action sets and satisfying the additional condition that there is a state to which the system jumps from any state and under any action with a positive probability. The main result is that the policy iteration algorithm is strongly polynomial for such MDPs, which are often used to model replacement and maintenance problems.
| Original language | English |
|---|---|
| Pages (from-to) | 249-251 |
| Number of pages | 3 |
| Journal | Operations Research Letters |
| Volume | 41 |
| Issue number | 3 |
| DOIs | |
| State | Published - May 2013 |
Keywords
- Average cost
- Linear program
- Markov Decision Process
- Policy iteration algorithm
- Simplex method
- Strongly polynomial
Fingerprint
Dive into the research topics of 'Strong polynomiality of policy iterations for average-cost MDPs modeling replacement and maintenance problems'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver