Abstract
This paper describes the structure of optimal policies for infinite-state Markov Decision Processes with setwise continuous transition probabilities. The action sets may be noncompact. The objective criteria are either the expected total discounted and undiscounted costs or average costs per unit time. The analysis of optimality equations and inequalities is based on the optimal selection theorem for inf-compact functions introduced in this paper.
| Original language | English |
|---|---|
| Pages (from-to) | 734-740 |
| Number of pages | 7 |
| Journal | Operations Research Letters |
| Volume | 49 |
| Issue number | 5 |
| DOIs | |
| State | Published - Sep 2021 |
Keywords
- Average cost per unit time
- Markov decision process
- Optimal selection theorem
- Total discounted cost
Fingerprint
Dive into the research topics of 'MDPs with setwise continuous transition probabilities'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver