Abstract
The paper studies optimization of average-reward continuous-time finite state and action Markov Decision Processes with multiple criteria and constraints. Under the standard unichain assumption, we prove the existence of optimal K-switching strategies for feasible problems with K constraints. For switching randomized strategies, the decisions depend on the current state and the the time spent in the current state after the last jump. For stationary strategies, these functions do not depend on sojourn times, i.e. they are constant in time. For K-switching strategies, these functions are piece-wise constant and the total number of jumps is limited by K. If there is no absorbing states, there exist also optimal K-randomized policies. We consider the linear programming approach and provide algorithms for calculations of optimal policies.
| Original language | English |
|---|---|
| Pages (from-to) | 3805-3810 |
| Number of pages | 6 |
| Journal | Proceedings of the IEEE Conference on Decision and Control |
| Volume | 4 |
| State | Published - 2002 |
| Event | 41st IEEE Conference on Decision and Control - Las Vegas, NV, United States Duration: Dec 10 2002 → Dec 13 2002 |
Fingerprint
Dive into the research topics of 'Optimal control of average reward constrained continuous-time finite Markov decision processes'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver