Skip to main navigation Skip to search Skip to main content

Optimal control of average reward constrained continuous-time finite Markov decision processes

Research output: Contribution to journalConference articlepeer-review

22 Scopus citations

Abstract

The paper studies optimization of average-reward continuous-time finite state and action Markov Decision Processes with multiple criteria and constraints. Under the standard unichain assumption, we prove the existence of optimal K-switching strategies for feasible problems with K constraints. For switching randomized strategies, the decisions depend on the current state and the the time spent in the current state after the last jump. For stationary strategies, these functions do not depend on sojourn times, i.e. they are constant in time. For K-switching strategies, these functions are piece-wise constant and the total number of jumps is limited by K. If there is no absorbing states, there exist also optimal K-randomized policies. We consider the linear programming approach and provide algorithms for calculations of optimal policies.

Original languageEnglish
Pages (from-to)3805-3810
Number of pages6
JournalProceedings of the IEEE Conference on Decision and Control
Volume4
StatePublished - 2002
Event41st IEEE Conference on Decision and Control - Las Vegas, NV, United States
Duration: Dec 10 2002Dec 13 2002

Fingerprint

Dive into the research topics of 'Optimal control of average reward constrained continuous-time finite Markov decision processes'. Together they form a unique fingerprint.

Cite this