Skip to main navigation Skip to search Skip to main content

Note on the existence of optimal policies in total reward dynamic programs with compact action sets

  • Universidad Autonóma Agraria Antonio Narro
  • Universidad Autónoma Metropolitana

Research output: Contribution to journalArticlepeer-review

4 Scopus citations

Abstract

This work deals with Markov decision processes (MDPs) with expected total rewards, discrete state spaces, and compact action sets. Within this framework, a question on the existence of optimal stationary policies, formulated by Puterman (1994, p. 326), is considered. The paper concerns the possibility of obtaining an affirmative answer when additional assumptions are imposed on the decision model. Three conditions ensuring the existence of average optimal stationary policies in finite-state MDPs are analyzed, and it is shown that only the so-called structural continuity condition is a natural sufficient assumption under which the existence of total-reward optimal stationary policies can be guaranteed. In particular, this existence result holds for unichain MDPs with finite state space, but an example is provided to show that this general conclusion does not have an extension to the denumerable state space case.

Original languageEnglish
Pages (from-to)657-666
Number of pages10
JournalMathematics of Operations Research
Volume25
Issue number4
DOIs
StatePublished - 2000

Fingerprint

Dive into the research topics of 'Note on the existence of optimal policies in total reward dynamic programs with compact action sets'. Together they form a unique fingerprint.

Cite this