Compactness of the space of non-randomized policies in countable-state sequential decision processes

  • Naval Research Laboratory

Research output: Contribution to journal › Article › peer-review

2 Scopus citations

Abstract

For sequential decision processes with countable state spaces, we prove compactness of the set of strategic measures corresponding to nonrandomized policies. For the Borel state case, this set may not be compact (Piunovskiy, Optimal control of random sequences in problems with constraints. Kluwer, Boston, p. 170, 1997) in spite of compactness of the set of strategic measures corresponding to all policies (Schäl, On dynamic programming: compactness of the space of policies. Stoch Processes Appl 3(4):345-364, 1975b; Balder, On compactness of the space of policies in stochastic dynamic programming. Stoch Processes Appl 32(1):141-150, 1989). We use the compactness result from this paper to show the existence of optimal policies for countable-state constrained optimization of expected discounted and nonpositive rewards, when the optimality is considered within the class of nonrandomized policies. This paper also studies the convergence of a value-iteration algorithm for such constrained problems.

Original language: English
Pages (from-to): 307-323
Number of pages: 17
Journal: Mathematical Methods of Operations Research
Volume: 71
Issue number: 2
State: Published - Apr 2010

Keywords

  • Compactness
  • Markov decision processes
  • Non-randomized policies
