Skip to main navigation Skip to search Skip to main content

Optimality conditions for Partially Observable Markov Decision Processes

  • National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

3 Scopus citations

Abstract

This note describes sufficient conditions for the existence of optimal policies for Partially Observable Markov Decision Processes (POMDPs). The objective criterion is either minimization of total discounted costs or minimization of total nonnegative costs. It is well-known that a POMDP can be reduced to a Completely Observable Markov Decision Process (COMDP) with the state space being the sets of believe probabilities for the POMDP. Thus, a policy is optimal in POMDP if and only if it corresponds to an optimal policy in the COMDP. Here we provide sufficient conditions for the existence of optimal policies for COMDP and therefore for POMDP.

Original languageEnglish
Title of host publicationContinuous and Distributed Systems
Subtitle of host publicationTheory and Applications
PublisherSpringer Verlag
Pages251-264
Number of pages14
ISBN (Print)9783319031453
DOIs
StatePublished - 2014

Publication series

NameSolid Mechanics and its Applications
Volume211
ISSN (Print)0925-0042

Fingerprint

Dive into the research topics of 'Optimality conditions for Partially Observable Markov Decision Processes'. Together they form a unique fingerprint.

Cite this