Skip to main navigation Skip to search Skip to main content

Total Expected Discounted Reward MDPS: Existence of Optimal Policies

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

4 Scopus citations

Abstract

This article describes the results on the existence of optimal and nearly optimal policies for Markov Decision Processes (MDPs) with total expected discounted rewards. The problem of optimization of total expected discounted rewards for MDPs is also known under the name of discounted dynamic programming.

Original languageEnglish
Title of host publicationWiley Encyclopedia of Operations Research and Management Science
Publisherwiley
Pages1-8
Number of pages8
ISBN (Electronic)9780470400531
ISBN (Print)9780470400630
DOIs
StatePublished - Jan 1 2010

Keywords

  • discounted rewards
  • dynamic programming
  • Markov decision process
  • optimal policy
  • reward function

Fingerprint

Dive into the research topics of 'Total Expected Discounted Reward MDPS: Existence of Optimal Policies'. Together they form a unique fingerprint.

Cite this