Skip to main navigation Skip to search Skip to main content

Markov decision processes

  • Hyeong Soo Chang
  • , Jiaqiao Hu
  • , Michael C. Fu
  • , Steven I. Marcus
  • Sogang University
  • University of Maryland, College Park

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

2 Scopus citations

Abstract

We provide a formal description of the discounted reward MDP framework in Chap. 1, including both the finite- and the infinite-horizon settings and summarizing the associated optimality equations. We then present the well-known exact solution algorithms, value iteration and policy iteration, and outline a framework of rolling-horizon control (also called receding-horizon control) as an approximate solution methodology for solving MDPs, in conjunction with simulation-based approaches covered later in the book. We conclude with a brief survey of other recently proposed MDP solution techniques designed to break the curse of dimensionality.

Original languageEnglish
Title of host publicationCommunications and Control Engineering
PublisherSpringer International Publishing
Pages1-17
Number of pages17
Edition9781447150213
DOIs
StatePublished - 2013

Publication series

NameCommunications and Control Engineering
Number9781447150213
ISSN (Print)0178-5354
ISSN (Electronic)2197-7119

Keywords

  • Convolution
  • Entropy
  • Prefix

Fingerprint

Dive into the research topics of 'Markov decision processes'. Together they form a unique fingerprint.

Cite this