A note on policy algorithms for discounted Markov decision problems

Michael K. Ng*

*Corresponding author for this work

Research output: Contribution to journalJournal articlepeer-review

4 Citations (Scopus)

Abstract

The evaluation phase in the policy iteration algorithm for the infinite horizon discounted Markov decision problem is presented, which can be done in O(mN2) operations, where N is the number of states of the Markov decision process and m is the number of states in which the decision changes during the policy improvement phase.

Original languageEnglish
Pages (from-to)195-197
Number of pages3
JournalOperations Research Letters
Volume25
Issue number4
DOIs
Publication statusPublished - Nov 1999

Scopus Subject Areas

  • Software
  • Management Science and Operations Research
  • Industrial and Manufacturing Engineering
  • Applied Mathematics

User-Defined Keywords

  • Discounted Markov decision process
  • Policy algorithm
  • Matrices

Fingerprint

Dive into the research topics of 'A note on policy algorithms for discounted Markov decision problems'. Together they form a unique fingerprint.

Cite this