BIGpedia.com - Markov decision process - Encyclopedia and Dictionary Online
encyclopedia search

Markov decision process

A Markov Decision Process (MDP) is a discrete time stochastic control process characterized by a set of states, actions, and transition probability matrices that depend on the actions chosen within a given state.

MDPs are useful for studying a wide range of optimization problems solved via dynamic programming and reinforcement learning.

References

  • Bellman, R. E. Dynamic Programming. Princeton University Press, Princeton, NJ.
  • M. L. Puterman. Markov Decision Processes. Wiley, 1994.

External links



The contents of this article are licensed from Wikipedia.org under the GNU Free Documentation License.
How to see transparent copy

01-04-2007 01:21:04