![]() |
|
|||||||||||||||||
Markov decision processA Markov Decision Process (MDP) is a discrete time stochastic control process characterized by a set of states, actions, and transition probability matrices that depend on the actions chosen within a given state. MDPs are useful for studying a wide range of optimization problems solved via dynamic programming and reinforcement learning. References
External links
The contents of this article are licensed from Wikipedia.org under the GNU Free Documentation License.
How to see transparent copy 01-04-2007 01:21:04 |
|





