Home
News
Members
Projects
Publications
Contact
Light
Dark
Automatic
Reinforcement Learning
Acting in Delayed Environments with Non-Stationary Markov Policies
The standard Markov Decision Process (MDP) formulation hinges on the assumption that an action is executed immediately after it was chosen. However, assuming it is often unrealistic and can lead to catastrophic failures in applications such as …
«
Cite
×