Global Convergence of Policy Gradient in Average Reward MDPs
Navdeep Kumar, Yashaswini Murthy, Itai Shufaro, Kfir Yehuda Levy, R. Srikant, Shie Mannor
May 2025
Machine Learning
Type: Conference paper
Publication: ICLR 2025
Reinforcement Learning
Theory
Related
On the Convergence of Single-Timescale Actor-Critic
Non-rectangular Robust MDPs with Normed Uncertainty Sets
Policy Optimized Text-to-Image Pipeline Design
State Entropy Regularization for Robust Reinforcement Learning
Policy Gradient via Tree Expansion