Global Convergence of Policy Gradient in Average Reward MDPs

Publication
ICLR 2025

Related