Search

Home
News
Members
Projects
Publications
Contact

Light Dark Automatic

Itai Shufaro

Latest

Global Convergence of Policy Gradient in Average Reward MDPs
On Bits and Bandits: Quantifying the Regret-Information Trade-off

© 2026 NVIDIA Corporation. All rights reserved.

Privacy Policy — Your Privacy Choices — Terms of Service — Accessibility — Corporate Policies — Contact
Published with Wowchemy — the free, open source website builder that empowers creators.

Cite