Gal Dalal*
Latest
Monotone and Conservative Policy Iteration Beyond the Tabular Case
Gradient Boosting Reinforcement Learning
Policy Gradient via Tree Expansion
SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search
Planning and Learning with Adaptive Lookahead
Reinforcement Learning with a Terminator
Reinforcement Learning for Datacenter Congestion Control
Improve Agents without Retraining: Parallel Tree Search with Off-Policy Correction
Acting in Delayed Environments with Non-Stationary Markov Policies