Home
News
Members
Publications
NVIDIA Research
Light
Dark
Automatic
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
Mingjie Liu
,
Shizhe Diao
,
Ximing Lu
,
Jian Hu
,
Xin Dong
,
Yejin Choi
,
Jan Kautz
,
Yi Dong
December 2025
Cite
arXiv
Type
Conference paper
Publication
Advances in Neural Information Processing Systems (NeurIPS)
Shizhe Diao
Xin Dong
Jan Kautz
Team Leader
Related
ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge
Cite
×