NVIDIA Research
RLP: Reinforcement Learning Pre-training
Ali Hatamizadeh, Syeda Nahida Akter, Shrimai Prabhumoye, Jan Kautz, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro, Yejin Choi
April 2026
Type: Conference paper
Publication: International Conference on Learning Representations (ICLR)
Ali Hatamizadeh
Jan Kautz, Team Leader
Related
An Empirical Study of Mamba-based Language Models
Compact Language Models via Pruning and Knowledge Distillation
Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning