Think Smart About Sparse Compute: LatentMoE for Higher Accuracy per FLOP and per Parameter
Published:
Published:
Published:
We scale up cascaded reinforcement learning (Cascade RL) to develop general purpose reasoning models, Nemotron-Cascade, capable of operating in both instruct and deep thinking modes. Our 14B model can outperform its SFT teacher and achieves silver-medal performance in IOI 2025.