Home
News
Members
Publications
NVIDIA Research
Light
Dark
Automatic
Xin Dong
NVIDIA
Interests
LLM/VLM
Low-Cost AI
Latest
CLIMB: Clustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
Fast-SLM: Towards Latency-Optimal Hybrid Small Language Models
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models
Hymba: A Hybrid-head Architecture for Small Language Models
LongMamba: Enhancing Mamba's Long-Context Capabilities via Training-Free Receptive Field Enlargement
Privacy Vulnerability of Split Computing to Data-Free Model Inversion Attacks
Cite
×