Home
News
Members
Publications
NVIDIA Research
Light
Dark
Automatic
Greg Heinrich
NVIDIA
Interests
DL Model Architectures
DL Runtime Efficiency
Latest
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models
Flextron: Many-in-One Flexible Large Language Model
AM-RADIO: Agglomerative Model - Reduce All Domains Into One
FasterViT: Fast Vision Transformers with Hierarchical Attention
Global Context Vision Transformers
Cite
×