Efficient AI
Efficient AI
News
Publications
Light
Dark
Automatic
Shang Yang
Latest
NVILA: Efficient Frontier Visual Language Models
LongVILA: Scaling Long-Context Visual Language Models for Long Videos
Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer
Cite
×