Efficient AI
Efficient AI
News
Members
Publications
Light
Dark
Automatic
Shang Yang
Latest
Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models
DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads
FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer
Cite
×