Efficient AI
Sparse
XAttention: Block Sparse Attention with Antidiagonal Scoring
Long-Context Transformer Models (LCTMs) are vital for real-world applications but suffer high computational costs due to attention’s …
Ruyi Xu, Guangxuan Xiao, Haofeng Huang, Junxian Guo, Song Han
Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Diffusion Transformers (DiTs) dominate video generation but their high computational cost severely limits real-world applicability, …
Haocheng Xi, Shuo Yang, Yilong Zhao, Chenfeng Xu, Muyang Li, Xiuyu Li, Yujun Lin, Han Cai, Jintao Zhang, Dacheng Li, Jianfei Chen, Ion Stoica, Kurt Keutzer, Song Han