Efficient AI
Efficient AI
News
Publications
Light
Dark
Automatic
Haisheng Chen
Latest
ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference
Cite
×