AVSEC

Leveraging Mamba with Full-Face Vision for Audio-Visual Speech Enhancement

Recent Mamba-based models have shown promise in speech enhancement by efficiently modeling long-range temporal dependencies. However, models like Speech Enhancement Mamba (SEMamba) remain limited to single-speaker scenarios and struggle in complex …