MMAU-Pro: A Challenging and Comprehensive Benchmark for Holistic Evaluation of Audio General Intelligence Paper ⢠2508.13992 ⢠Published Aug 19 ⢠7
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities Paper ⢠2503.03983 ⢠Published Mar 6 ⢠26
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark Paper ⢠2410.19168 ⢠Published Oct 24, 2024 ⢠23
Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation Paper ⢠2410.13198 ⢠Published Oct 17, 2024 ⢠10
ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds Paper ⢠2409.09213 ⢠Published Sep 13, 2024 ⢠13
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities Paper ⢠2406.11768 ⢠Published Jun 17, 2024 ⢠24