tiantiaf/voxlect-spanish-dialect-whisper-large-v3 Audio Classification • 2B • Updated Aug 10, 2025 • 149 • 5
Rethinking Visual Privacy: A Compositional Privacy Risk Framework for Severity Assessment with VLMs Paper • 2603.21573 • Published Mar 23 • 1
CPRT Collection Compositional Privacy Risk Taxonomy: Benchmark and Models • 5 items • Updated Mar 30 • 1
FireRedASR2S: A State-of-the-Art Industrial-Grade All-in-One Automatic Speech Recognition System Paper • 2603.10420 • Published Mar 11 • 6
Accent Vector: Controllable Accent Manipulation for Multilingual TTS Without Accented Data Paper • 2603.07534 • Published Mar 8 • 5
AlexXu811/child-adult-joint-asr-diarization Automatic Speech Recognition • 0.2B • Updated Jan 31 • 121 • 2