Running on A100 208 Omnilingual ASR Media Transcription 🌍 208 Transcribe audio or video into text in multiple languages
facebook/dinov3-vitb16-pretrain-lvd1689m Image Feature Extraction • 85.7M • Updated Aug 19 • 321k • 86
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated about 22 hours ago • 399k • 1.55k