microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • Updated • 403k • 1.54k
Try on clothes virtually by uploading images
Upgraded to v1.0!
Scalable and Versatile 3D Generation from images
Audio Conditioned LipSync with Latent Diffusion Models
View AI model releases for 2024