Answer questions about images and text conversations
Extract and recognize text from images and PDFs
Convert text to speech with custom voices and cloning
Lightning-Fast, On-Device, Multilingual TTS
Real-time object detection & pose estimation in your browser
Pocket TTS optimized for Hugging Face Spaces on CPU