FastVLM WebGPU
Real-time video captioning powered by FastVLM
Zero SQL
Generate a transcript with speaker identification from an audio file
Fight AI models with prompts
OmniParser, turn your LLM into GUI agent
Testing Multimodal Gemini