openai/clip-vit-large-patch14 Zero-Shot Image Classification • 0.4B • Updated Sep 15, 2023 • 8.53M • 1.92k
microsoft/Phi-3-vision-128k-instruct Text Generation • 4B • Updated Aug 20, 2024 • 17.7k • 970
Running on Zero • MCP • Featured • Multimodal OCR2 • 139 • nanonets ocr / smoldocling / monkey ocr / typhoon ocr
docling-project/SmolDocling-256M-preview Image-Text-to-Text • 0.3B • Updated Sep 17 • 138k • 1.6k
Running on Zero • Explainable-Vision-Language-Model • 16 • Generate a video visualizing how a model attends to an image while generating text
google/vit-base-patch16-224 Image Classification • 86.6M • Updated Sep 5, 2023 • 3.95M • 906