Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

2,379

Full-text search

Active filters: multimodal

microsoft/Fara-7B

Image-Text-to-Text • 8B • Updated 7 days ago • 32.3k • 428

jinaai/jina-vlm

Image-Text-to-Text • 2B • Updated 3 days ago • 1.17k • 41

stepfun-ai/GELab-Zero-4B-preview

Image-to-Text • 4B • Updated 8 days ago • 703 • 92

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 3.34M • • 1.38k

Qwen/Qwen3-Omni-30B-A3B-Instruct

Any-to-Any • 35B • Updated Sep 22 • 283k • 744

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • 4B • Updated Apr 6 • 7.69M • 571

bytedance-research/Vidi-7B

9B • Updated 18 days ago • 462 • 8

OctoMed/OctoMed-7B

Image-Text-to-Text • 8B • Updated 2 days ago • 135 • 8

jinaai/jina-clip-v2

Feature Extraction • 0.9B • Updated Apr 28 • 197k • 297

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18 • 183k • 444

ZJU-AI4H/Hulu-Med-4B

Image-Text-to-Text • 5B • Updated 12 days ago • 2.22k • 9

xuemduan/reevaluate-clip

0.4B • Updated 3 days ago • 107 • 6

Cognitive-Lab/NetraEmbed

Visual Document Retrieval • 4B • Updated about 14 hours ago • 257 • 5

Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 139k • 1.83k

ZJU-AI4H/Hulu-Med-7B

Image-Text-to-Text • 8B • Updated 12 days ago • 7.37k • 46

ZJU-AI4H/Hulu-Med-14B

Image-Text-to-Text • 15B • Updated 12 days ago • 10.7k • 42

omlab/VLM-FO1_Qwen2.5-VL-3B-v01

Object Detection • 4B • Updated 11 days ago • 1.92k • 10

stepfun-ai/Step1X-Edit

Image-to-Image • Updated Jul 9 • 131 • 326

OpenGVLab/VideoChat-R1_5-7B

Video-Text-to-Text • 8B • Updated Oct 2 • 10.1k • 10

cpatonn/Qwen3-Omni-30B-A3B-Instruct-AWQ-4bit

Any-to-Any • 10B • Updated Sep 28 • 24.2k • 32

IDEA-Research/Rex-Omni

Image-Text-to-Text • 4B • Updated Oct 16 • 27k • 46

ByteDance/Dolphin-1.5

Image-Text-to-Text • 0.4B • Updated 27 days ago • 1.57k • 31

etri-vilab/SafeGem-12B

Image-Text-to-Text • 12B • Updated 21 days ago • 16 • 3

etri-vilab/SafeGem-27B

Image-Text-to-Text • 27B • Updated 21 days ago • 18 • 3

etri-vilab/SafeQwen2.5-VL-7B

Image-Text-to-Text • 8B • Updated 21 days ago • 22 • 3

etri-vilab/SafeQwen2.5-VL-32B

Image-Text-to-Text • 33B • Updated 21 days ago • 18 • 3

etri-vilab/SafeLLaVA-13B

Image-Text-to-Text • 13B • Updated 15 days ago • 28 • 3

etri-vilab/SafeLLaVA-7B

Image-Text-to-Text • 7B • Updated 4 days ago • 71 • 3

thesby/Qwen3-VL-8B-NSFW-Caption-V4.5

Image-to-Text • 9B • Updated Nov 7 • 15.9k • 44

Qwen/Qwen2-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Jan 12 • 2.02M • 472