Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
sentence-transformers
Safetensors
ONNX
GGUF
Transformers.js
MLX
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 12
Inference Providers
Groq
Novita
Nebius AI
Cerebras
SambaNova
Nscale
fal
Hyperbolic
+ 11
Apply filters
Models
5,988
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
google/gemma-3n-E2B-it-litert-preview
Image-Text-to-Text
•
Updated
May 20
•
571
Mungert/Vintern-1B-v3_5-GGUF
Image-Text-to-Text
•
0.6B
•
Updated
Sep 24
•
57
unsloth/InternVL3-78B-GGUF
Image-Text-to-Text
•
73B
•
Updated
May 19
•
389
•
1
lordChipotle/nutrition-label-detector
Image-Text-to-Text
•
9B
•
Updated
May 19
•
9
unsloth/InternVL3-1B-Instruct
Image-Text-to-Text
•
0.9B
•
Updated
May 19
•
28
unsloth/InternVL3-1B-Instruct-GGUF
Image-Text-to-Text
•
0.6B
•
Updated
May 19
•
132
unsloth/InternVL3-2B-Instruct
Image-Text-to-Text
•
2B
•
Updated
May 19
•
23
unsloth/InternVL3-2B-Instruct-GGUF
Image-Text-to-Text
•
2B
•
Updated
May 19
•
207
unsloth/InternVL3-8B-Instruct-GGUF
Image-Text-to-Text
•
8B
•
Updated
May 19
•
277
•
2
unsloth/InternVL3-14B-Instruct
Image-Text-to-Text
•
15B
•
Updated
May 19
•
14
unsloth/InternVL3-14B-Instruct-GGUF
Image-Text-to-Text
•
15B
•
Updated
May 19
•
487
•
4
TienAnh/Finetune_OCR_1B
Image-Text-to-Text
•
0.9B
•
Updated
May 22
•
29
•
1
FlashVL/FlashVL-2B-Dynamic-ISS
Image-Text-to-Text
•
3B
•
Updated
May 19
•
27
•
2
ByteDance/Dolphin
Image-Text-to-Text
•
0.4B
•
Updated
Jul 16
•
3.59k
•
506
kolerk/TON-3B-AITZ
Image-Text-to-Text
•
4B
•
Updated
Jul 14
•
15
Zkkkai/CPGD-7B
Image-Text-to-Text
•
8B
•
Updated
May 21
•
9
•
1
rp-yu/Dimple-7B
Image-Text-to-Text
•
8B
•
Updated
May 26
•
33
•
9
FlashVL/FlashVL-2B-Static
Image-Text-to-Text
•
2B
•
Updated
May 19
•
9
FlashVL/FlashVL-2B-Static-GRPO
Image-Text-to-Text
•
2B
•
Updated
May 19
•
7
•
1
rootonchair/Vintern-3B-R-beta-GGUF
Image-Text-to-Text
•
3B
•
Updated
May 19
•
190
unsloth/InternVL3-38B-Instruct-GGUF
Image-Text-to-Text
•
33B
•
Updated
May 19
•
269
•
2
unsloth/InternVL3-78B-Instruct-GGUF
Image-Text-to-Text
•
73B
•
Updated
May 20
•
320
•
1
ChenShawn/DeepEyes-7B
Image-Text-to-Text
•
8B
•
Updated
May 22
•
544
•
14
shakhizat/nanoVLM-222M
Image-Text-to-Text
•
0.2B
•
Updated
May 20
•
6
zhaode/FastVLM-0.5B-Stage2
Image-Text-to-Text
•
0.8B
•
Updated
May 20
•
21
•
1
zhaode/FastVLM-0.5B-Stage3
Image-Text-to-Text
•
0.8B
•
Updated
May 20
•
49
•
1
zhaode/FastVLM-1.5B-Stage2
Image-Text-to-Text
•
2B
•
Updated
May 20
•
8
zhaode/FastVLM-1.5B-Stage3
Image-Text-to-Text
•
2B
•
Updated
May 20
•
10
zhaode/FastVLM-7B-Stage2
Image-Text-to-Text
•
8B
•
Updated
May 21
•
9
ysn-rfd/gemma-3-4b-it-qat-int4-unquantized-Q8_0-GGUF
Image-Text-to-Text
•
4B
•
Updated
May 20
•
17
Previous
1
...
98
99
100
Next