-
-
-
-
-
-
Inference Providers
Active filters:
awq
QuantTrio/DeepSeek-V3.2-Speciale-AWQ
Text Generation
•
685B
•
Updated
•
96
•
5
hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4
Text Generation
•
2B
•
Updated
•
182k
•
80
IDEA-Research/Rex-Omni-AWQ
4B
•
Updated
•
1.95k
•
3
TheBloke/deepseek-coder-1.3b-instruct-AWQ
Text Generation
•
0.3B
•
Updated
•
113
•
4
TheBloke/saiga_mistral_7b-AWQ
Text Generation
•
1B
•
Updated
•
151
•
4
Qwen/Qwen2.5-32B-Instruct-AWQ
Text Generation
•
6B
•
Updated
•
1.2M
•
90
Qwen/Qwen2.5-Coder-7B-Instruct-AWQ
Text Generation
•
2B
•
Updated
•
506k
•
18
hugging-quants/gemma-2-9b-it-AWQ-INT4
Text Generation
•
2B
•
Updated
•
10.1k
•
7
Qwen/Qwen2.5-VL-72B-Instruct-AWQ
Image-Text-to-Text
•
13B
•
Updated
•
71.8k
•
69
Qwen/Qwen2.5-VL-7B-Instruct-AWQ
Image-Text-to-Text
•
3B
•
Updated
•
180k
•
94
Qwen/Qwen3-32B-AWQ
Text Generation
•
6B
•
Updated
•
130k
•
116
Qwen/Qwen3-14B-AWQ
Text Generation
•
3B
•
Updated
•
204k
•
46
Qwen/Qwen3-4B-AWQ
Text Generation
•
0.9B
•
Updated
•
155k
•
20
ReadyArt/Cydonia-24B-v3-AWQ
24B
•
Updated
•
15
•
1
QuantTrio/Qwen3-30B-A3B-Thinking-2507-AWQ-BF16Mix
Text Generation
•
9B
•
Updated
•
238
•
4
QuantTrio/Qwen3-30B-A3B-Thinking-2507-AWQ
Text Generation
•
5B
•
Updated
•
12.7k
•
2
twhitworth/gpt-oss-120b-awq-w4a16
117B
•
Updated
•
7.9k
•
16
QuantTrio/DeepSeek-V3.1-AWQ-Lite
Text Generation
•
Updated
•
42
•
3
QuantTrio/MiniMax-M2-AWQ
Text Generation
•
229B
•
Updated
•
8.72k
•
6
TheHouseOfTheDude/INTELLECT-3_Compressed-Tensors
Text Generation
•
Updated
•
18
•
1
QuantTrio/DeepSeek-V3.2-AWQ
Text Generation
•
685B
•
Updated
•
1.82k
•
2
casperhansen/mpt-7b-8k-chat-awq
Text Generation
•
Updated
•
20
•
3
casperhansen/falcon-7b-awq
Text Generation
•
Updated
•
10
•
1
casperhansen/vicuna-7b-v1.5-awq
Text Generation
•
Updated
•
12
•
3
casperhansen/vicuna-7b-v1.5-awq-gemv
Text Generation
•
Updated
•
10
•
1
casperhansen/mpt-7b-8k-chat-awq-gemv
Text Generation
•
Updated
•
6
casperhansen/opt-125m-awq
Text Generation
•
90.3M
•
Updated
•
792
•
3
casperhansen/tinyllama-1b-awq
Text Generation
•
Updated
•
73
Bomml/Llama-2-70B-chat-w4-g128-awq
Text Generation
•
Updated
TheBloke/Llama-2-7B-Chat-AWQ
Text Generation
•
1B
•
Updated
•
2.09k
•
24