Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

7,180

Full-text search

Active filters: awq

QuantTrio/DeepSeek-V3.2-Speciale-AWQ

Text Generation • 685B • Updated 7 days ago • 96 • 5

hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4

Text Generation • 2B • Updated Aug 7, 2024 • 182k • 80

IDEA-Research/Rex-Omni-AWQ

4B • Updated Oct 31 • 1.95k • 3

TheBloke/deepseek-coder-1.3b-instruct-AWQ

Text Generation • 0.3B • Updated Nov 9, 2023 • 113 • 4

TheBloke/saiga_mistral_7b-AWQ

Text Generation • 1B • Updated Nov 28, 2023 • 151 • 4

Qwen/Qwen2.5-32B-Instruct-AWQ

Text Generation • 6B • Updated Oct 9, 2024 • 1.2M • 90

Qwen/Qwen2.5-Coder-7B-Instruct-AWQ

Text Generation • 2B • Updated Nov 18, 2024 • 506k • 18

hugging-quants/gemma-2-9b-it-AWQ-INT4

Text Generation • 2B • Updated Oct 17, 2024 • 10.1k • 7

Qwen/Qwen2.5-VL-72B-Instruct-AWQ

Image-Text-to-Text • 13B • Updated Mar 7 • 71.8k • 69

Qwen/Qwen2.5-VL-7B-Instruct-AWQ

Image-Text-to-Text • 3B • Updated Apr 6 • 180k • 94

Qwen/Qwen3-32B-AWQ

Text Generation • 6B • Updated May 21 • 130k • 116

Qwen/Qwen3-14B-AWQ

Text Generation • 3B • Updated May 21 • 204k • 46

Qwen/Qwen3-4B-AWQ

Text Generation • 0.9B • Updated May 21 • 155k • 20

ReadyArt/Cydonia-24B-v3-AWQ

24B • Updated Jun 9 • 15 • 1

QuantTrio/Qwen3-30B-A3B-Thinking-2507-AWQ-BF16Mix

Text Generation • 9B • Updated Sep 5 • 238 • 4

QuantTrio/Qwen3-30B-A3B-Thinking-2507-AWQ

Text Generation • 5B • Updated Sep 5 • 12.7k • 2

twhitworth/gpt-oss-120b-awq-w4a16

117B • Updated Aug 19 • 7.9k • 16

QuantTrio/DeepSeek-V3.1-AWQ-Lite

Text Generation • Updated Sep 5 • 42 • 3

QuantTrio/MiniMax-M2-AWQ

Text Generation • 229B • Updated 8 days ago • 8.72k • 6

TheHouseOfTheDude/INTELLECT-3_Compressed-Tensors

Text Generation • Updated 11 days ago • 18 • 1

QuantTrio/DeepSeek-V3.2-AWQ

Text Generation • 685B • Updated 7 days ago • 1.82k • 2

casperhansen/mpt-7b-8k-chat-awq

Text Generation • Updated Nov 4, 2023 • 20 • 3

casperhansen/falcon-7b-awq

Text Generation • Updated Nov 4, 2023 • 10 • 1

casperhansen/vicuna-7b-v1.5-awq

Text Generation • Updated Oct 31, 2023 • 12 • 3

casperhansen/vicuna-7b-v1.5-awq-gemv

Text Generation • Updated Oct 31, 2023 • 10 • 1

casperhansen/mpt-7b-8k-chat-awq-gemv

Text Generation • Updated Oct 31, 2023 • 6

casperhansen/opt-125m-awq

Text Generation • 90.3M • Updated Oct 31, 2023 • 792 • 3

casperhansen/tinyllama-1b-awq

Text Generation • Updated Oct 31, 2023 • 73

Bomml/Llama-2-70B-chat-w4-g128-awq

Text Generation • Updated Sep 16, 2023

TheBloke/Llama-2-7B-Chat-AWQ

Text Generation • 1B • Updated Nov 9, 2023 • 2.09k • 24