# GemMaroc: Unlocking Darija Proficiency in LLMs with Minimal Data

Paper: [arXiv:2505.17082](https://arxiv.org/abs/2505.17082)
This repository contains quantized versions of Qwen2.5-7B-Instruct-darija in GGUF format for efficient inference.
| Quantization | Description | File Size | Use Case |
|---|---|---|---|
| `f16` | FP16 (no quantization) | 14531.95 MB | Best quality, largest size |
| `q8_0` | Q8_0 (8-bit quantization) | 7723.36 MB | Recommended balance of quality and size |
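
As a sanity check on these file sizes, here is a rough back-of-envelope estimate. This is a sketch, not repository metadata: the parameter count and bits-per-weight figures are approximations, and per-file metadata overhead is ignored.

```python
# Rough GGUF file-size estimate from parameter count and bits per weight.
# Assumption: ~7.62B parameters for Qwen2.5-7B; q8_0 stores 8-bit weights
# plus one fp16 scale per 32-weight block (~8.5 bits/weight).
PARAMS = 7.62e9

BITS_PER_WEIGHT = {"f16": 16.0, "q8_0": 8.5}

for name, bpw in BITS_PER_WEIGHT.items():
    size_mb = PARAMS * bpw / 8 / 1024**2  # bytes -> MiB
    print(f"{name}: ~{size_mb:,.0f} MB")
# Prints roughly 14,534 MB for f16 and 7,722 MB for q8_0,
# in line with the table above.
```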
```bash
# Download the desired quantization
wget https://huggingface.co/GemMaroc/Qwen2.5-7B-Instruct-darija-gguf/resolve/main/Qwen2.5-7B-Instruct-darija_ckpt-*_q8_0.gguf

# Run inference
./llama-cli -m Qwen2.5-7B-Instruct-darija_ckpt-*_q8_0.gguf -p "Your prompt here"
```
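
If you prefer an HTTP endpoint over the CLI, llama.cpp also ships a built-in server. A minimal sketch, assuming a recent llama.cpp build; the port is an arbitrary example:

```bash
# Serve the model over HTTP with llama.cpp's built-in server
./llama-server -m Qwen2.5-7B-Instruct-darija_ckpt-*_q8_0.gguf --port 8080
```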
```python
from llama_cpp import Llama

# Load the quantized model
llm = Llama(
    model_path="./Qwen2.5-7B-Instruct-darija_ckpt-*_q8_0.gguf",
    n_ctx=32768,   # Context length
    n_threads=8,   # Number of CPU threads
)

# Generate text
response = llm("Your prompt here", max_tokens=512)
print(response['choices'][0]['text'])
```
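
For an instruction-tuned checkpoint like this one, the chat API is usually a better fit than raw completion. A minimal sketch reusing the `llm` object loaded above; the Darija prompt is an illustrative example (roughly "Hi, how are you?"):

```python
# Chat-style inference via llama-cpp-python's OpenAI-compatible helper
response = llm.create_chat_completion(
    messages=[{"role": "user", "content": "السلام، كيداير؟"}],
    max_tokens=256,
)
print(response["choices"][0]["message"]["content"])
```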
Choosing a quantization:

- `f16`: largest file size, best quality
- `q8_0`: recommended
- `tq2_0` or `tq1_0`: smallest files

If you use this model, please cite the original GemMaroc paper:
```bibtex
@misc{skiredj2025gemmarocunlockingdarijaproficiency,
  title={GemMaroc: Unlocking Darija Proficiency in LLMs with Minimal Data},
  author={Abderrahman Skiredj and Ferdaous Azhari and Houdaifa Atou and Nouamane Tazi and Ismail Berrada},
  year={2025},
  eprint={2505.17082},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2505.17082},
}
```
Base model: [Qwen/Qwen2.5-7B](https://huggingface.co/Qwen/Qwen2.5-7B)