Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
DavidAU 
posted an update 5 days ago
Post
4508
Uncensored, Heretic, Qwen 3.6 27B GGUFs - Exceeds all quant metrics and core model metrics too.

Tuned 27B Heretic Uncensored quants from IQ2M to Q8.
IQ2M is 83% of BF16, with Q6 just under 98% of BF16 precision.
Q8: 98.47% of BF16 precision.
NEO/Code DI-Imatrix Quants.

Exceeds all 5 metrics for "censored" quants too.

All metrics posted.

Tuned model -from which the quants were built- also exceeds Qwen 3.6 27B core metrics too.

DavidAU/Qwen3.6-27B-Heretic-Uncensored-FINETUNE-NEO-CODE-Di-IMatrix-MAX-GGUF

THANKS for sharing this ! any plan for the Qwen 3.6 35B A3B Variant ?

·

I will add to the list; may wait for specific Heretic and/or tuned version.

I already have a 43B-A3B version running in the lab ; however tuning these sparse moe models take a lot more work/time and ahh... detail. AND a lot more VRAM!!! [can't compress these atm, so BF16 required => 100 GB+ ]

In this post