view article Article Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs 24 days ago โข 8
view article Article 2x Faster on a 229B MoE: EAGLE3 Speculative Decoding for MiniMax-M2.5 17 days ago โข 3
cyankiwi/Qwen3-30B-A3B-Instruct-2507-AWQ-4bit Text Generation โข 5B โข Updated Mar 23 โข 36.5k โข 31