50 8 28

Zachary Mueller PRO

muellerzr

AI & ML interests

None yet

Recent Activity

liked a model 19 days ago

mlx-community/DeepSeek-V4-Pro-4bit

liked a model 21 days ago

openai/privacy-filter

liked a model 22 days ago

moonshotai/Kimi-K2.6

View all activity

Organizations

upvoted 2 collections about 1 year ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29, 2025 • 735

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 43 items • Updated Mar 2 • 720

upvoted a paper over 1 year ago

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published Oct 31, 2024 • 25

upvoted an article over 1 year ago

Article

Accelerate 1.0.0

muellerzr, marcsun13, BenjaminB

•

Sep 13, 2024

• 54

upvoted 2 articles almost 2 years ago

Article

BigCodeBench: The Next Generation of HumanEval

terryyz, ganler, SivilTaram, huybery, Muennighoff, dpfried, harmdevries, lvwerra, clefourrier

•

Jun 18, 2024

• 54

Article

From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate

mirinflim, aldopareja, muellerzr, stas

•

Jun 13, 2024

• 62

upvoted an article about 2 years ago

Article

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

yuxiang630, cassanof, ganler, YifengDing, StringChaos, harmdevries, lvwerra, arjunguha, lingming

•

Apr 29, 2024

• 79

upvoted a collection about 2 years ago

llama 3 self-align experiments

Collection

Replicating the pipeline for StarCoder-2 Instruct on Llama-3-8B with some tweaks https://huggingface.co/blog/sc2-instruct • 4 items • Updated May 9, 2024 • 6

Zachary Mueller PRO

AI & ML interests

Recent Activity

Organizations

muellerzr's activity

Accelerate 1.0.0

BigCodeBench: The Next Generation of HumanEval

From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation