justheuristic (Yozh)

upvoted an article 6 months ago

Article

Learn the Hugging Face Kernel Hub in 5 Minutes

+4

Jun 12

•

151

upvoted a paper 6 months ago

Magistral

Paper • 2506.10910 • Published Jun 12 • 65

upvoted 2 papers 7 months ago

Alchemist: Turning Public Text-to-Image Data into Generative Gold

Paper • 2505.19297 • Published May 25 • 84

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Paper • 2505.14669 • Published May 20 • 78

upvoted an article 7 months ago

Article

4D masks support in Transformers

Jan 8, 2024

•

31

upvoted 11 papers 8 months ago

Learning Adaptive Parallel Reasoning with Language Models

Paper • 2504.15466 • Published Apr 21 • 44

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published Apr 7 • 137

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published Jan 8 • 99

An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published Apr 8 • 64

Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

Paper • 2504.05599 • Published Apr 8 • 85

HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference

Paper • 2504.05897 • Published Apr 8 • 21

Accelerate Parallelizable Reasoning via Parallel Decoding within One Sequence

Paper • 2503.20533 • Published Mar 26 • 12

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published Apr 8 • 182

Pushing the Limits of Large Language Model Quantization via the Linearity Theorem

Paper • 2411.17525 • Published Nov 26, 2024 • 5

Extreme Compression of Large Language Models via Additive Quantization

Paper • 2401.06118 • Published Jan 11, 2024 • 13

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Paper • 2504.06261 • Published Apr 8 • 110

upvoted a paper 9 months ago

Scale-wise Distillation of Diffusion Models

Paper • 2503.16397 • Published Mar 20 • 41

upvoted an article 9 months ago

Article

Digest of models based on YandexGPT 5 Lite

Mar 19

•

33

upvoted an article over 1 year ago

Article

SmolLM - blazingly fast and remarkably powerful

+1

Jul 16, 2024

•

430

upvoted a paper over 1 year ago

TabReD: A Benchmark of Tabular Machine Learning in-the-Wild

Paper • 2406.19380 • Published Jun 27, 2024 • 50

Yozh

AI & ML interests

Organizations

Learn the Hugging Face Kernel Hub in 5 Minutes

Magistral

Alchemist: Turning Public Text-to-Image Data into Generative Gold

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

4D masks support in Transformers

Learning Adaptive Parallel Reasoning with Language Models

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

An Empirical Study of GPT-4o Image Generation Capabilities

Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought

HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference

Accelerate Parallelizable Reasoning via Parallel Decoding within One Sequence

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Pushing the Limits of Large Language Model Quantization via the Linearity Theorem

Extreme Compression of Large Language Models via Additive Quantization

Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

Scale-wise Distillation of Diffusion Models

Digest of models based on YandexGPT 5 Lite

SmolLM - blazingly fast and remarkably powerful

TabReD: A Benchmark of Tabular Machine Learning in-the-Wild

Yozh

AI & ML interests

Organizations

justheuristic's activity

Learn the Hugging Face Kernel Hub in 5 Minutes

4D masks support in Transformers

Digest of models based on YandexGPT 5 Lite

SmolLM - blazingly fast and remarkably powerful