Alchemist: Turning Public Text-to-Image Data into Generative Gold Paper β’ 2505.19297 β’ Published May 25 β’ 84
Quartet: Native FP4 Training Can Be Optimal for Large Language Models Paper β’ 2505.14669 β’ Published May 20 β’ 78
Learning Adaptive Parallel Reasoning with Language Models Paper β’ 2504.15466 β’ Published Apr 21 β’ 44
PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters Paper β’ 2504.08791 β’ Published Apr 7 β’ 137
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though Paper β’ 2501.04682 β’ Published Jan 8 β’ 99
An Empirical Study of GPT-4o Image Generation Capabilities Paper β’ 2504.05979 β’ Published Apr 8 β’ 64
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought Paper β’ 2504.05599 β’ Published Apr 8 β’ 85
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference Paper β’ 2504.05897 β’ Published Apr 8 β’ 21
Accelerate Parallelizable Reasoning via Parallel Decoding within One Sequence Paper β’ 2503.20533 β’ Published Mar 26 β’ 12
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper β’ 2504.06263 β’ Published Apr 8 β’ 182
Pushing the Limits of Large Language Model Quantization via the Linearity Theorem Paper β’ 2411.17525 β’ Published Nov 26, 2024 β’ 5
Extreme Compression of Large Language Models via Additive Quantization Paper β’ 2401.06118 β’ Published Jan 11, 2024 β’ 13
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention Paper β’ 2504.06261 β’ Published Apr 8 β’ 110
TabReD: A Benchmark of Tabular Machine Learning in-the-Wild Paper β’ 2406.19380 β’ Published Jun 27, 2024 β’ 50