Collections including paper arxiv:2504.12216

BroRL: Scaling Reinforcement Learning via Broadened Exploration

Paper • 2510.01180 • Published Oct 1 • 18
MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information

Paper • 2510.03632 • Published Oct 4 • 41
Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

Paper • 2509.25849 • Published Sep 30 • 47
Beyond the Exploration-Exploitation Trade-off: A Hidden State Approach for LLM Reasoning in RLVR

Paper • 2509.23808 • Published Sep 28 • 47

Why mask diffusion does not work

Paper • 2510.03289 • Published Sep 29
d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning

Paper • 2504.12216 • Published Apr 16 • 3
Fast-dLLM v2: Efficient Block-Diffusion LLM

Paper • 2509.26328 • Published Sep 30 • 54

Discrete Diffusion LLM & MLLM

A collection of research/models in discrete diffusion large language and multimodal models

Discrete Diffusion in Large Language and Multimodal Models: A Survey

Paper • 2506.13759 • Published Jun 16 • 43
GSAI-ML/LLaDA-8B-Instruct

Text Generation • 8B • Updated Oct 21 • 239k • 337
Dream-org/Dream-v0-Base-7B

Text Generation • 8B • Updated Jul 15 • 346k • 51
Dream-org/Dream-v0-Instruct-7B

Text Generation • 8B • Updated Jul 15 • 90.6k • 144

How to inject knowledge efficiently? Knowledge Infusion Scaling Law for Pre-training Large Language Models

Paper • 2509.19371 • Published Sep 19
Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Paper • 2505.06708 • Published May 10 • 7
Selective Attention: Enhancing Transformer through Principled Context Control

Paper • 2411.12892 • Published Nov 19, 2024
A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 189

(M)LLMs based on Discrete Diffusion Model and relevant techniques

diffusionfamily/diffullama

Text Generation • 7B • Updated Oct 25, 2024 • 524 • 11
GSAI-ML/LLaDA-8B-Base

Text Generation • 8B • Updated Oct 21 • 123k • 87
Dream-org/Dream-v0-Instruct-7B

Text Generation • 8B • Updated Jul 15 • 90.6k • 144
GSAI-ML/LLaDA-V

Image-Text-to-Text • 8B • Updated Jun 18 • 5.43k • 23

Diffusion Language

d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning

Paper • 2504.12216 • Published Apr 16 • 3
Unifying Autoregressive and Diffusion-Based Sequence Generation

Paper • 2504.06416 • Published Apr 8 • 3
The Diffusion Duality

Paper • 2506.10892 • Published Jun 12 • 37
Anchored Diffusion Language Model

Paper • 2505.18456 • Published May 24 • 1
