Distilled Reasoning Models with Activation Sparse
AI & ML interests
ML algorithms and systems
Reproduce Deepseek distilled models based on open-r1.
-
InfiniAILab/OpenR1-Qwen-3B-SFT-Instruct
Text Generation • 3B • Updated • 8 • 1 -
InfiniAILab/OpenR1-Qwen-7B-SFT-Instruct
Text Generation • 8B • Updated • 11 • 2 -
InfiniAILab/OpenR1-Qwen-7B-Math-Instruct
Text Generation • 8B • Updated • 17 -
InfiniAILab/OpenR1-Qwen-1.5B-SFT-Instruct
Text Generation • 2B • Updated • 10
Distilled Reasoning Models with Activation Sparse
Reproduce Deepseek distilled models based on open-r1.
-
InfiniAILab/OpenR1-Qwen-3B-SFT-Instruct
Text Generation • 3B • Updated • 8 • 1 -
InfiniAILab/OpenR1-Qwen-7B-SFT-Instruct
Text Generation • 8B • Updated • 11 • 2 -
InfiniAILab/OpenR1-Qwen-7B-Math-Instruct
Text Generation • 8B • Updated • 17 -
InfiniAILab/OpenR1-Qwen-1.5B-SFT-Instruct
Text Generation • 2B • Updated • 10
models
96
InfiniAILab/Autoregressive-7B-2
2B
•
Updated
•
11
InfiniAILab/Autoregressive-7B
1.0B
•
Updated
•
1
•
1
InfiniAILab/Multiverse-7B
1B
•
Updated
•
476
InfiniAILab/Autoregressive-1.5B-2
0.2B
•
Updated
•
2
InfiniAILab/Autoregressive-1.5B
0.2B
•
Updated
•
1
•
1
InfiniAILab/Autoregressive-1.5B-no-structure
0.2B
•
Updated
•
4
InfiniAILab/Multiverse-1.5B
0.2B
•
Updated
•
183
•
1
InfiniAILab/S1-claude-1K-32B-bs16-new-tokenizer
33B
•
Updated
•
6
InfiniAILab/S1-claude-1K-32B-bs16
33B
•
Updated
•
6
InfiniAILab/S1.1-1K-32B-bs16-new-tokenizer-parallel-7.1-v6-true-mix-prompt
33B
•
Updated
•
5
datasets
22
InfiniAILab/multiverse-sample
Updated
•
20
InfiniAILab/gsm_infinite_symbolic_32k
Updated
•
129
InfiniAILab/gsm_infinite_hard_128k
Viewer
•
Updated
•
12.3k
•
424
InfiniAILab/gsm_infinite_symbolic_16k
Updated
•
199
InfiniAILab/gsm_infinite_medium_128k
Viewer
•
Updated
•
12.7k
•
901
InfiniAILab/gsm_infinite_symbolic_8k
Updated
•
467
InfiniAILab/gsm_infinite_hard_64k
Viewer
•
Updated
•
12.3k
•
15
InfiniAILab/gsm_infinite_symbolic_0
Updated
•
393
InfiniAILab/gsm_infinite_medium_64k
Viewer
•
Updated
•
21.3k
•
59
InfiniAILab/gsm_infinite_symbolic_128k
Updated
•
109