- deepseek-ai/DeepSeek-V3.2-Exp
  Text Generation • 685B • 63.8k • 943
- deepseek-ai/DeepSeek-V3.2-Exp-Base
  Text Generation • 685B • 1.13k • 55
- deepseek-ai/DeepSeek-V3.2
  Text Generation • 685B • 230k • 1.17k
- deepseek-ai/DeepSeek-V3.2-Speciale
  Text Generation • 685B • 22.5k • 651
DeepSeek
company
Verified
Papers
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
mHC: Manifold-Constrained Hyper-Connections
- deepseek-ai/DeepSeek-R1
  Text Generation • 685B • 384k • 13k
- deepseek-ai/DeepSeek-R1-Zero
  Text Generation • 685B • 2.97k • 941
- deepseek-ai/DeepSeek-R1-Distill-Llama-70B
  Text Generation • 71B • 284k • 739
- deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
  Text Generation • 33B • 2.29M • 1.5k
DeepSeek Math series
- deepseek-ai/DeepSeek-Math-V2
  Text Generation • 685B • 2.59k • 679
- deepseek-ai/deepseek-math-7b-instruct
  Text Generation • 3.48k • 146
- deepseek-ai/deepseek-math-7b-rl
  Text Generation • 7B • 1.17k • 91
- deepseek-ai/deepseek-math-7b-base
  Text Generation • 1.81k • 84
Janus is a novel autoregressive framework that unifies multimodal understanding and generation.
Models for the paper on expert-specialized fine-tuning.
DeepSeek Coder series
- deepseek-ai/deepseek-coder-33b-instruct
  Text Generation • 33B • 25.7k • 561
- deepseek-ai/deepseek-coder-6.7b-instruct
  Text Generation • 7B • 58.9k • 469
- deepseek-ai/deepseek-coder-7b-instruct-v1.5
  Text Generation • 7B • 2.88k • 143
- deepseek-ai/deepseek-coder-1.3b-instruct
  Text Generation • 1B • 174k • 153
- Chat with DeepSeek-VL2-small
  🌍 577 • Generate responses using images and text input
- deepseek-ai/deepseek-vl2-tiny
  Image-Text-to-Text • 3B • 111k • 240
- deepseek-ai/deepseek-vl2-small
  Image-Text-to-Text • 16B • 117k • 173
- deepseek-ai/deepseek-vl2
  Image-Text-to-Text • 27B • 11.8k • 377
DeepSeek-Prover-Series
- deepseek-ai/DeepSeek-Coder-V2-Instruct
  Text Generation • 236B • 2.22k • 677
- deepseek-ai/DeepSeek-Coder-V2-Base
  Text Generation • 236B • 365 • 82
- deepseek-ai/DeepSeek-Coder-V2-Lite-Base
  Text Generation • 16B • 1.96k • 101
- deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct
  Text Generation • 16B • 199k • 530
DeepSeek-VL model series
DeepSeek LLM series
DeepSeek MoE series