view article Article Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training +3 Aug 8, 2025 • 90
ICT3214-Group5/Cryptography_GPT_2_v1.0.0 Text Generation • 0.1B • Updated Nov 27, 2024 • 8 • 1
mlx-community/Qwen3-4B-Instruct-2507-4bit Text Generation • 0.6B • Updated 8 days ago • 3.91k • 7
uer/roberta-base-chinese-extractive-qa Question Answering • Updated Oct 17, 2023 • 1.66k • • 102
zai-org/GLM-4.1V-9B-Thinking Image-Text-to-Text • 10B • Updated Oct 25, 2025 • 103k • • 762