FEAT: Full-Dimensional Efficient Attention Transformer for Medical Video Generation • Paper • 2506.04956 • Published Jun 5, 2025
Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs • Paper • 2504.07866 • Published Apr 10, 2025
Absolute Zero: Reinforced Self-play Reasoning with Zero Data • Paper • 2505.03335 • Published May 6, 2025
RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale • Paper • 2505.03005 • Published May 5, 2025
Volume estimates for unions of convex sets, and the Kakeya set conjecture in three dimensions • Paper • 2502.17655 • Published Feb 24, 2025
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention • Paper • 2504.06261 • Published Apr 8, 2025
People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text • Paper • 2501.15654 • Published Jan 26, 2025
Qwen2.5 Collection • Qwen2.5 language models, including pretrained and instruction-tuned variants in seven sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B • 46 items • Updated Jul 21
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning • Paper • 2502.06060 • Published Feb 9, 2025