2 14

Liu

Shiweiliuiiiiiii

https://shiweiliuiiiiiii.github.io/

Shiwei_Liu66

AI & ML interests

LLM, reasoning, ML efficiency

Recent Activity

upvoted a paper about 2 months ago

The Path Not Taken: RLVR Provably Learns Off the Principals

upvoted a paper 3 months ago

The Art of Scaling Reinforcement Learning Compute for LLMs

upvoted a paper 4 months ago

Diffusion Language Models Know the Answer Before Decoding

View all activity

Organizations

None yet

upvoted a paper about 2 months ago

The Path Not Taken: RLVR Provably Learns Off the Principals

Paper • 2511.08567 • Published Nov 11, 2025 • 33

upvoted a paper 3 months ago

The Art of Scaling Reinforcement Learning Compute for LLMs

Paper • 2510.13786 • Published Oct 15, 2025 • 31

upvoted a paper 4 months ago

Diffusion Language Models Know the Answer Before Decoding

Paper • 2508.19982 • Published Aug 27, 2025 • 25

upvoted a paper 6 months ago

Spatial Mental Modeling from Limited Views

Paper • 2506.21458 • Published Jun 26, 2025 • 13

upvoted a paper 7 months ago

LIFT the Veil for the Truth: Principal Weights Emerge after Rank Reduction for Reasoning-Focused Supervised Fine-Tuning

Paper • 2506.00772 • Published Jun 1, 2025 • 2

upvoted a paper 10 months ago

SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers

Paper • 2502.20545 • Published Feb 27, 2025 • 22

upvoted 3 papers 11 months ago

upvoted 2 papers 12 months ago

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Paper • 2501.12570 • Published Jan 22, 2025 • 28

SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training

Paper • 2501.06842 • Published Jan 12, 2025 • 16

upvoted 2 papers about 1 year ago

Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN

Paper • 2412.13795 • Published Dec 18, 2024 • 20

APOLLO: SGD-like Memory, AdamW-level Performance

Paper • 2412.05270 • Published Dec 6, 2024 • 37

upvoted a paper over 1 year ago

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients

Paper • 2407.08296 • Published Jul 11, 2024 • 33

Liu

AI & ML interests

Recent Activity

Organizations

Shiweiliuiiiiiii's activity