Weiqin Yang

Tiny-Snow

https://tiny-snow.github.io/

AI & ML interests

Large Language Models, Recommendation Foundation Models

Organizations

None yet

upvoted 4 papers about 2 months ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 129

ROOT: Robust Orthogonalized Optimizer for Neural Network Training

Paper • 2511.20626 • Published Nov 25, 2025 • 43

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published Nov 27, 2025 • 89

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 253

authored 2 papers 5 months ago

PSL: Rethinking and Improving Softmax Loss from Pairwise Perspective for Recommendation

Paper • 2411.00163 • Published Oct 31, 2024

Breaking the Top-$K$ Barrier: Advancing Top-$K$ Ranking Metrics Optimization in Recommender Systems

Paper • 2508.05673 • Published Aug 4, 2025

liked a Space 7 months ago

Beam Search Visualizer

View how beam search decoding works, in detail!

upvoted an article 7 months ago

How to generate text: using different decoding methods for language generation with Transformers

Article • Mar 1, 2020 • 283

liked 2 models 7 months ago

Qwen/Qwen3-1.7B-Base

Text Generation • 2B • Updated Jul 26, 2025 • 188k • 54

Qwen/Qwen3-8B-Base

Text Generation • 8B • Updated May 21, 2025 • 246k • 77

upvoted a collection 7 months ago

Qwen3

84 items • Updated 23 days ago • 1.59k

liked 5 models 7 months ago

Qwen/Qwen3-8B

Text Generation • 8B • Updated Jul 26, 2025 • 4.16M • 869

meta-llama/Llama-3.2-1B-Instruct

Text Generation • 1B • Updated Oct 24, 2024 • 2.65M • 1.26k

meta-llama/Llama-3.2-3B-Instruct

Text Generation • 3B • Updated Oct 24, 2024 • 1.43M • 1.94k

meta-llama/Llama-3.2-3B

Text Generation • 3B • Updated Oct 24, 2024 • 724k • 684

meta-llama/Llama-3.2-1B

Text Generation • 1B • Updated Oct 24, 2024 • 2.66M • 2.26k

upvoted 3 collections 7 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 704

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 193

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 649