EdenQiao

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 21 days ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published 21 days ago • 24

upvoted a paper 7 months ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28, 2025 • 131

liked a model 7 months ago

openbmb/AgentCPM-GUI

Image-Text-to-Text • 8B • Updated Jun 14, 2025 • 209 • 128

liked a model 11 months ago

agentica-org/DeepScaleR-1.5B-Preview

Text Generation • 2B • Updated Apr 9, 2025 • 23k • 577

liked 6 datasets about 1 year ago

liked a dataset over 1 year ago

ruslanmv/ai-medical-chatbot

Viewer • Updated Mar 23, 2024 • 257k • 993 • 245

liked a model over 1 year ago

sfairXC/FsfairX-LLaMA3-RM-v0.1

Text Classification • 8B • Updated Oct 14, 2024 • 866 • 60

liked 3 datasets over 1 year ago

locuslab/TOFU

Viewer • Updated Mar 27, 2025 • 18.1k • 91.7k • 43

openai/gsm8k

Benchmark • Updated 19 days ago • 17.6k • 421k • 1.1k

EdinburghNLP/xsum

Viewer • Updated Apr 5, 2023 • 227k • 18.9k • 128

liked 4 models over 1 year ago

weqweasdas/hh_rlhf_rm_open_llama_3b

Text Classification • Updated Feb 25, 2024 • 233 • 17

Ray2333/reward-model-Mistral-7B-instruct-Unified-Feedback

Text Classification • 7B • Updated Feb 5, 2025 • 355 • 11

tiiuae/falcon-7b

Text Generation • 7B • Updated Oct 12, 2024 • 101k • 1.1k

lmsys/vicuna-7b-v1.5

Text Generation • Updated Mar 13, 2024 • 102k • 386

liked a dataset over 1 year ago

nvidia/HelpSteer2

Viewer • Updated Dec 18, 2024 • 21.4k • 12k • 435
