Haoze Wu

WaitHZ

https://waithz.github.io/

AI & ML interests

Modular DL, Complex Reasoning

Recent Activity

upvoted 2 papers 6 days ago

InnoGym: Benchmarking the Innovation Potential of AI Agents

Paper • 2512.01822 • Published 9 days ago • 33

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 8 days ago • 194

upvoted a paper 9 days ago

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

Paper • 2511.22570 • Published 13 days ago • 69

liked a model 9 days ago

deepseek-ai/DeepSeek-V3.2

Text Generation • 685B • Updated 9 days ago • 40.7k • 854

upvoted a paper about 1 month ago

From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones

Paper • 2509.25123 • Published Sep 29 • 20

authored a paper about 1 month ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29 • 45

upvoted a paper about 1 month ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29 • 45

liked a dataset about 1 month ago

hkust-nlp/Toolathlon-Trajectories

Preview • Updated 5 days ago • 7.95k • 15

upvoted a paper about 2 months ago

LightMem: Lightweight and Efficient Memory-Augmented Generation

Paper • 2510.18866 • Published Oct 21 • 110

upvoted a collection 2 months ago

DeepSeek-V3.2

4 items • Updated 9 days ago • 507

updated 2 collections 3 months ago

MATH-Benchmark

5 items • Updated Sep 15

MATH-Training

2 items • Updated Sep 15

upvoted a paper 3 months ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8 • 79

authored 2 papers 3 months ago

ReCode: Updating Code API Knowledge with Reinforcement Learning

Paper • 2506.20495 • Published Jun 25 • 9

Model-Task Alignment Drives Distinct RL Outcomes

Paper • 2508.21188 • Published Aug 28 • 8

upvoted a paper 3 months ago

Model-Task Alignment Drives Distinct RL Outcomes

Paper • 2508.21188 • Published Aug 28 • 8

updated a collection 5 months ago

ReCode

2 items • Updated Jul 21