3 18 5

Ziyang

hzy

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios

upvoted a paper 9 days ago

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents

liked a dataset 20 days ago

skylenage-ai/QwenClawBench

View all activity

Organizations

None yet

upvoted a paper 8 days ago

DV-World: Benchmarking Data Visualization Agents in Real-World Scenarios

Paper • 2604.25914 • Published 9 days ago • 41

upvoted a paper 9 days ago

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents

Paper • 2604.23781 • Published 11 days ago • 33

upvoted a paper 21 days ago

From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space

Paper • 2604.14142 • Published 22 days ago • 29

upvoted a paper about 2 months ago

LMEB: Long-horizon Memory Embedding Benchmark

Paper • 2603.12572 • Published Mar 13 • 73

upvoted 2 papers 3 months ago

WideSeek: Advancing Wide Research via Multi-Agent Scaling

Paper • 2602.02636 • Published Feb 2 • 16

Closing the Loop: Universal Repository Representation with RPG-Encoder

Paper • 2602.02084 • Published Feb 2 • 85

upvoted a paper 5 months ago

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published Dec 3, 2025 • 159

upvoted a paper 6 months ago

Towards Universal Video Retrieval: Generalizing Video Embedding via Synthesized Multimodal Pyramid Curriculum

Paper • 2510.27571 • Published Oct 31, 2025 • 19

upvoted a paper 7 months ago

In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7, 2025 • 111

upvoted 4 papers 8 months ago

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published Sep 23, 2025 • 67

RPG: A Repository Planning Graph for Unified and Scalable Codebase Generation

Paper • 2509.16198 • Published Sep 19, 2025 • 129

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Paper • 2509.09265 • Published Sep 11, 2025 • 47

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2, 2025 • 84

upvoted 2 papers 9 months ago

Thyme: Think Beyond Images

Paper • 2508.11630 • Published Aug 15, 2025 • 81

WideSearch: Benchmarking Agentic Broad Info-Seeking

Paper • 2508.07999 • Published Aug 11, 2025 • 112

upvoted a paper 11 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 190

upvoted 2 papers 12 months ago

Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent

Paper • 2505.07596 • Published May 12, 2025 • 11

Learning from Peers in Reasoning Models

Paper • 2505.07787 • Published May 12, 2025 • 45

Ziyang

AI & ML interests

Recent Activity

Organizations

hzy's activity