Ziniu Li's picture

5 22 5

Ziniu Li

ziniuli

·

http://www.liziniu.org/

liziniu

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

How Far Are We from Genuinely Useful Deep Research Agents?

upvoted a paper 5 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

published a dataset 9 days ago

ziniuli/rollout

View all activity

Organizations

authored 10 papers about 1 month ago

Why Transformers Need Adam: A Hessian Perspective

Paper • 2402.16788 • Published Feb 26, 2024 • 1

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

Paper • 2310.10505 • Published Oct 16, 2023 • 1

CoRT: Code-integrated Reasoning within Thinking

Paper • 2506.09820 • Published Jun 11 • 17

Spectral Policy Optimization: Coloring your Incorrect Reasoning in GRPO

Paper • 2505.11595 • Published May 16 • 1

A Survey on Large Language Models for Mathematical Reasoning

Paper • 2506.08446 • Published Jun 10

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24 • 80

Bridging Formal Language with Chain-of-Thought Reasoning to Geometry Problem Solving

Paper • 2508.09099 • Published Aug 12

Advancing Zero-shot Text-to-Speech Intelligibility across Diverse Domains via Preference Alignment

Paper • 2505.04113 • Published May 7

Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

Paper • 2509.25849 • Published Sep 30 • 47

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29 • 219

authored a paper 10 months ago

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques

Paper • 2501.14492 • Published Jan 24 • 32

authored a paper 11 months ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 74

authored a paper over 1 year ago

Adam-mini: Use Fewer Learning Rates To Gain More

Paper • 2406.16793 • Published Jun 24, 2024 • 69