Diagnose, Localize, Align: A Full-Stack Framework for Reliable LLM Multi-Agent Systems under Instruction Conflicts Paper • 2509.23188 • Published Sep 27 • 3
Trading-R1: Financial Trading with LLM Reasoning via Reinforcement Learning Paper • 2509.11420 • Published Sep 14 • 2
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search Paper • 2509.25454 • Published Sep 29 • 140
Life-Code: Central Dogma Modeling with Multi-Omics Sequence Unification Paper • 2502.07299 • Published Feb 11 • 2
SemiReward: A General Reward Model for Semi-supervised Learning Paper • 2310.03013 • Published Oct 4, 2023 • 2
A Survey of Generative AI for De Novo Drug Design: New Frontiers in Molecule and Protein Generation Paper • 2402.08703 • Published Feb 13, 2024 • 1
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN Paper • 2205.13943 • Published May 27, 2022 • 1
MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning Paper • 2503.07459 • Published Mar 10 • 16
LocAgent: Graph-Guided LLM Agents for Code Localization Paper • 2503.09089 • Published Mar 12 • 13
PoseX: AI Defeats Physics Approaches on Protein-Ligand Cross Docking Paper • 2505.01700 • Published May 3 • 1
AnchorAttention: Difference-Aware Sparse Attention with Stripe Granularity Paper • 2505.23520 • Published May 29
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving Paper • 2507.06229 • Published Jul 8 • 75
The Invisible Leash: Why RLVR May Not Escape Its Origin Paper • 2507.14843 • Published Jul 20 • 85