Agentic RL - an edilmo Collection

updated 2 days ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL
Paper • 2508.13167 • Published Aug 6, 2025 • 129

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper • 2509.02547 • Published Sep 2, 2025 • 228

PRewrite: Prompt Rewriting with Reinforcement Learning
Paper • 2401.08189 • Published Jan 16, 2024

UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning
Paper • 2509.11543 • Published Sep 15, 2025 • 48

Multiplayer Nash Preference Optimization
Paper • 2509.23102 • Published Sep 27, 2025 • 62

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows
Paper • 2512.16969 • Published Dec 18, 2025 • 114

Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering
Paper • 2601.10402 • Published 3 days ago • 34

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning
Paper • 2601.09667 • Published 4 days ago • 73