9 30 12

Xiaoyang Wang

xywang1

https://xyang0.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 15 hours ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

upvoted a paper about 1 month ago

InteractComp: Evaluating Search Agents With Ambiguous Queries

upvoted a paper about 2 months ago

DeepAnalyze: Agentic Large Language Models for Autonomous Data Science

View all activity

Organizations

None yet

upvoted a paper about 15 hours ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published 2 days ago • 58

upvoted a paper about 1 month ago

InteractComp: Evaluating Search Agents With Ambiguous Queries

Paper • 2510.24668 • Published Oct 28 • 97

upvoted 3 papers about 2 months ago

upvoted 4 papers 3 months ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published Sep 8 • 79

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9 • 101

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 193

TCIA: A Task-Centric Instruction Augmentation Method for Instruction Finetuning

Paper • 2508.20374 • Published Aug 28 • 21

upvoted 6 papers 4 months ago

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21 • 88

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

Paper • 2508.15760 • Published Aug 21 • 46

BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

Paper • 2508.10975 • Published Aug 14 • 60

Complex Logical Instruction Generation

Paper • 2508.09125 • Published Aug 12 • 40

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7 • 130

Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training

Paper • 2508.00414 • Published Aug 1 • 93

upvoted a paper 6 months ago

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5 • 58

upvoted a paper 8 months ago

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published Mar 31 • 62

upvoted a paper 9 months ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 171

upvoted a paper 10 months ago

AIDE: AI-Driven Exploration in the Space of Code

Paper • 2502.13138 • Published Feb 18 • 8

upvoted a paper 11 months ago

OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas

Paper • 2501.15427 • Published Jan 26 • 6

Xiaoyang Wang

AI & ML interests

Recent Activity

Organizations

xywang1's activity