Zixia Jia
vickyandkekey
AI & ML interests
None yet
Recent Activity
upvoted a paper, about 5 hours ago: Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning
upvoted a paper, 6 months ago: RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling
authored a paper, 7 months ago: Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space
Organizations
None yet