
Collections

Collections including paper arxiv:2602.04884

Reinforced Attention Learning

Paper • 2602.04884 • Published 11 days ago • 27

about 16 hours ago

MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods

Paper • 2601.21821 • Published 17 days ago • 59
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published 16 days ago • 99
Reinforced Attention Learning

Paper • 2602.04884 • Published 11 days ago • 27
LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts

Paper • 2510.19363 • Published Oct 22, 2025 • 62

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published 23 days ago • 175
DeepSeek-OCR 2: Visual Causal Flow

Paper • 2601.20552 • Published 18 days ago • 60
Linear representations in language models can change dramatically over a conversation

Paper • 2601.20834 • Published 18 days ago • 21
BMAM: Brain-inspired Multi-Agent Memory Framework

Paper • 2601.20465 • Published 18 days ago • 4

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14, 2025 • 300
Lizard: An Efficient Linearization Framework for Large Language Models

Paper • 2507.09025 • Published Jul 11, 2025 • 19
On the Expressiveness of Softmax Attention: A Recurrent Neural Network Perspective

Paper • 2507.23632 • Published Jul 31, 2025 • 6
Causal Attention with Lookahead Keys

Paper • 2509.07301 • Published Sep 9, 2025 • 21

Attention Learning

Reinforced Attention Learning

Paper • 2602.04884 • Published 11 days ago • 27

Reinforcement Learning via Self-Distillation

Paper • 2601.20802 • Published 18 days ago • 40
Reinforced Attention Learning

Paper • 2602.04884 • Published 11 days ago • 27

about 1 hour ago

UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models

Paper • 2410.14059 • Published Oct 17, 2024 • 63
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching

Paper • 2503.05179 • Published Mar 7, 2025 • 46
Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published Mar 6, 2025 • 96
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing

Paper • 2503.10639 • Published Mar 13, 2025 • 53

Reinforcement learning

Diffusion Augmented Agents: A Framework for Efficient Exploration and Transfer Learning

Paper • 2407.20798 • Published Jul 30, 2024 • 24
Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4, 2025 • 104
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25, 2025 • 75

Reinforced Attention Learning

Paper • 2602.04884 • Published 11 days ago • 27

