-
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
Paper • 2503.19470 • Published • 19 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
A Survey on Large Language Model Benchmarks
Paper • 2508.15361 • Published • 20 -
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper • 2501.05366 • Published • 102
yangdechuan
yangdechuan
·
AI & ML interests
None yet
Organizations
None yet
LLM
-
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
Paper • 2503.19470 • Published • 19 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
A Survey on Large Language Model Benchmarks
Paper • 2508.15361 • Published • 20 -
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper • 2501.05366 • Published • 102
models
6
yangdechuan/bert-base-uncased
Feature Extraction
•
0.1B
•
Updated
•
2
yangdechuan/bert-base-cased
Feature Extraction
•
0.1B
•
Updated
•
6
yangdechuan/mt5-small-finetuned-amazon-en-es-accelerate
Updated
•
3
yangdechuan/mt5-small-finetuned-amazon-en-es
Summarization
•
Updated
•
41
yangdechuan/codeparrot-ds
Text Generation
•
Updated
•
9
yangdechuan/bert-base-cased-finetuned-mrpc
Text Classification
•
Updated
•
5