seojinlee's picture

76 36

seojinlee

sjlee311

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

upvoted a paper 8 days ago

Revisiting Generalization Across Difficulty Levels: It's Not So Easy

upvoted a paper 8 days ago

Latent Collaboration in Multi-Agent Systems

View all activity

Organizations

None yet

upvoted a paper 4 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 7 days ago • 185

upvoted 2 papers 8 days ago

Revisiting Generalization Across Difficulty Levels: It's Not So Easy

Paper • 2511.21692 • Published 13 days ago • 15

Latent Collaboration in Multi-Agent Systems

Paper • 2511.20639 • Published 14 days ago • 113

liked a model 12 days ago

sionic-ai/nllb-200-ko-gec-3.3B

Translation • Updated Jul 2, 2024 • 108 • 10

upvoted 3 papers about 1 month ago

KORMo: Korean Open Reasoning Model for Everyone

Paper • 2510.09426 • Published Oct 10 • 81

A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning

Paper • 2510.15444 • Published Oct 17 • 147

DeepSeek-OCR: Contexts Optical Compression

Paper • 2510.18234 • Published Oct 21 • 82

upvoted a paper about 2 months ago

Making Mathematical Reasoning Adaptive

Paper • 2510.04617 • Published Oct 6 • 22

upvoted a paper 2 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6 • 495

liked a model 2 months ago

Zigeng/R1-VeriThinker-7B

Text Generation • 8B • Updated May 27 • 83 • 5

upvoted a paper 2 months ago

EmbeddingGemma: Powerful and Lightweight Text Representations

Paper • 2509.20354 • Published Sep 24 • 41

upvoted 7 papers 3 months ago

Qwen3-Omni Technical Report

Paper • 2509.17765 • Published Sep 22 • 139

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 189

Reverse-Engineered Reasoning for Open-Ended Generation

Paper • 2509.06160 • Published Sep 7 • 149

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 193

DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

Paper • 2509.01396 • Published Sep 1 • 57

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2 • 225

How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench

Paper • 2508.20931 • Published Aug 28 • 15

upvoted 2 papers 4 months ago

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22 • 158

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21 • 88