Maxim Evdokimov's picture

23 1

Maxim Evdokimov

mevdokimov

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Qwen3-VL Technical Report

upvoted a paper 6 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

liked a dataset about 2 months ago

MTSAIR/MWS-Vision-Bench

View all activity

Organizations

None yet

upvoted a paper 5 days ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published 12 days ago • 111

upvoted a paper 6 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 6 days ago • 181

upvoted 5 papers 3 months ago

Mini-o3: Scaling Up Reasoning Patterns and Interaction Turns for Visual Search

Paper • 2509.07969 • Published Sep 9 • 59

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2 • 225

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2 • 124

Kwai Keye-VL 1.5 Technical Report

Paper • 2509.01563 • Published Sep 1 • 37

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25 • 208

upvoted 2 papers 4 months ago

Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success

Paper • 2508.04280 • Published Aug 6 • 35

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 263

upvoted a paper 5 months ago

Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models

Paper • 2507.12566 • Published Jul 16 • 14

upvoted a collection 5 months ago

T-pro-2.0

Hybrid reasoning model based on Qwen3 32B • 14 items • Updated 7 days ago • 30

upvoted a paper 5 months ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 241

upvoted 2 papers 7 months ago

ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking

Paper • 2505.08581 • Published May 13 • 9

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 317

upvoted 6 papers 8 months ago

I-Con: A Unifying Framework for Representation Learning

Paper • 2504.16929 • Published Apr 23 • 29

Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding

Paper • 2504.10465 • Published Apr 14 • 27

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 200

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 303

Kimi-VL Technical Report

Paper • 2504.07491 • Published Apr 10 • 132

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 300