Pavel Iakubovskii's picture

Pavel Iakubovskii

qubvel-hf

·

AI & ML interests

Computer Vision models

Recent Activity

upvoted a paper 3 days ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

liked a Space 25 days ago

tori29umai/Qwen-Image-2509-MultipleAngles

liked a Space 26 days ago

linoyts/Qwen-Image-Edit-Angles

View all activity

Organizations

upvoted a paper 3 days ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published 4 days ago • 150

upvoted an article 3 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

+5

Sep 11

•

166

upvoted a collection 4 months ago

DINOv3

DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21 • 398

upvoted an article 4 months ago

Article

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

Aug 11

•

75

upvoted 2 collections 4 months ago

MM Grounding DINO

8 items • Updated Aug 1 • 5

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 391

upvoted a collection 5 months ago

H-Net

The family of hierarchical networks (H-Nets) from https://arxiv.org/abs/2507.07955 • 8 items • Updated Jul 11 • 20

upvoted an article 5 months ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9

•

722

upvoted a paper 6 months ago

V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning

Paper • 2506.09985 • Published Jun 11 • 29

upvoted a collection 6 months ago

V-JEPA 2

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13 • 173

upvoted an article 6 months ago

Article

KV Cache from scratch in nanoVLM

+3

Jun 4

•

103

upvoted 2 changelogs 7 months ago

Changelog

Xet is now the default storage option for new users and organizations

May 23

• 74

Changelog

Static Spaces can now have a build step

May 23

• 105

upvoted 2 articles 7 months ago

Article

The Transformers Library: standardizing model definitions

+2

May 15

•

120

Article

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

+5

May 11

•

86

upvoted 2 collections 7 months ago

D-FINE

State-of-the-art real-time object detection model with Apache 2.0 licence • 15 items • Updated May 5 • 56

Qwen3

84 items • Updated Aug 6 • 1.47k

upvoted an article 8 months ago

Article

Welcome to Inference Providers on the Hub 🔥

+5

Jan 28

•

490

upvoted 2 collections 8 months ago

Describe Anything

Multimodal Large Language Models for Detailed Localized Image and Video Captioning • 7 items • Updated 5 days ago • 60

DPT

Vision Transformers for Dense Prediction. • 1 item • Updated Apr 12 • 1