WebML Community

community

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

Xenova new activity 7 days ago

webml-community/SAM3-Tracker-WebGPU:File upload not working under linux

Xenova updated a Space 7 days ago

webml-community/SAM3-Tracker-WebGPU

Xenova new activity 9 days ago

webml-community/Supertonic-TTS-WebGPU:add-text-preprocessing

View all activity

Xenova

in webml-community/SAM3-Tracker-WebGPU 7 days ago

File upload not working under linux

#1 opened 17 days ago by

Xenova

updated a Space 7 days ago

SAM3 Tracker WebGPU

Segment and extract parts from images by clicking

Xenova

in webml-community/Supertonic-TTS-WebGPU 9 days ago

add-text-preprocessing

#2 opened 9 days ago by

Xenova

updated a Space 12 days ago

Supertonic TTS WebGPU

Blazingly fast text-to-speech 100% locally in your browser

Xenova

published a Space 12 days ago

Supertonic TTS WebGPU

Blazingly fast text-to-speech 100% locally in your browser

Xenova

updated a Space 12 days ago

OuteTTS WebGPU

WebGPU text-to-Speech powered by OuteTTS and Transformers.js

Xenova

updated a Space 13 days ago

Llama 3.2 WebGPU

A powerful AI chatbot that runs locally in your browser

Xenova

in webml-community/llama-3.2-webgpu 13 days ago

Update demo (Transformers.js v3.8.0)

#4 opened 13 days ago by

Xenova

published a Space 17 days ago

SAM3 Tracker WebGPU

Segment and extract parts from images by clicking

Xenova

published a Space 19 days ago

Baguettotron WebGPU

A small but powerful reasoning model

Xenova

updated a Space 19 days ago

Baguettotron WebGPU

A small but powerful reasoning model

Xenova

updated a Space about 2 months ago

NanoChat WebGPU

Run NanoChat 100% locally in your browser on WebGPU

Xenova

published a Space about 2 months ago

NanoChat WebGPU

Run NanoChat 100% locally in your browser on WebGPU

Xenova

published a Space 2 months ago

MongoDB Embedding WebGPU

Generate a scatterplot of sentences by category

Xenova

updated a Space 2 months ago

MongoDB Embedding WebGPU

Generate a scatterplot of sentences by category

Xenova

updated 2 Spaces 3 months ago

Semantic Galaxy

Visualize embeddings in 3D space, powered by EmbeddingGemma

Janus 1.3B WebGPU

In-browser unified multimodal understanding and generation.

Xenova

in webml-community/Janus-1.3B-WebGPU 3 months ago

Update files

#2 opened 3 months ago by

Xenova

published a Space 3 months ago

Semantic Galaxy

Visualize embeddings in 3D space, powered by EmbeddingGemma

Xenova

posted an update 4 months ago

Post

11659

Okay this is insane... WebGPU-accelerated semantic video tracking, powered by DINOv3 and Transformers.js! 🤯
Demo (+ source code): webml-community/DINOv3-video-tracking

This will revolutionize AI-powered video editors... which can now run 100% locally in your browser, no server inference required (costs $0)! 😍

How does it work? 🤔
1️⃣ Generate and cache image features for each frame
2️⃣ Create a list of embeddings for selected patch(es)
3️⃣ Compute cosine similarity between each patch and the selected patch(es)
4️⃣ Highlight those whose score is above some threshold

... et voilà! 🥳

You can also make selections across frames to improve temporal consistency! This is super useful if the object changes its appearance slightly throughout the video.

Excited to see what the community builds with it!

1 reply

·