Useful Sensors Inc.

Team

company

https://usefulsensors.com/

usefulsensors

AI & ML interests

AI for the physical world, TinyML, Embedded Systems

Recent Activity

keveman

updated a Space 2 days ago

Moonshine Streaming Demo

📚

Demo of Moonshine streaming ASR model

keveman

published a Space 2 days ago

Moonshine Streaming Demo

📚

Demo of Moonshine streaming ASR model

keveman

updated a model 3 days ago

UsefulSensors/moonshine-streaming

Updated 3 days ago

keveman

published a model 3 days ago

UsefulSensors/moonshine-streaming

Updated 3 days ago

petewarden

updated a model 7 days ago

UsefulSensors/moonshine

Automatic Speech Recognition • Updated 7 days ago • 81

eustlb

authored a paper 12 days ago

Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Recognition Evaluation

Paper • 2510.06961 • Published Oct 8 • 9

petewarden

updated a model 16 days ago

UsefulSensors/moonshine-base-ko

Automatic Speech Recognition • 61.5M • Updated 16 days ago • 278

evanking

authored a paper 3 months ago

Sasha: Creative Goal-Oriented Reasoning in Smart Homes with Large Language Models

Paper • 2305.09802 • Published May 16, 2023

theadamsabra

authored a paper 3 months ago

Flavors of Moonshine: Tiny Specialized ASR Models for Edge Devices

Paper • 2509.02523 • Published Sep 2 • 7

evanking

authored a paper 3 months ago

Flavors of Moonshine: Tiny Specialized ASR Models for Edge Devices

Paper • 2509.02523 • Published Sep 2 • 7

theadamsabra

authored a paper 3 months ago

SECP: A Speech Enhancement-Based Curation Pipeline For Scalable Acquisition Of Clean Speech

Paper • 2402.12482 • Published Feb 19, 2024

Xenova

posted an update 4 months ago

Post

11690

Okay this is insane... WebGPU-accelerated semantic video tracking, powered by DINOv3 and Transformers.js! 🤯
Demo (+ source code): webml-community/DINOv3-video-tracking

This will revolutionize AI-powered video editors... which can now run 100% locally in your browser, no server inference required (costs $0)! 😍

How does it work? 🤔
1️⃣ Generate and cache image features for each frame
2️⃣ Create a list of embeddings for selected patch(es)
3️⃣ Compute cosine similarity between each patch and the selected patch(es)
4️⃣ Highlight those whose score is above some threshold

... et voilà! 🥳

You can also make selections across frames to improve temporal consistency! This is super useful if the object changes its appearance slightly throughout the video.
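The four steps listed in this post can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not the demo's actual code (which runs on Transformers.js and WebGPU): the function names, the toy 2-D embeddings, and the threshold value are all hypothetical, and real patch features would come from DINOv3. Passing several selected embeddings, for example ones picked on different frames, keeps the best score per patch.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine of the angle between two embedding vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def highlight_patches(
    frame_patches: Sequence[Vector],
    selected: Sequence[Vector],
    threshold: float = 0.6,  # hypothetical value; the post only says "some threshold"
) -> List[int]:
    """Return indices of patches whose best similarity to any selected embedding exceeds the threshold."""
    if not selected:
        return []
    highlighted = []
    for index, patch in enumerate(frame_patches):
        best = max(cosine_similarity(patch, ref) for ref in selected)
        if best > threshold:
            highlighted.append(index)
    return highlighted


# Toy cached features for one frame (step 1) and one selected patch (step 2).
frame = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [-1.0, 0.0]]
selection = [[1.0, 0.0]]

# Steps 3 and 4: score every patch against the selection and keep the close ones.
print(highlight_patches(frame, selection, threshold=0.8))
```

In a real tracker the per-frame features would be computed once and cached, so only the similarity pass reruns when the selection changes.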

Excited to see what the community builds with it!

1 reply

Xenova

posted an update 4 months ago

Post

4518

The next generation of AI-powered websites is going to be WILD! 🤯

In-browser tool calling and MCP are finally here, allowing LLMs to interact with websites programmatically.

To show what's possible, I built a demo using Liquid AI's new LFM2 model, powered by 🤗 Transformers.js: LiquidAI/LFM2-WebGPU

As always, the demo is open source (which you can find under the "Files" tab), so I'm excited to see how the community builds upon this! 🚀

2 replies

Xenova

posted an update 5 months ago

Post

3409

Introducing Voxtral WebGPU: State-of-the-art audio transcription directly in your browser! 🤯
🗣️ Transcribe videos, meeting notes, songs and more
🔐 Runs on-device, meaning no data is sent to a server
🌎 Multilingual (8 languages)
🤗 Completely free (forever) & open source

That's right, we're running Mistral's new Voxtral-Mini-3B model 100% locally in-browser on WebGPU, powered by Transformers.js and ONNX Runtime Web! 🔥

Try it out yourself! 👇
webml-community/Voxtral-WebGPU

Xenova

posted an update 6 months ago

Post

7350

NEW: Real-time conversational AI models can now run 100% locally in your browser! 🤯

🔐 Privacy by design (no data leaves your device)
💰 Completely free... forever
📦 Zero installation required, just visit a website
⚡️ Blazingly-fast WebGPU-accelerated inference

Try it out: webml-community/conversational-webgpu

For those interested, here's how it works:
- Silero VAD for voice activity detection
- Whisper for speech recognition
- SmolLM2-1.7B for text generation
- Kokoro for text to speech

Powered by Transformers.js and ONNX Runtime Web! 🤗 I hope you like it!
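The four stages listed above chain together roughly as follows. This is a minimal sketch, assuming each stage can be treated as a plain function: the `VoicePipeline` class and its field names are hypothetical, and the lambdas are stand-ins for Silero VAD, Whisper, SmolLM2-1.7B, and Kokoro rather than calls into any real library.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class VoicePipeline:
    """Hypothetical wiring of the four stages; every callable is a placeholder."""

    detect_speech: Callable[[bytes], bool]   # voice activity detection
    transcribe: Callable[[bytes], str]       # speech recognition
    generate_reply: Callable[[str], str]     # text generation
    synthesize: Callable[[str], bytes]       # text to speech

    def respond(self, audio_chunk: bytes) -> Optional[bytes]:
        # Skip the expensive stages entirely when no speech is detected.
        if not self.detect_speech(audio_chunk):
            return None
        text = self.transcribe(audio_chunk)
        reply = self.generate_reply(text)
        return self.synthesize(reply)


# Stub stages so the sketch runs on its own; real models would replace these.
pipeline = VoicePipeline(
    detect_speech=lambda audio: len(audio) > 0,
    transcribe=lambda audio: audio.decode("utf-8"),
    generate_reply=lambda text: f"You said: {text}",
    synthesize=lambda text: text.encode("utf-8"),
)

print(pipeline.respond(b"hello"))
```

Running voice activity detection first means silent audio never reaches the heavier recognition and generation stages.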

5 replies

Xenova

posted an update 7 months ago

Post

8563

Introducing the ONNX model explorer: Browse, search, and visualize neural networks directly in your browser. 🤯 A great tool for anyone studying Machine Learning! We're also releasing the entire dataset of graphs so you can use them in your own projects! 🤗

Check it out! 👇
Demo: onnx-community/model-explorer
Dataset: onnx-community/model-explorer
Source code: https://github.com/xenova/model-explorer

Xenova

posted an update 8 months ago

Post

3033

Reasoning models like o3 and o4-mini are advancing faster than ever, but imagine what will be possible when they can run locally in your browser! 🤯

Well, with 🤗 Transformers.js, you can do just that! Here's Zyphra's new ZR1 model running at over 100 tokens/second on WebGPU! ⚡️

Giving models access to browser APIs (like File System, Screen Capture, and more) could unlock an entirely new class of web experiences that are personalized, interactive, and run locally in a secure, sandboxed environment.

For now, try out the demo! 👇
webml-community/Zyphra-ZR1-WebGPU

1 reply

Xenova

authored a paper 8 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 200

Xenova

posted an update 10 months ago

Post

14366

We did it. Kokoro TTS (v1.0) can now run 100% locally in your browser w/ WebGPU acceleration. Real-time text-to-speech without a server. ⚡️

Generate 10 seconds of speech in ~1 second for $0.

What will you build? 🔥
webml-community/kokoro-webgpu

The most difficult part was getting the model running in the first place, but the next steps are simple:
✂️ Implement sentence splitting, allowing for streamed responses
🌍 Multilingual support (only phonemization left)
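As an illustration of the sentence-splitting step, here is a minimal sketch, assuming text arrives in arbitrary chunks and each complete sentence should be handed to the synthesizer as soon as it is available. The function name and the regex are assumptions, not Kokoro's or Transformers.js's actual splitter, and the naive rule will also split after abbreviations such as "Dr.".

```python
import re
from typing import Iterable, Iterator

# Split on whitespace that follows sentence-ending punctuation (naive rule).
_SENTENCE_END = re.compile(r"(?<=[.!?])\s+")


def stream_sentences(chunks: Iterable[str]) -> Iterator[str]:
    """Yield complete sentences as soon as they appear in a stream of text chunks."""
    buffer = ""
    for chunk in chunks:
        buffer += chunk
        parts = _SENTENCE_END.split(buffer)
        # Every part except the last is a finished sentence; the last may be incomplete.
        for sentence in parts[:-1]:
            if sentence:
                yield sentence
        buffer = parts[-1]
    # Flush whatever remains once the stream ends.
    tail = buffer.strip()
    if tail:
        yield tail


for sentence in stream_sentences(["Hello wor", "ld. How are", " you? Fine"]):
    print(sentence)
```

Each yielded sentence could be synthesized while later text is still arriving, which is what makes a streamed response possible.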

Who wants to help?

12 replies

Xenova

authored a paper 10 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 249

Team members 8