unity
/

inference-engine-whisper-tiny

Automatic Speech Recognition

unity-inference-engine

Model card Files Files and versions

inference-engine-whisper-tiny / README.md

Paul Bird

Update README.md

0468533 verified about 2 years ago

|

1.06 kB

	---
	license: apache-2.0
	library_name: unity-sentis
	pipeline_tag: automatic-speech-recognition
	---

	# Whisper-Tiny model in Unity Sentis Format

	This is the [Whisper Tiny](https://huggingface.co/openai/whisper-tiny) model tested to work in Unity 2023. It is a speech-to-text model. You feed in a 16kHz wav file and it outputs the best guess for what was said in the audio.

	## How to Use
	* Open a new scene in Unity 2023
	* Import package ``com.unity.sentis`` from the package manager.
	* Put the `RunWhisper.cs` on the Main Camera
	* Put the *.sentis files and the `vocab.json` in the Assets/StreamingAssets folder
	* Add a 16kHz mono audio file up to 30 seconds long to your project and drag on to the audioClip field.
	* IMPORTANT: The audio must be 16kHz. In the audio inspector select "Force Mono". And "Decompress on Load".
	* You can add a step to convert 44kHz or 22kHz audio to 16kHz with [this model](https://huggingface.co/unity/sentis-audio-frequency-to-16khz)

	When you press play the transcription of the audio will be displayed in the console window.