thiswillbeyourgithub/parakeet_web
NVIDIA Parakeet speech recognition for the browser (WebGPU/WASM) powered by ONNX Runtime Web
ParakeetWeb
โ ๏ธ EXPERIMENTAL WIP โ This is a heavily modified fork, purely vibe-coded. Expect bugs, breaking changes, and rough edges.
Browser-based speech-to-text running entirely client-side using NVIDIA's Parakeet TDT 0.6B v3 model (converted to ONNX format by istupakov) via WebGPU/WASM.
What It Does
- Runs speech-to-text entirely in your browser (nothing sent to servers)
- Supports WebGPU (fast) and WASM (compatible) backends
- Word-level timestamps and confidence scores
- File upload or microphone recording
- Model quantization options (fp32/int8)
- Installable as a PWA (Progressive Web App) for app-like experience
Status
- ๐ง Work in progress
- ๐งช Experimental and unstable
- ๐ Licensed under AGPLv3
- ๐จ Vibe-coded with aider.chat and Claude Code
Live Demo
A running instance is available at https://pw.olicorne.org/ (no installation required).
Quick Start
# 1.Modify .env.example into .env to set your own values
# 2.Run the demo locally with Docker
sudo docker compose up- Then visit
http://localhost:5173
Local Model Fallback
If HuggingFace is blocked or unreachable in your environment, you can serve model weights directly from the container:
# 1. Install the HuggingFace CLI and download the model files locally
pip install huggingface-hub
hf download istupakov/parakeet-tdt-0.6b-v3-onnx --local-dir ./fallback_models/istupakov__parakeet-tdt-0.6b-v3-onnx
# 2. In docker-compose.yml, uncomment the volume bind:
# - ./fallback_models/istupakov__parakeet-tdt-0.6b-v3-onnx:/app/ui/public/models/istupakov/parakeet-tdt-0.6b-v3-onnx:ro
# 3. In your .env, enable the fallback:
VITE_LOCAL_MODEL_FALLBACK=trueThe downloaded files are git-ignored. When VITE_LOCAL_MODEL_FALLBACK=true is set, the app will check for the local model files on startup and refuse to load if they are missing.
License
AGPLv3 โ See LICENSE file
Acknowledgments
- ysdede/parakeet.js โ Original project this is forked from
- nvidia/parakeet-tdt-0.6b-v3 โ The underlying ASR model by NVIDIA
- istupakov/parakeet-tdt-0.6b-v3-onnx โ ONNX conversion of the model
- istupakov/onnx-asr โ Python reference implementation
- ONNX Runtime Web โ Makes browser inference possible
Credits
This fork is based on ysdede/parakeet.js โ all the heavy lifting and original implementation credit goes there. This would not exist without their excellent work.
