126 results for “topic:deepspeech”
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Examples of how to use or integrate DeepSpeech
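Speech recognition implemented with PaddlePaddle, targeting Chinese. The project is complete and recognition quality is good. Supports training and inference on Windows and Linux, and inference on Nvidia Jetson development boards.
A streaming and non-streaming automatic speech recognition framework implemented in PyTorch, compatible with both online and offline recognition. It currently supports the Conformer, Squeezeformer, and DeepSpeech2 models, along with multiple data augmentation methods.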
Speech-to-text benchmark framework
A CLI script to generate subtitle files (SRT/VTT/TXT) for any video using either DeepSpeech or Coqui
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
DeepSpeech based forced alignment tool
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
A testing server for a speech to text service based on coqui.ai
Golang bindings for Mozilla's DeepSpeech speech-to-text library
ASR with PyTorch
Automatic Speech Recognition in Unity using Vosk library
Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech
Install Mozilla DeepSpeech on a Raspberry Pi 4
Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
Tooling for producing an Italian model (public release available) and text corpus for DeepSpeech
Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech
An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech
Open source offline speech recognition for Android using Mozilla's DeepSpeech in Termux
A MXNet implementation of Baidu's DeepSpeech architecture
Blender add-on that implements the VOCA neural network.
Raspberry Pi impersonates Nintendo Switch controller
📢 Complete V bindings for Mozilla's DeepSpeech TensorFlow based Speech-to-Text library. 📜
A PyTorch implementation of DeepSpeech and DeepSpeech2.
Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework
🌊 A crazy simple library for reading/writing WAV files in V. Zero dependencies, 100% cross-platform.
Training scripts for speech-to-text models for the Ukrainian language
Mozilla DeepSpeech in Flutter using Dart FFI