Roberts Slisans
rsxdalv
TTS WebUI Developer
Languages
Repos
174
Stars
3.4k
Forks
485
Top Language
Python
Loading contributions...
Top Repositories
A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!
SoTA open-source TTS
Frontier Open-Source Text-to-Speech
Site for sharing MusicGen + AudioGen Prompts and Creations
Repositories
174A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!
No description provided.
No description provided.
No description provided.
SoTA open-source TTS
GitHub composite action — deploy a service via OIDC-authenticated ansible-hooks webhook
artifact storage
GitHub Action to upload Debian packages to a Pository instance
Site for sharing MusicGen + AudioGen Prompts and Creations
No description provided.
No description provided.
Frontier Open-Source Text-to-Speech
No description provided.
The most powerful local music generation model that outperforms most commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.
No description provided.
No description provided.
No description provided.
Bark: A text-to-speech model
Developer focused github repo for getting Webkit/Safari on Windows
No description provided.
An extension to use Kokoro TTS in text generation webui
Generative models for conditional audio generation
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
Tortoise TTS is a high-quality text-to-speech model with voice cloning capabilities
Audiocraft provides MusicGen and MAGNeT models for high-quality music and audio generation
No description provided.
No description provided.
Object-oriented handling of audio data, with GPU-powered augmentations, and more.
StyleTTS2 is a text-to-speech model that generates high-quality speech with controllable style
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.