84 results for “topic:coqui-tts”
Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
Open-source, fully private and local alternative to NotebookLM. Chat with your documents, generate audio summaries, and ground AI in your own sources—built with Supabase, N8N on a React frontend using Ollama for local inference
Persian/Farsi text to speech(TTS) training using coqui tts
Text to Speech using Coqui TTS + RVC
The world’s first game framework that lets you talk to AI in real time — locally. Supports any custom voice.
Automatically generate faceless YouTube Shorts from trending topics using AI scripts, TTS, and FFmpeg. Fully containerized and one-click deployable
Open source Speechify alternative. Read PDFs and EPUBs with local models.
Free voice cloning for creators using Coqui XTTS-v2 on Google Colab. Clone your voice with just a few minutes of audio. Complete guide to build your own notebook.
SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversational and interactive experience. It uses LLMs available through Ollama and has capabilities for extending functionalities through a modular tool system.
A framework for AI WhatsApp calls using Whisper, Coqui TTS, GPT-3.5 Turbo, Virtual Audio Cable, and the WhatsApp Desktop App.
Local-first CLI that turns Markdown scripts into multi-speaker podcast-style audio using Coqui XTTS v2.
Rust bindings to the https://github.com/coqui-ai TTS library
Various tools to clone a voice
Genie in the Box: Distill Whisper STT => Mistral-7B => Phind/Phind-CodeLlama-34B-v2 => GPT 3.5 => Coqui's TTS/OpenAI TTS
Gui for users who use the coqui-TTS vits model.
A Voice-First AI Companion
Docker for multiple TTS Engines with a GRadio interface
DoyenTalker uses deep learning techniques to generate personalized avatar videos that speak user-provided text in a specified voice. The system utilizes Coqui TTS for text-to-speech generation, along with various face rendering and animation techniques to create a video where the given avatar articulates the speech.
Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features
With this tool you can create custom TTS dataset from video or audio.
Voice cloning using coqui-TTS
The TTS Platform leverages the power of Coqui TTS, an advanced open-source framework, to deliver a high-quality text-to-speech (TTS) experience. It caters to diverse user needs, offering natural-sounding voice generation with extensive customization options.
A lightweight voice companion, optimized for macOS.
(wip) python command-line Text-to-Speech (TTS) tool esp. for German, leveraging numerous endpoints like orpheus, piper, outetts, kokoro, csm, edge, coqui, kartoffelbox, etc
EchoSight is a tool that helps visually impaired individuals by audibly describing images taken with a Raspberry Pi Camera or inputted via image path or URL across different operating systems.
Training XTTS V2 and PEFT LORA Text-to-Speech (TTS)
An AI-powered backseat coach to fix your skill issue and/or ruin your day :). Supports popular models from OpenAI, Anthropic and Google and self-hosted. Customizable prompting and voice cloning thanks to ElevenLabs and Coqui TTS.
Synthesize speech using state-of-the-art open and closed-source tools
Speakeasy GPT is a Jupyter notebook that utilizes several natural language processing utilities to provide a seamless and low-latency speech interface to ChatGPT and other large language models.