GitHunt — Discover GitHub Repositories

4,384 results for “topic:tts”

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python59.5k9.4kUpdated 4 hours ago

deep-learningpythonpytorchtensorflowtts+1

RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python55.6k6.1kUpdated just now

text-to-speechttsvitsvoice-clonevoice-cloneai+1

unslothai/unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python53.5k4.4kUpdated just now

agentdeepseekdeepseek-r1fine-tuninggemma+15

coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python44.7k6.0kUpdated just now

deep-learningglow-ttshifiganmelganmulti-speaker-tts+14

mudler/LocalAI

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more. Features: Generate Text, MCP, Audio, Video, Images, Voice Cloning, Distributed, P2P and decentralized inference

Go43.4k3.6kUpdated just now

aiapiaudio-generationdecentralizeddistributed+15

2noise/ChatTTS

A generative speech model for daily dialogue.

Python38.9k4.2kUpdated just now

agentchatchatgptchatttschinese+12

babysor/MockingBird

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python36.9k5.2kUpdated 5 hours ago

aideep-learningpytorchspeechtext-to-speech+1

myshell-ai/OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python36.0k4.0kUpdated 7 hours ago

text-to-speechttsvoice-clonezero-shot-tts

fishaudio/fish-speech

SOTA Open Source TTS

Python25.2k2.1kUpdated 2 hours ago

llamatransformerttsvallevits+2

mastra-ai/mastra

From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.

TypeScript21.8k1.7kUpdated just now

agentsaichatbotsevalsjavascript+8

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python19.9k2.3kUpdated just now

audio-generationcantonesechatbotchatgptchinese+14

index-tts/index-tts

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python19.2k2.4kUpdated just now

bigvgancross-lingualindexttstext-to-speechtts+2

readest/readest

Readest is a modern, feature-rich ebook reader designed for avid readers offering seamless cross-platform access, powerful tools, and an intuitive interface to elevate your reading experience.

TypeScript18.5k999Updated just now

androidcross-platformebookebook-readerepub+10

DrewThomasson/ebook2audiobook

Generate audiobooks from e-books, voice cloning & 1158+ languages!

Python18.4k1.5kUpdated just now

audiobookaudiobookschinesecolab-notebookdocker+11

pot-app/pot-desktop

🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.

JavaScript17.3k826Updated just now

linuxmacosocrpotpot-app+6

NVIDIA-NeMo/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python16.9k3.4kUpdated 6 hours ago

asrdeeplearninggenerative-aimachine-translationneural-networks+5

PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Python12.5k2.0kUpdated 23 hours ago

asrcode-switchconformerkwspunctuation-restoration+15

rhasspy/piper

A fast, local neural text to speech system

C++10.6k916Updated just now

speech-synthesistext-to-speechtts

rany2/edge-tts

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python10.2k968Updated just now

speech-synthesistext-to-speechtts

mozilla/TTS

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Jupyter Notebook10.1k1.3kUpdated 1 day ago

dataset-analysisdeep-learningganttsglow-ttsmelgan+11

krillinai/KrillinAI

Video translation and dubbing tool powered by LLMs. The video translator offers 100 language translations and one-click full-process deployment. The video translation output is optimized for platforms like YouTube，TikTok. AI视频翻译配音工具，100种语言双向翻译，一键部署全流程，可以生抖音，小红书，哔哩哔哩，视频号，TikTok，Youtube等形态的内容成适配

Go9.7k842Updated just now

dubbinglocalizationttsvideo-transcriptionvideo-translation

shidahuilang/shuyuan

阅读书源-香色闺阁+用心读书+源阅+阅读3.0书源+源阅读+爱阅书香+千阅+花火阅读+读不舍手+番茄+喜马拉雅+漫画+听书+书源+IPTV源+IPA巨魔应用=自动更新

Python9.6k537Updated just now

aiyueshuxiangipaiptvreadershuyuan+5

jianchang512/clone-voice

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具，使用你的音色或任意声音来录制音频

Python8.9k984Updated just now

clonevoicespeech-analysisststtsvoice-assistant

fishaudio/Bert-VITS2

vits2 backbone with multilingual-bert

Python8.7k1.3kUpdated just now

agentbertbert-vitsbert-vits2fish+6

netease-youdao/EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python8.5k746Updated 5 hours ago

aideep-learningemotionemotivoicemulti-speaker+8

Plachtaa/VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python8.0k781Updated 9 hours ago

emotional-speechgpttext-to-speechtransformer-architecturetts+2

jaywalnut310/vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python7.8k1.4kUpdated 17 hours ago

deep-learningpytorchspeech-synthesistext-to-speechtts

jianchang512/ChatTTS-ui

一个简单的本地网页界面，使用ChatTTS将文字合成为语音，同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with support for external API interfaces.

Python7.5k906Updated 19 hours ago

chatttstts

GetStream/Vision-Agents

Open Vision Agents by Stream. Build Vision Agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

Python7.3k564Updated just now

agentic-aiagentsaiai-agentsrealtime+6

myshell-ai/MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

Python7.2k1.0kUpdated 1 hour ago

chineseenglishfrenchjapanesekorean+4

Page 1 of 34