"topic:f5-tts" — Search

19 results for “topic:f5-tts”

A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Echo-TTS, Qwen3-TTS, Cozy Voice 3, Step Audio EditX, IndexTTS-2, Chatterbox (classic and multilingual 23-lang), F5-TTS, Higgs Audio 2 and VibeVoice with unlimited text length, SRT timing, Character support, and many audio tools

Python73865Updated 11 hours ago

ai-audioaudioaudio-editingaudio-generationaudio-processingchatterboxcomfyuicozy-voice-3echo-ttsf5f5-ttshiggs-audioindextts-2qwen3-ttsrvctext-to-speechttsvibevoicevoice-cloningvoice-conversion

RaduBolbo/F5-TTS-Emotional-CFG

Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS

Python305Updated 6 days ago

classifier-free-guidanceemotionemotional-speechf5-ttsf5ttsfine-grainedfine-tuningflow-matchingpythonpytorchspeech-generationspeech-synthesistext-to-speechttsvoice-cloningzero-shot

gokhaneraslan/tts-dataset-generator

With this tool you can create custom TTS dataset from video or audio.

Python135Updated 1 week ago

coqui-aicoqui-ttscustom-datasetdata-sciencef5-ttsstttacotron2tortoise-ttsttsvitsxttsv2

jeantimex/F5-TTS-Server

F5-TTS server APIs for voice cloning and text-to-speech generation with interactive waveform visualization.

JavaScript82Updated 2 weeks ago

f5-ttsfastapitext-to-speech

2tocom/F5-TTS-Vietnamese-Google-Colab

Vietnamese TTS, Chuyển văn bản thành giọng nói tiếng Việt, text to speech tiếng Việt Nam

Python63Updated 4 hours ago

colabf5-ttsf5-tts-colabf5-tts-vietnamesegoogle-colabtext-to-speechtext-to-speech-viet-namtext-to-speech-vnttstts-viettts-vietnamtts-vietnamesetts-vnvietviet-nam-text-to-speechvietnamvietnamesevietnamese-ttsvnvn-tts

eamag/f5-tts-durationArchived

Duration predictor trainer for f5 tts mlx (DE)

Python21Updated 8 months ago

f5-ttshuggingfacemlxpytorch

harshitx077/ComfyUI-Qwen3-ASR

🎤 Transcribe audio to text seamlessly with ComfyUI-Qwen3-ASR, supporting 52 languages and dialects for accurate and efficient speech recognition.

Python20Updated 2 hours ago

aiai-audioasr-modelaudio-processingchatterboxemotionf5-ttshiggs-audioqwen3qwen3-asrspeech-to-texttext-to-speechttsvibevoicevoice-conversion

mrigankad/Mimicry

Mimicry is a complete, self-hosted zero-shot voice cloning system. Built on top of F5-TTS, it features a high-performance FastAPI backend, a built-in voice management frontend, an asynchronous Python SDK, and advanced audio processing for clean, professional speech synthesis.

Python10Updated 5 days ago

ai-voiceaudio-processingf5-ttsfastapipython-sdkself-hostedspeech-synthesistext-to-speechttsvoice-cloningzero-shot-tts

gateoneh92/Flow-Matching-TTS

⚡ Non-autoregressive TTS using Conditional Flow Matching - 5-20x faster than AR models

Python10Updated 1 week ago

deep-learningf5-ttsflow-matchingmb-istftnon-autoregressivepytorchspeech-synthesistext-to-speechttsvoicebox

ronakbothraa/translingo

No description provided.

TypeScript10Updated 2 months ago

clerk-authf5-ttsflaskflask-restfulgemini-apinextjsnllb200reactjswhisper

Trshadow45/ComfyUI-Qwen3-TTS

🎙️ Enhance voice synthesis with ComfyUI-Qwen3-TTS, featuring advanced voice cloning, emotion-aware ASR, and unlimited multi-role dubbing.

Python10Updated 2 hours ago

aiai-audioaudio-editingaudio-generationaudio-processingchatterboxemotionf5f5-ttsgenerative-aihiggs-audioqwentext-to-speechttsvibevoicevoice-conversion

KevinBonnoron/sirene

Self-hosted text-to-speech platform with multi-backend support, voice cloning, and a modern web UI.

TypeScript00Updated 3 days ago

aibunchatterboxcosyvoicef5-ttsfastapikokoromonorepoopenaudiopiperpocketbaseqwen-ttsreactself-hostedspeech-synthesistext-to-speechttsvoice-cloningvoice-generationwhisper

icosane/hyacinthia

Simple graphical front‑end for F5‑TTS

Python00Updated 1 month ago

f5-ttsfaster-whisperomogrepyqtqfluentwidgetstext-to-speechtts

rwmicro/voice-backend

Voice backend that provides acces to Kokoro, Chatterbox and F5-TTS.

Python00Updated 1 month ago

chatterbox-ttsf5-ttskokoro-ttssttttsvoice-recognition

mcp-tool-shop-org/original_voice-soundboard

Production-ready TTS library and MCP server for AI assistants. Multi-voice synthesis, real-time streaming, SSML support, emotional speech, and sound effects.

Python00Updated 4 days ago

23-languagesai-assistantchatterboxclaudediffusion-transformerf5-ttskokoromcpmcp-servermultilingualonnxpythonreal-timespeech-synthesisssmlstreamingtext-to-speechttsvoice-cloningwebsocket

Yacinewhatchandcode/VoiceCloning

🎙️ Real-Time TTS & Voice Cloning Pipeline — F5-TTS · PyTorch · Gradio · Voice Agent

Python00Updated 23 hours ago

audiodeep-learningf5-ttsgradiopythonpytorchttsvoice-cloning

Nomannazir/f5-tts-fastapi

Open-source FastAPI wrapper for F5-TTS. A powerful Text-to-Speech API with real-time voice cloning and streaming support.

Python00Updated 2 days ago

aiaudiof5-ttsfastapihuggingfacemachine-learningopenaispeech-synthesistext-to-speechvoice-generation

ipriyanshuuu/qwen3-tts

🎤 Clone voices easily and efficiently with Qwen3-TTS, a local GPU-accelerated tool for voice synthesis using just one audio sample.

Python00Updated 2 hours ago

aiai-audioapple-siliconaudioaudio-generationaudio-processingchatterboxcomfyuicudaf5-ttsgradiom1m3qwen-ttsqwen3qwen3-tts-uirustspeech-synthesisvoice-cloningwhisper

moaz11112/qwen3-tts-enhanced

🎤 Clone voices in seconds with Qwen3-TTS Enhanced. Enjoy local, GPU-powered multi-reference cloning and audio preprocessing for high-quality outputs.

Python00Updated 2 hours ago

audio-editingaudio-processingcebchatterboxcomfyuicudadeep-learningdockerf5-ttsflash-attentiongenerative-aigradiopythonstable-diffusiontext-to-imagexmlseclibs