19 results for “topic:f5-tts”
A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Echo-TTS, Qwen3-TTS, Cozy Voice 3, Step Audio EditX, IndexTTS-2, Chatterbox (classic and multilingual 23-lang), F5-TTS, Higgs Audio 2 and VibeVoice with unlimited text length, SRT timing, Character support, and many audio tools
Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS
With this tool you can create custom TTS dataset from video or audio.
F5-TTS server APIs for voice cloning and text-to-speech generation with interactive waveform visualization.
Vietnamese TTS, Chuyển văn bản thành giọng nói tiếng Việt, text to speech tiếng Việt Nam
Duration predictor trainer for f5 tts mlx (DE)
🎤 Transcribe audio to text seamlessly with ComfyUI-Qwen3-ASR, supporting 52 languages and dialects for accurate and efficient speech recognition.
Mimicry is a complete, self-hosted zero-shot voice cloning system. Built on top of F5-TTS, it features a high-performance FastAPI backend, a built-in voice management frontend, an asynchronous Python SDK, and advanced audio processing for clean, professional speech synthesis.
⚡ Non-autoregressive TTS using Conditional Flow Matching - 5-20x faster than AR models
No description provided.
🎙️ Enhance voice synthesis with ComfyUI-Qwen3-TTS, featuring advanced voice cloning, emotion-aware ASR, and unlimited multi-role dubbing.
Self-hosted text-to-speech platform with multi-backend support, voice cloning, and a modern web UI.
Simple graphical front‑end for F5‑TTS
Voice backend that provides acces to Kokoro, Chatterbox and F5-TTS.
Production-ready TTS library and MCP server for AI assistants. Multi-voice synthesis, real-time streaming, SSML support, emotional speech, and sound effects.
🎙️ Real-Time TTS & Voice Cloning Pipeline — F5-TTS · PyTorch · Gradio · Voice Agent
Open-source FastAPI wrapper for F5-TTS. A powerful Text-to-Speech API with real-time voice cloning and streaming support.
🎤 Clone voices easily and efficiently with Qwen3-TTS, a local GPU-accelerated tool for voice synthesis using just one audio sample.
🎤 Clone voices in seconds with Qwen3-TTS Enhanced. Enjoy local, GPU-powered multi-reference cloning and audio preprocessing for high-quality outputs.