127 results for “topic:vits”
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Easily train a good VC model with voice data <= 10 mins!
SOTA Open Source TTS
SoftVC VITS Singing Voice Conversion
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
so-vits-svc fork with realtime support, improved interface and more features.
vits2 backbone with multilingual-bert
A simple, high-quality voice conversion tool focused on ease of use and performance.
Core Engine of Singing Voice Conversion & Singing Voice Clone
GPT-SoVITS ONNX Inference Engine & Model Converter
移动版二次元 AI 老婆聊天器
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!
多个SVC/TTS的C++推理库
A simple VITS HTTP API, developed by extending Moegoe with additional features.
So-VITS-SVC 本地部署使用帮助文档,提供Colab笔记本 So-VITS-SVC Local Deployment Document and provide Colab notebook
singing voice change based on whisper, and lora for singing voice clone
🦖Pytorch implementation of popular Attention Mechanisms, Vision Transformers, MLP-Like models and CNNs.🔥🔥🔥
SummerTTS 是一个基于C++的独立编译的中文和英文语音合成项目,可以本地运行不需要网络,而且没有额外的依赖,一键编译完成即可用于中文和英文的语音合成。SummerTTS is a standalone Chinese and English speech synthesis(TTS) project that has almost no dependency and could be easily used for Chinese TTS with just one key build out
liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project
AivisSpeech: AI Voice Imitation System - Text to Speech Software
vits Android部署
Probing the representations of Vision Transformers.
An app for creating audio-based content such as song covers and speech using Retrieval-based Voice Conversion.
🚀 RVC + UVR = A perfect set of tools for voice cloning, easily and free!
Persian/Farsi text to speech(TTS) training using coqui tts
Singing Voice Synthesis based on VITS, different from VISinger
Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei
AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine
图片搜索引擎,很简单。三步构建属于你自己的图片搜索引擎,掌握向量数据库和以图搜图、文本搜索图片。