12 results for “topic:vall-e”
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
An unofficial PyTorch implementation of the audio LM VALL-E
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
ChatGPT, GenerativeAI and LLMs Timeline
OpenMusic: SOTA Text-to-music (TTM) Generation
AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
An unofficial PyTorch implementation of VALL-E
Applying deep learning to translate animation and re-generate audio.
Unofficial PyTorch implementation of VALL-E: zero-shot text-to-speech and voice cloning using neural codec language models. Train and synthesize speech from text with a single reference audio.
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
VALL-E-X の日本語ツール