"topic:vall-e" — Search

12 results for “topic:vall-e”

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Python9.7k797Updated 9 months ago

audio-generationaudio-synthesisaudioldmauditemiliafastspeech2maskgctmusic-generationnaturalspeech2singing-voice-conversionspeech-synthesistext-to-audiotext-to-speechvall-evitsvocodervoice-conversion

Plachtaa/VALL-E-XArchived

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python8.0k780Updated 2 years ago

emotional-speechgpttext-to-speechtransformer-architecturettsvall-evoice-clone

enhuiz/vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Python3.0k405Updated 2 years ago

audio-lmpytorchtext-to-speechttsvall-evalle

lifeiteng/vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python2.2k334Updated 6 months ago

chatgptin-context-learninglarge-language-modelstext-to-speechttsvall-evalle

hollobit/GenAI_LLM_timeline

ChatGPT, GenerativeAI and LLMs Timeline

95656Updated 1 year ago

agichatgptchatgpt-apiclaudecopilotgenerative-aigenerative-modelsgptlangchainlarge-language-modelsllamallmmidjourneyopenaipalm-estable-diffusiontimelinetransformervall-e

ivcylc/OpenMusic

OpenMusic: SOTA Text-to-music (TTM) Generation

Python63472Updated 8 months ago

aiai-musicai-music-generationai-music-generatoraudioldmdiffusion-modelsdiffusion-transformerdithifi-ganmdtmusic-aimusic-ai-architecturesmusic-generationtext-to-audiotext-to-audio-aitext-to-musictext-to-music-transformervall-e

zhenye234/xcodec

AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Python29523Updated 5 months ago

audioaudio-codeccodecgptlanguage-modelmusicself-supervised-learningsemanticsoundspeechspeech-language-modeltext-to-musictext-to-soundtext-to-speechtokenizervall-e

E-

e-c-k-e-r/vall-e

An unofficial PyTorch implementation of VALL-E

Python887Updated 7 months ago

audio-lmpytorchtext-to-speechttsvall-e

NewJerseyStyle/anime-translatorArchived

Applying deep learning to translate animation and re-generate audio.

Python62Updated 1 year ago

emotional-speechspeech-to-speech-translationtransformer-architecturevall-evoice-clonewhisper

KuchikiRenji/vall-e

Unofficial PyTorch implementation of VALL-E: zero-shot text-to-speech and voice cloning using neural codec language models. Train and synthesize speech from text with a single reference audio.

Python20Updated 1 month ago

autoregressivedeep-learninmachine-learningnarneural-codecpythonpytorchmencodecspeechspeech-synthesistext-to-texttransformerttsvall-evoicevoice-cloningvoice-synthesiszero-shot

mahshid1378/VALL-E

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python10Updated 11 months ago

chatgptin-context-learninglarge-language-modelstext-to-speechttsvall-evalle

kuwacom/VALL-E-X-Tools

VALL-E-X の日本語ツール

Jupyter Notebook10Updated 2 years ago

aittsvall-e