"topic:speech-generation" — Search

19 results for “topic:speech-generation”

Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!

Jupyter Notebook · 36057 · Updated 3 years ago

disentanglement-learning one-shot speech speech-generation voice-conversion

CEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.

artificial-intelligence computer-vision deep-learning edge-computing iot lego-mindstorms natural-language-processing oak-d opencv raspberry-pi smarthome speech-generation speech-recognition visual-programming

ga642381/SpeechGen

《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》

775 · Updated 2 years ago

deep-learning large-language-models prompt speech-generation speech-llm speech-processing

ictnlp/NAST-S2x

A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.

Python · 775 · Updated 1 year ago

non-autoregressive non-autoregressive-transformers simultaneous-translation speech-generation speech-to-speech-translation

youngsheen/GPST

[ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer

Python · 693 · Updated 1 year ago

autoregressive fairseq language-model moshi speech-generation

caizexin/GenVC

Self-supervised Generative LM-based Voice Conversion

Python · 5411 · Updated 11 months ago

speech-generation voice-anonymization voice-cloning voice-conversion

Otosaku/OtosakuTTS-iOS

Swift library for offline text-to-speech synthesis on iOS/macOS. Generate natural speech directly on device using CoreML-optimized FastPitch and HiFiGAN models. No internet required, fully private.

Swift · 517 · Updated 7 months ago

coreml ios ios-library ml-models speech-generation speech-synthesis speech-synthesizer spm swift text-to-speech tts voice-synthesis

RaduBolbo/F5-TTS-Emotional-CFG

Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS

Python · 305 · Updated 2 weeks ago

classifier-free-guidance emotion emotional-speech f5-tts f5tts fine-grained fine-tuning flow-matching python pytorch speech-generation speech-synthesis text-to-speech tts voice-cloning zero-shot

Nicolas-Prevot/TTS_playground

Unified toolkit for testing and comparing multiple state-of-the-art open-source Text-to-Speech (TTS) models (with voice cloning, multilingual support, and audio samples).

Python · 81 · Updated 1 month ago

open-source python speech-generation speech-synthesis text-to-speech tts voice-cloning voice-synthesis

nidhiyashwanth/SesameAILabs-csm

A conversational speech model (CSM) that generates natural-sounding speech with context awareness and consistent audio quality. Supports multi-speaker conversations and maintains contextual understanding across turns, ensuring consistent audio output throughout the conversation.

Jupyter Notebook · 52 · Updated 12 months ago

context-aware conversational-ai csm moshi sesame sesameailabs speech-generation

Vidyut/vidyut-tts

Streamlit frontend for Coqui-tts

Python · 30 · Updated 2 years ago

speech-generation text-to-speech tts tts-frontend

bharathkumaarr/Text2Speech

Text to Speech generator. Supports multiple accents.

CSS · 10 · Updated 1 year ago

speech-generation text-to-speech text-to-speech-converter

takenori-y/lfeats

A unified interface to extract hidden representations from various speech foundation models