32 results for “topic:hifi-gan”

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

deep-learning, gan, hifi-gan, pytorch, speech-synthesis, text-to-speech, tts, vocoder

OpenMusic: SOTA Text-to-music (TTM) Generation

ai, ai-music, ai-music-generation, ai-music-generator, audioldm, diffusion-models, diffusion-transformer, dit, hifi-gan, mdt, music-ai, music-ai-architectures, music-generation, text-to-audio, text-to-audio-ai, text-to-music, text-to-music-transformer, vall-e

keonlee9420/DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Python · 347 · 44 · Updated 4 years ago

ddpm, deep-neural-networks, diffgan-tts, diffspeech, diffusion, diffusion-models, fastspeech, gan, generative-model, hifi-gan, multi-speaker-tts, neural-tts, non-ar, non-autoregressive, pytorch, single-speaker-tts, speech-synthesis, text-to-speech, tts

keonlee9420/PortaSpeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

Python · 341 · 38 · Updated 4 years ago

deep-neural-networks, fastspeech, generative-model, hifi-gan, high-quality, mel-gan, neural-tts, non-ar, non-autoregressive, normalizing-flows, portable-tts, pytorch, speech-synthesis, text-to-speech, tts, vae

keonlee9420/Comprehensive-Transformer-TTS

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS

Python · 328 · 43 · Updated 3 years ago

comprehensive, deep-learning, fastspeech, fastspeech2, hifi-gan, mel-gan, multi-speaker, neural-tts, non-ar, non-autoregressive, pytorch, single-speaker, sota, speech-synthesis, supervised, text-to-speech, transformer, tts, ultimate-tts, unsupervised

NTT123/vietTTS

Vietnamese Text to Speech library

Python · 255 · 104 · Updated 2 years ago

deep-learning, hifi-gan, tacotron, text-to-speech, tts-engines, vietnam, vietnamese, vocoder

keonlee9420/Comprehensive-E2E-TTS

A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate E2E-TTS

Python · 147 · 19 · Updated 3 years ago

deep-learning, end-to-end, fastspeech2, hifi-gan, jets, multi-speaker, neural-tts, non-ar, non-autoregressive, pytorch, single-speaker, sota, speech-synthesis, text-to-speech, text-to-wav, tts, ultimate-tts, unsupervised

nipponjo/tts-arabic-pytorch

🎙️ Arabic TTS models (Tacotron2, FastPitch)

Jupyter Notebook · 137 · 33 · Updated 3 months ago

arabic, arabic-tts, deep-learning, fastpitch, hifi-gan, hifigan, multi-speaker-tts, python, pytorch, speech, speech-synthesis, tacotron2, tacotron2-pytorch, text-to-speech, torchaudio, tts, tts-model, vocos, voice-synthesis

rishikksh20/Avocodo-pytorch

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Python · 122 · 15 · Updated 3 years ago

avocodo, gan, generative-adversarial-network, hifi-gan, pytorch, speech-synthesis, text-to-speech, tts, vocoder

Voice-Privacy-Challenge/Voice-Privacy-Challenge-2022

Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software

Python · 69 · 15 · Updated 1 year ago

anonymization, anonymization-metrics, asr, asv, attack-model, de-identification, hifi-gan, kaldi, mcadams, metrics, privacy, privacy-protection, speaker-recognition, speech-processing, speech-recognition, speech-synthesis, voice-anonymization, voice-conversion, voice-privacy, voice-privacy-challenge

keonlee9420/Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS and several techniques to improve the robustness and efficiency of the model.

Python · 48 · 15 · Updated 2 years ago

autoregressive, comprehensive, deep-learning, diagonal-guided-attention, efficiency, hifi-gan, mel-gan, multi-speaker, neural-tts, pytorch, reduction-factor, robustness, single-speaker, speech-synthesis, tacotron, tacotron2, text-to-speech, tts

hwRG/End-to-End-TTS-Fine-Tune

Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.

Python · 29 · 9 · Updated 2 years ago

end-to-end, fastspeech2, fine-tune, hifi-gan, tts

lucadellalib/discrete-wavlm-codec

A neural speech codec based on discrete WavLM representations

Python · 25 · 3 · Updated 1 year ago

clustering, codec, hifi-gan, k-means, neural-speech-coding, pytorch, quantization, self-supervised-learning, speech-synthesis, token-extraction, wavlm

manhph2211/ViTTS

In this repo, I developed a step-by-step pipeline for a standard multi-speaker text-to-speech system. In general, I used PortaSpeech as the acoustic model and iSTFTNet as the vocoder...

Python · 12 · 0 · Updated 2 years ago

deepspeech, hifi-gan, istftnet, mfa, mosnet, multispeaker-speech-synthesis, normalizing-flow, portaspeech, realtime-tts, speech-synthesis, text-to-speech, vietnamese-text-to-speech, vietnamese-tts, vocoder

ssmlkl/MnTTS2

This is the experimental description of MnTTS2.

Jupyter Notebook · 11 · 5 · Updated 1 year ago

fastspeech2, hifi-gan, mongolian, multi-speaker-tts, tts

NTT123/hifigan-tpu

Train HiFi-GAN on TPU

Python · 10 · 0 · Updated 3 years ago

gan, hifi-gan, jax, pax, text-to-speech, tts, vocoder

jik876/hifi-gan-demo

Audio samples from "HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis"

HTML · 10 · 4 · Updated 5 years ago

deep-learning, gan, hifi-gan, speech-synthesis, text-to-speech, tts

nipponjo/tts-german-pytorch

🎙️ German TTS (FastPitch) with Thorsten voice / emotional

Python · 9 · 0 · Updated 1 year ago

deep-learning, emotional-speech, fastpitch, german, german-language, hifi-gan, python, pytorch, speech, speech-synthesis, text-to-speech, torchaudio, tts

PeechApp/tts-peech

DelightfulTTS with Hifi-GAN and Univnet vocoders

Jupyter Notebook · 8 · 2 · Updated 1 year ago

delightfultts, hifi-gan, tts, univnet

34j/neural-source-filter

Python package for NSF and NSF-HiFi-GAN (unofficial)

Python · 7 · 0 · Updated 5 days ago

hifi-gan, mypy, neural-source-filter, nsf, python, pytorch, tts, vocoder, voice-conversion

ducnt18121997/Viet-Transformer-TTS

This is a PyTorch implementation of a non-autoregressive Transformer with unsupervised duration learning, based on Transformer and Conformer blocks, with support for the Vietnamese language.

Python · 5 · 1 · Updated 11 months ago

conformer, hifi-gan, multispeaker, non-autoregressive, speech-synthesis, text-to-speech, transformer, unsupervised-learning, vietnamese, voice-clone

watchstep/glow-tts-jejueo

Jeju-language speech synthesis (work in progress)

Jupyter Notebook · 3 · 0 · Updated 3 years ago

glow-tts, hifi-gan, jejueo, korean, tts

yasuohasegawa/ios-fastspeech2-hifigan

On-device iOS Text-to-Speech using FastSpeech2 and HiFi-GAN (Japanese & English)

C++ · 2 · 0 · Updated 8 months ago

coreml, deep-learning, espnet, fastspeech2, hifi-gan, openjtalk, speech-synthesis, swift, swiftui, text-to-speech, tts

lordzuko/SpeakingStyle

Aligning latent space of speaking style with human perception using a re-embedding strategy

Jupyter Notebook · 1 · 0 · Updated 2 years ago

blizzard-challenge, fastspeech2, hifi-gan, pytorch, pytorch-distributeddataparallel, speaking-style, speech-synthesis, vocoder

khaykingleb/hifi-gan

Neural vocoder for high-fidelity speech synthesis (implementation of the referenced research)

Python · 1 · 1 · Updated 4 years ago

gan, hifi-gan, pytorch, tts, vocoder

muhammadVohra787/speech-synthesis-app

The Speech Synthesis App converts text into natural-sounding speech using advanced models, providing an interactive platform for audio generation.

Python · 0 · 0 · Updated 1 year ago

fastspeech2, hifi-gan, texttospeech, tts

andrew264/AudioExpts

Doing devious stuff with audio

Jupyter Notebook · 0 · 0 · Updated 1 year ago

deep-neural-networks, hifi-gan, melspectrogram, pytorch, pytorch-lightning

hwRG/HiFi-GAN-Pytorch

If you have WAV files and transcripts, you can train HiFi-GAN right away.

Python · 0 · 0 · Updated 3 years ago

gan, hifi-gan, tts, vocoder

free001style/HiFiGAN

HiFiGAN Implementation

Python · 0 · 0 · Updated 1 year ago

hifi-gan, tts, vocoder

RALYHDB/ASV-spoofing

This repository contains the code and resources associated with my Bachelor's Thesis. The project evaluates the performance of various automatic speaker verification (ASV) systems against identity spoofing attacks generated using text-to-speech (TTS) synthesis technologies.

Python · 0 · 0 · Updated 1 year ago

asv, curve-det, diffwave, eer, fastspeech2, hifi-gan, neural-network-models, spoofing-attacks, tacotron2, tts, voice-cloning

Page 1 of 2