Repos
31
Stars
169
Forks
24
Top Language
Python
Loading contributions...
Top Repositories
Repositories
31Grapheme to phoneme conversion with deep learning.
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice cloning.
No description provided.
GUI for a Text-To-Speech Model trained on Gothic Dataset
No description provided.
No description provided.
No description provided.
unofficial vits2-TTS implementation in pytorch
HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement
Multilingual G2P in 100 languages
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
MOS score prediction by fine-tuned wav2vec2.0 model
No description provided.
TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Also for voice clone!
No description provided.
HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
Unofficial pytorch implementation of BigVGAN: A Universal Neural Vocoder with Large-Scale Training
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
No description provided.
No description provided.
:mag: Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.
A PyTorch-based Speech Toolkit
No description provided.
Thorsten - Open German Voice Dataset
DeepSpeech is an open source speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
TL with CNN for cancer survival prediction using gene-expression data
A LaTeX template for Bachelor and Master thesis
Deep learning for Text to Speech