267 results for “topic:speech-enhancement”
A PyTorch-based Speech Toolkit
End-to-End Speech Processing Toolkit
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Noise suppression using deep filtering
The PyTorch-based audio source separation toolkit for researchers
AI-powered speech denoising and enhancement
General Speech Restoration
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
A must-read paper for speech separation based on neural networks
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
TensorFlow 2.x implementation of the DTLN real-time speech denoising model, with TF-Lite, ONNX, and real-time audio processing support.
Voice Conversion Tool Kit
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
The official implementation of GTCRN, an ultra-lightweight SE model.
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
The dataset of Speech Recognition
Python implementation of performance metrics in Loizou's Speech Enhancement book
Tools for Speech Enhancement integrated with Kaldi
Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech
A PyTorch implementation of Wave-U-Net, adapted to speech enhancement.
A minimal unofficial PyTorch implementation of "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN)
Deep-learning-based speech enhancement using Keras or PyTorch, designed to be easy to use
PyTorch-based speech enhancement toolkit.
Real-time GCC-NMF Blind Speech Separation and Enhancement
Two-talker speech separation with LSTM/BLSTM using permutation invariant training.
Collection of EM algorithms for blind source separation of audio signals
General Speech Restoration
Simple delay-and-sum, MVDR, and CGMM-MVDR beamformers
An unofficial PyTorch implementation of Microsoft's PHASEN