Top Repositories
This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.
A toolkit dedicate for speech evaluation.
Public examples for ESPNet2 demonstration
End-to-End Speech Processing Toolkit
SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)
Repositories
58A toolkit dedicate for speech evaluation.
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
End-to-End Speech Processing Toolkit
No description provided.
No description provided.
Public examples for ESPNet2 demonstration
No description provided.
No description provided.
No description provided.
No description provided.
No description provided.
This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Vox-Profile Benchmark
A song aesthetic evaluation toolkit trained on SongEval.
UTokyo-SaruLab MOS Prediction System
Python implementation of performance metrics in Loizou's Speech Enhancement book
Reference-aware automatic speech evaluation toolkit
Unified automatic quality assessment for speech, music, and sound.
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)
A simple library for Fréchet Audio Distance (FAD) calculation
No description provided.
Learning audio concepts from natural language supervision
Speech Human Evaluation Estimation Toolkit (SHEET)
This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.
No description provided.
This code is to run the WARP-Q speech quality metric.
Python code for Fairseq maintained by ESPnet
Summer-Program(AI-for-Beginners)