Repos: 24
Stars: 384
Forks: 104
Top Language: Python
Top Repositories
Finetune VITS and MMS using HuggingFace's tools
Fine-tune your own MusicGen with LoRA
A list of scripts/notebooks I'd like to keep handy
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
Repositories (24)
Fine-tune your own MusicGen with LoRA
Finetune VITS and MMS using HuggingFace's tools
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
A list of scripts/notebooks I'd like to keep handy
No description provided.
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Toolkit for using and training Parler-TTS, a high-quality text-to-speech model.
No description provided.
Large scale audio inference examples using `accelerate` and `datasets`
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Predicts the level of noise and reverberation in your audio files
No description provided.
High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.
Public repo for HF blog posts
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Foundational Models for State-of-the-Art Speech and Text Translation
Some of the work I did during the MVA master's program.
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
No description provided.
The Hugging Face Course on Transformers for Audio
🔊 Text-Prompted Generative Audio Model