31 results for “topic:wav2vec”
Self-Supervised Speech Pre-training and Representation Learning Toolkit
speech to text with self-supervised learning based on wav2vec 2.0 framework
A live speech recognition using Facebooks wav2vec 2.0 model.
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf
Training scripts for Speech-To-Text models for Ukrainian language
Wav2vec resources and models for Brazilian Portuguese
Wave2vec 2.0 Recognize pipeline
Speech to Text with self-supervised learning based on wav2vec 2.0 framework using Hugging Face's Transformer
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition
Speeech Recognition for Indic languages.
Fine-tuning wav2vec2 to for Pathological Speech Processing
No description provided.
A library version of wav2vec 2.0 framework for Automatic Speech Recognition task.
Create high-resolution visually dubbed videos with DINet
Evaluating the Effectiveness of Transformer Layers in Wav2Vec 2.0, XLS-R, and Whisper for Speaker Identification Tasks
No api-keys | local | llama3.1 For language studying and live translation
KcELECTRA와 Wav2Vec, PCGrad 최적화를 활용한 119 긴급 신고 분석 멀티모달·멀티태스크 학습 PyTorch 구현
👩🏻💻 ( ͡❛ ‿●‿ ͡❛) wav2vec
This repository contains scripts to prune Wav2vec2 using a neuroevolution-based method. More details about this method can be found in the paper Compressing Wav2vec2 for Embedded Applications.
Deep audio modeling
A repo to make installation and training of a wav2vec model easier
ASR model generates transcription from audio waves, then correct the word spelling
Smart home controller simulator, receiving voice commands from a microphone.
Diagnosis of the onset of parkinsons disease, using wav2vec as a feature extractor and a random forest as a classifier. This is an easy to use suite, refer to the README for usage guides.
Interactive AI tool for voice-based interview practice with real-time feedback on grammar, confidence, and performance.
A JAX / NNX implementation of a VQ-VAE for audio compression
A wav2vec like Automatic Speech Recognition model
asr for German Language
A collection of speech language models with a focus on acoustic codes