"topic:wav2vec" — Search | GitHunt

© 2026 GitHunt · tansuasici

31 results for “topic:wav2vec”

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python · 2.5k · 528 · Updated 1 week ago

apc cpc data2vec decoar decoar2 distilhubert hubert mockingjay pase representation-learning self-supervised-learning speech-pretraining speech-representation tera unispeech-sat vq-apc vq-wav2vec wav2vec wav2vec2 wavlm

mailong25/self-supervised-speech-recognition

Speech to text with self-supervised learning, based on the wav2vec 2.0 framework

Python · 379 · 116 · Updated 4 years ago

self-supervised-learning semi-supervised-learning speech-recognition speech-to-text unsupervised-learning vietnamese-speech-recognition wav2vec

oliverguhr/wav2vec2-live

Live speech recognition using Facebook's wav2vec 2.0 model.

Python · 378 · 58 · Updated 2 years ago

asr pyaudio speech speech-recognition speech-to-text wav2vec wav2vec2

arxyzan/data2vec-pytorch

PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI

Python · 185 · 25 · Updated 2 years ago

beit data2vec fairseq huggingface pytorch roberta self-supervised-learning wav2vec

shangeth/SpeakerProfiling

Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf

Python · 68 · 22 · Updated 4 years ago

audio-processing classification cnn lstm speaker-recognition speaker-verification speech speech-processing wav2vec

robinhad/voice-recognition-ua

Training scripts for Speech-To-Text models for the Ukrainian language

Jupyter Notebook · 39 · 2 · Updated 2 years ago

asr coqui-ai deepspeech speech-recognition speech-to-text stt ukrainian ukrainian-language wav2vec

lucasgris/wav2vec4bp

Wav2vec resources and models for Brazilian Portuguese

Jupyter Notebook · 37 · 2 · Updated 3 years ago

automatic-speech-recognition brazilian-portuguese dataset portuguese speech-to-text wav2vec wav2vec2

loretoparisi/wave2vec-recognize-docker

Wav2vec 2.0 Recognize pipeline

Python · 33 · 10 · Updated 5 years ago

asr automatic-speech-recognition docker kenlm pytorch wav2letter wav2vec

bhattbhavesh91/wav2vec2-huggingface-demo

Speech to Text with self-supervised learning, based on the wav2vec 2.0 framework, using Hugging Face's Transformers

Jupyter Notebook · 29 · 14 · Updated 4 years ago

facebook-wav2vec self-supervised-learning speech speech-processing speech-recognition speech-to-text unsupervised-learning wav2vec

daanzu/wav2vec2_stt_python

Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition

Python · 23 · 3 · Updated 4 years ago

python pytorch speech speech-recognition speech-to-text wav2vec wav2vec2

notAI-tech/IndicASR

Speech Recognition for Indic languages.

Python · 13 · 3 · Updated 4 years ago

asr indian-language pytorch speech-recognition speech-to-text telugu transformers wav2vec wav2vec2

jvel07/wav2vec2_patho

Fine-tuning wav2vec2 for Pathological Speech Processing

Jupyter Notebook · 6 · 1 · Updated 2 years ago

computational-paralinguistics deep-learning dnn-embeddings emotion-recognition fine-tuning pytorch sound-processing speech-embeddings speech-processing speech-recognition transformers utterance wav2vec wav2vec2 x-vector

thisisHJLee/Fine-Tuning-of-XLSR-Wav2Vec2-on-Korean

No description provided.

Jupyter Notebook · 4 · 1 · Updated 3 years ago

avr nlp stt transformers wav2vec wav2vec2

phanxuanphucnd/wav2asr

A library version of the wav2vec 2.0 framework for automatic speech recognition.

Python · 4 · 4 · Updated 4 years ago

asr speech-recognition speech-to-text wav2asr wav2vec wav2vec2 wave2asr

abdur75648/DINet-Inference

Create high-resolution visually dubbed videos with DINet

Python · 3 · 0 · Updated 1 year ago

avatar-generation deepspeech dinet dubbing lipgan lipsync openface video-generation wav2l wav2vec

slinusc/speaker_identification_evaluation

Evaluating the Effectiveness of Transformer Layers in Wav2Vec 2.0, XLS-R, and Whisper for Speaker Identification Tasks

Jupyter Notebook · 3 · 1 · Updated 2 months ago

wav2vec wav2vec2 whisper xls-r

Katashynskyi/Voice_assistant_UA_EN

No API keys | local | llama3.1. For language study and live translation

Python · 3 · 1 · Updated 1 year ago

chatbot edge-tts language-classification llama llm ollama-python speech-recognition speech-to-text streamlit voice-assistant wav2vec

PhamPham2S/119-Multimodal-Emergency-Analysis

PyTorch implementation of multimodal, multi-task learning for analyzing 119 emergency calls, using KcELECTRA, Wav2Vec, and PCGrad optimization

Jupyter Notebook · 3 · 0 · Updated 2 months ago

grad-monitor kcelectra multi-modal multi-task-learning ordinal-regression pcgrad pytorch wav2vec

kimtth/huggingface-wav2vec

👩🏻‍💻 ( ͡❛ ‿●‿ ͡❛) wav2vec

2 · 1 · Updated 4 years ago

huggingface wav2vec

oswaldoludwig/Pruning-pre-trained-models-using-evolutionary-computation

This repository contains scripts to prune Wav2vec2 using a neuroevolution-based method. More details about this method can be found in the paper Compressing Wav2vec2 for Embedded Applications.

Shell · 2 · 1 · Updated 2 years ago

asr e2e-asr evolutionary-algorithms evolutionary-computation genetic-algorithm huggingface model-compression pre-trained-model pruning pruning-algorithms pruning-optimization pytorch transformer-models wav2vec wav2vec2

Deep audio modeling

Python · 1 · 0 · Updated 4 years ago

audio deep-learning pytorch speech-recognition wav2vec wav2vec2

NabinAdhikari674/wav2vec

A repo to make installation and training of a wav2vec model easier

Python · 1 · 0 · Updated 5 years ago

MarwaAbdelAal/ASR-correction-model

An ASR model generates a transcription from audio waves, then corrects the word spelling

Python · 1 · 0 · Updated 4 years ago

asr nlp python3 wav2vec

mradovic38/voice-command-recognition

Smart home controller simulator, receiving voice commands from a microphone.

Jupyter Notebook · 1 · 1 · Updated 1 year ago

audio-classification serbian smart-home smarthome-controller speech-analysis speech-recognition speech-to-text voice-commands voice-control voice-recognition wav2vec wav2vec2

omar-A-hassan/Parkinsons

Diagnosis of the onset of Parkinson's disease, using wav2vec as a feature extractor and a random forest as a classifier. This is an easy-to-use suite; refer to the README for usage guides.

Python · 1 · 0 · Updated 4 months ago

ci classification disease-detection machine-learning wav2vec

TheViper008/AI-Powered-Interview-Simulator

Interactive AI tool for voice-based interview practice with real-time feedback on grammar, confidence, and performance.

Python · 0 · 0 · Updated 8 months ago

bootstrap django javasript python wav2vec whisper-ai

A JAX / NNX implementation of a VQ-VAE for audio compression

Jupyter Notebook · 0 · 0 · Updated 1 year ago

jax nnx vqvae wav2vec

d1pankarmedhi/w2vASR

A wav2vec-like Automatic Speech Recognition model

0 · 0 · Updated 8 months ago

asr automatic-speech-recognition pytorch speech-to-text wav2vec

hciays/ailab_ss2022

ASR for the German language

Python · 0 · 0 · Updated 3 years ago

asr conformer german transformer wav2vec

ogunlao/speech_language_models

A collection of speech language models with a focus on acoustic codes

Python · 0 · 0 · Updated 11 months ago

acoustic-model llm speech-recognition speech-synthesis wav2vec

Page 1 of 2