"topic:timit" — Search

33 results for “topic:timit”

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

Python2.4k446Updated 1 week ago

asrdeep-learningdeep-neural-networksdnndnn-hmmgrukaldilstmlstm-neural-networksmultilayer-perceptron-networkpytorchrecurrent-neural-networksrnnrnn-modelspeechspeech-recognitiontimit

mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Python1.2k270Updated 6 hours ago

artificial-intelligenceasraudioaudio-processingcnnconvolutional-neural-networksdeep-learningdigital-signal-processingfilteringneural-networkspythonpytorchsignal-processingspeaker-identificationspeaker-recognitionspeaker-verificationspeech-processingspeech-recognitiontimitwaveform

speechbrain/speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

HTML37431Updated 1 week ago

beamformingdeep-learningdeeplearninglibrispeechneural-networkneural-networksspeaker-identificationspeaker-recognitionspeaker-verificationspeechspeech-analysisspeech-apispeech-emotion-recognitionspeech-processingspeech-recognitionspeech-recognizerspeech-separationspeech-to-textspeechrecognitiontimit

philipperemy/timit

The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.

323165Updated 2 days ago

darpaspeechtimittimit-dataset

hirofumi0810/tensorflow_end2end_speech_recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

Python314119Updated 3 weeks ago

asrattention-mechanismautomatic-speech-recognitionbeam-searchcsjctcend-to-endend-to-end-learningjoint-ctc-attentionlibrispeechspeech-recognitionspeech-to-texttensorflowtimittimit-dataset

Diamondfan/CTC_pytorch

CTC end -to-end ASR for timit and 863 corpus.

Python21949Updated 1 month ago

ctcdecoderkaldipytorchtimit

HawkAaron/RNN-Transducer

MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks

Python13931Updated 1 year ago

asrend-to-endmxnetrnn-transducerrnnt-jointrnnt-modelsequence-transductionspeech-recognitiontimittransducers

WindQAQ/listen-attend-and-spellArchived

Tensorflow implementation of "Listen, Attend and Spell" authored by William Chan. This project utilizes input pipeline and estimator API of Tensorflow, which makes the training and evaluation truly end-to-end.

Python8930Updated 1 year ago

listen-attend-and-spellseq2seqspeech-recognitionspeech-to-texttensorflowtimit

grausof/keras-sincnet

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Python7526Updated 1 month ago

artificial-intelligenceasraudioaudio-processingcnnconvolutional-neural-networksdeep-learningdigital-signal-processingfilteringkerasmachine-learningneural-networkspeaker-recognitionspeaker-verificationspeech-processingspeech-recognitiontensorflowtimitwaveform

hirofumi0810/asr_preprocessing

Python implementation of pre-processing for End-to-End speech recognition

Python6923Updated 1 year ago

attention-mechanismautomatic-speech-recognitioncsjctcdatasetend-to-endlibrispeechpreprocessingspeech-recognitionswitchboardtimittimit-datasettranscription

matthijsvk/TIMITspeech

Speech recognition on the TIMIT (or any other) dataset

Python4411Updated 4 months ago

neural-networkphonemesspeechspeech-recognitiontheanotimit

mravanelli/pytorch_MLP_for_ASR

This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.

Perl4013Updated 2 months ago

asrcudadeep-learningdeep-neural-networksfeedforward-neural-networkkaldikaldi-asrmlpmultilayer-perceptronneural-networkspythonpytorchspeech-recognitiontimit

AppleHolic/PytorchSR

Pytorch based phoneme recognition (TIMIT phoneme classification)

Python355Updated 3 months ago

cbhgminimalgrupaperpytorchspeechrecognitiontimit

mravanelli/theano-kaldi-rnn

THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.

Perl3413Updated 4 months ago

deep-learningdeep-neural-networksgated-recurrent-unitsgrukaldirecurrent-neural-networksrnntheanotheano-kaldi-rnnstimit

zhaoyu611/Automatic_Speech_Recognition_with_Multi_Models

A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popular cells (most are LSTM and its variants) and models (unidirectioanl RNN, bidirectional RNN, ResNet and so on). Moreover, you are welcome to play with self-defined cells or models.

Python198Updated 1 year ago

acoustic-modelautomatic-speech-recognitionctcdeep-learninglstmrnntensorflowtimit

biyoml/PyTorch-End-to-End-ASR-on-TIMIT

Attention-based end-to-end ASR on TIMIT in PyTorch

Python186Updated 3 days ago

asrattention-seq2seqend-to-endpytorchtimit

orbxball/timit-preprocessor

Extract mfcc vectors and phones from TIMIT dataset

Shell160Updated 11 months ago

data-preprocessingdeep-learningmfccphonespeech-recognitiontimittimit-dataset

anicolson/SPN-ASI

Sum-Product Networks (SPNs) for Robust Automatic Speaker Identification.

Python113Updated 2 years ago

deep-xiideal-binary-maskmarginalisationmarginalizationmissing-datamissing-feature-theoryrobust-speaker-identificationrobust-speaker-recognitionrobust-speaker-verificationrobustnessspeaker-identificationspeaker-verificationspn-speaker-modelsum-product-networkstimittimit-dataset

colinator/timit_utils

Python/numpy/pandas convenience wrapper for the TIMIT database.

Jupyter Notebook113Updated 3 years ago

audioaudio-recordingsphoneme-transcriptionsphonemespythontimittimit-databasetimit-utilstranscription

drkostas/bench-utils

A collection of benchmarking tools.

Python110Updated 1 year ago

benchmarkbenchmarkingtimertimit

dingzeyuli/SpEAR-speech-database

A database of clean and noisy speech for audio research

95Updated 10 months ago

audiodatasetspeechtimitwaveform

WindQAQ/tensorflow-wavenetArchived

Implementation of WaveNet network based on Tensorflow.

Python93Updated 3 years ago

speech-recognitionspeech-to-texttensorflowtimitwavenet

KrishnaDN/LAS-Pytorch

Implementation of the paper "Listen, Attend and Spell" Paper in Pytorch

Python73Updated 2 years ago

asrasr-modellisten-attend-and-spellseq2seq-modelspeech-respeech-to-texttimit

jackyzha0/Speech2Braille

[🏆 Silver Medal at CWSF] Tensorflow Implementation of TIMIT Deep BLSTM-CTC with Tensorboard Support

Python60Updated 9 months ago

blstmblstm-ctcbraillectcraspberry-pitensorflowtimit

HanSeokhyeon/Speech_recognition_for_English_and_Korean

다양한 feature를 이용한 음성인식 LAS model입니다. (한국어는 개발예정)

Python41Updated 3 years ago

lasmfccphonemetimit

BradleyHe/TIMIT-Voice-Mixer

Python project which mixes and tests sentences from the TIMIT dataset using LAS

Python20Updated 8 months ago

timit

BradleyHe/TIMIT-Alignment

TIMIT forced alignment with the Montreal Forced Aligner

Python20Updated 2 years ago

timit

BradleyHe/TIMIT-Phoneme-Mixer

Python project that mixes phonemes from the TIMIT dataset

Python20Updated 4 years ago

timit

hammaad2002/SimpleASRmodel

A simple CRDNN based ASR model for my own understanding of how ASR works and are trained. (Work in progress) If anyone finds any error or have any suggestion please do let me know.

Jupyter Notebook20Updated 2 years ago

asrasr-modelcrdnnlibrispeechpytorchpytorch-implementationpytorch-tutorialspeech-recognitionsupervised-learningtimittimit-dataset

kipmccharen/sys6016_DL_project

pretrained SpeechBrain wav2vec seq2seq+CTC model trained on TIMIT dataset. Created by Kip McCharen, Siddharth Surapaneni, and Pavan Bondalapati

Python21Updated 4 years ago

aprperspeechbraintimitwav2vec2

Page 1 of 2