SpringerNLP/Chapter12

Chapter 12: End-to-end Speech Recognition

Deep Speech 2

This case study explores end-to-end automatic speech recognition (ASR) using the Deep Speech 2 architecture, implemented in PyTorch and trained on the Common Voice dataset.

Running the Docker image with GPU

docker run -it --runtime=nvidia springernlp/chapter_12ds:latest

Requirements

nvidia-docker2 (provides the nvidia runtime passed to docker run via --runtime=nvidia)

The container starts a Jupyter notebook server.
Open the Chapter 12 notebook and follow the commands inside it.
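
If the notebook server is not reachable from the host browser, publishing the container port may help. The sketch below assumes Jupyter listens on its default port, 8888, inside the container; the README does not confirm this, and the image may already handle it. The -p mapping and the variable names are illustrative additions, not part of the original command.

```shell
# Image name and --runtime flag are taken from the README command above.
IMAGE="springernlp/chapter_12ds:latest"

# Assumption: Jupyter inside the container uses its default port, 8888.
CONTAINER_PORT=8888
# Hypothetical choice: any free port on the host works here.
HOST_PORT=8888

# Build the command and print it for inspection before running it.
RUN_CMD="docker run -it --runtime=nvidia -p ${HOST_PORT}:${CONTAINER_PORT} ${IMAGE}"
echo "$RUN_CMD"

# Uncomment to launch the container with the port published:
# $RUN_CMD
```

With the port published, the notebook would typically be available at http://localhost:8888 using the token that Jupyter prints in the container log.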

Book Reference

More information can be found in the book Deep Learning for NLP and Speech Recognition (Springer).

Languages

Jupyter Notebook 99.8%, Dockerfile 0.2%

Created February 23, 2019

Updated November 10, 2025