MC

mcagri/DatasetLabeler

Audio Dataset Generator and Labeler

asr data-labeling-tools machine-learning mlops-workflow prefect

Audio Dataset Labelling Automation

This project aims to streamline the dataset creation and labelling processes for Automatic Speech Recognition (ASR) systems.
Project consists of 3 parts.

AudioFile Ingestion
Automatic Labelling with ASR (Whisper Model)
Manual Labelling for improved dataset quality

WebRTC-VAD implementation is taken from this repository.
https://github.com/wiseman/py-webrtcvad/blob/master/example.py

On this page

Languages

Python100.0%

Contributors

MIT License

Created June 14, 2024

Updated August 13, 2024