57 results for “topic:forced-alignment”
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Command line utility for forced alignment using Kaldi
Text to speech alignment using CTC forced alignment
Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voice isolation, language detection and more.
一站式全自动字幕生成软件,下载、转录、翻译、压制全流程覆盖,无需人工介入 / One-stop automated subtitle generator. Handles downloading, transcription, translation, and hardcoding—zero human intervention required.
📖🎧 A tool for creating ebooks with synchronized text and audio (EPUB3 with Media Overlays)
DeepSpeech based forced alignment tool
🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.
Timething is a library for aligning text transcripts with their audio recordings.
Extract phoneme-level timestamps from speeh audio.
📈 A forced aligner intended for synchronization of narrated text
Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analyze phone conversations from customer service and call centers.
Solution for Zalo AI Challenge 2022 - Lyrics Alignment
ManaTTS is the largest open Persian speech dataset with 114+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.
Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable and phoneme onset positions
scripts to align a given wave to its transcription using trained models by Kaldi
Workflow for forced alignment between languages
Фонограми та синтагми: інструменти обробки
WriteMyVideo's purpose is to help people create videos quickly and easily by simply typing out the video’s script and a description of images to include in the video.
An open source computer-aided translation tool for audios and videos
Gentle and praatio scripts for easy forced alignment
Align various Sanskrit texts and audio
A Weakly Supervised Forced Alignment for disluent speech
Python and command-line utility for aligning audio to a transcript.
Forced alignment decoder for Whisper.
Split ELAN Annotation Files and corresponding speech files into a corpus format for common ASR and Forced Aligners
A corpus that can be used to train English-to-Italian End-to-End Speech-to-Text Machine Translation models
A framework for generating labeled audio recordings of single-spoken keywords via automatic forced alignment.
A free licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject
Aligning a Japanese audio-book with its text and create Anki sentence cards with audio.