"topic:forced-alignment" — Search

57 results for “topic:forced-alignment”

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

alignment, audio, cli, dtw, espeak, espeak-ng, festival, ffmpeg, forced-alignment, linux, macos, nlp, python, smil, speech, srt, text, text-to-speech, tts, windows

MontrealCorpusTools/Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Python · 1.8k · 285 · Updated 2 weeks ago

acoustic-model, forced-alignment, grapheme-to-phone, kaldi, pronunciation-dictionary, python

MahmoudAshraf97/ctc-forced-aligner

Text to speech alignment using CTC forced alignment

Python · 463 · 79 · Updated 3 weeks ago

forced-alignment

echogarden-project/echogarden

Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voice isolation, language detection and more.

TypeScript · 439 · 41 · Updated 6 months ago

command-line, forced-alignment, language-detection, language-identification, node-js, source-separation, speech, speech-alignment, speech-recognition, speech-synthesis, speech-to-text, speech-translation, text-to-speech, voice-isolation

corvo007/MioSub

One-stop automated subtitle generator. Handles downloading, transcription, translation, and hardcoding, with zero human intervention required.

TypeScript · 409 · 32 · Updated 1 week ago

alignment, ass-subtitles, captions, diarization, ffmpeg, forced-alignment, gemini-api, gemini-subtitle-pro, i18n, speaker-diarization, speech-to-text, srt-subtitles, substation-alpha, subtitle-generator, subtitle-translation, subtitles, subtitles-generator, transcription, whisper

r4victor/syncabook

📖🎧 A tool for creating ebooks with synchronized text and audio (EPUB3 with Media Overlays)

HTML · 337 · 30 · Updated 2 years ago

audiobooks, ebooks, epub3, forced-alignment, librivox

mozilla/DSAlign

DeepSpeech based forced alignment tool

Python · 239 · 30 · Updated 5 years ago

deepspeech, forced-alignment

saurabhshri/CCAligner

🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.

C++ · 172 · 34 · Updated 6 years ago

aligner, api, ccextractor, cli, closed-captions, cpp, forced-alignment, google-summer-of-code, gsoc, gsoc-2017, karaoke, phonetic-transcriptions, pocketsphinx, speech-recognition, subtitle-alignment, subtitles, transcription, word-level-alignment

feldberlin/timething

Timething is a library for aligning text transcripts with their audio recordings.

Jupyter Notebook · 130 · 14 · Updated 1 year ago

alignment, audio, cli, forced-alignment, huggingface, nlp, python, speech, speech-recognition, tts

tabahi/bournemouth-forced-aligner

Extract phoneme-level timestamps from speech audio.

Python · 120 · 12 · Updated 2 weeks ago

alignment, forced-alignment, phoneme-prediction, phoneme-recognition, phonemes, speech, speech-processing, speech-recognition, text-to-speech, timestamps, tts, tts-dataset, word

r4victor/afaligner

📈 A forced aligner intended for synchronization of narrated text

Python · 102 · 14 · Updated 7 months ago

forced-alignment

bunyaminergen/Callytics

Callytics is an advanced call analytics solution that uses speech recognition and large language model (LLM) technologies to analyze phone conversations from customer service and call centers.

Python · 78 · 10 · Updated 11 months ago

denoising, diarization, forced-alignment, llama3, llm, openai, opensource, sentiment-analysis, speech-emotion-recognition, speech-processing, speech-recognition, speech-to-text, summary, topic-modeling, transcription, voice-activity-detection, voice-recognition

Telegram-Zalo/zac2022-lyric-alignment

Solution for Zalo AI Challenge 2022 - Lyrics Alignment

Python · 68 · 18 · Updated 3 years ago

deep-learning, dynamic-programming, forced-alignment, music-alignment, pytorch, vietnamese, wav2vec2

MahtaFetrat/ManaTTS-Persian-Speech-Dataset

ManaTTS is the largest open Persian speech dataset with 114+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.

Jupyter Notebook · 49 · 5 · Updated 8 months ago

data-collection, data-preprocessing, dataset-preparation, forced-alignment, mana-tts, manatts, persian, persian-speech, speech-corpus, speech-data-collection, speech-dataset, speech-processing, speech-synthesis, text-to-speech, text-to-speech-dataset, tts, tts-dataset

ronggong/interspeech2018_submission01

Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable and phoneme onset positions

Python · 46 · 4 · Updated 7 years ago

beijing-opera, cnn, forced-alignment, hmm, hsmm, interspeech, keras, singing-voice

amirharati/kaldi-alligner

Scripts to align a given wave file to its transcription using models trained with Kaldi

Shell · 36 · 7 · Updated 6 years ago

alignment, asr, forced-alignment, kaldi, kaldi-asr

jhdeov/interlingual-MFA

Workflow for forced alignment between languages

Python · 24 · 2 · Updated 2 months ago

cross-language, cross-language-alignment, forced-alignment, low-resource-languages, montreal-forced-aligner, multilingual-alignment

proger/uk

Recordings and syntagmas: processing tools

Python · 21 · 0 · Updated 8 months ago

dataset-generation, forced-alignment, hmm, kaldi, speech-recognition, ukrainian, ukrainian-language

joshchen984/WriteMyVideo-Backend

WriteMyVideo's purpose is to help people create videos quickly and easily by simply typing out the video’s script and a description of images to include in the video.

Python · 20 · 9 · Updated 2 years ago

forced-alignment, gentle, python, rq, video, video-editing, youtube

xulihang/Silhouette

An open-source computer-aided translation tool for audio and video

B4X · 19 · 0 · Updated 4 months ago

computer-aided-translation, forced-alignment, mac, speech-recognition, subtitle, vad, whisper

BayesForDays/gently

Gentle and praatio scripts for easy forced alignment

18 · 2 · Updated 3 years ago

forced-alignment, phonetics, phonology, praat, psycholinguistics, speech-processing, textgrid, textgridtools

avinashvarna/audio_alignment

Align various Sanskrit texts and audio