64 results for “topic:phonemes”
End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow
Software Automatic Mouth - Tiny Speech Synthesizer
Grapheme to phoneme conversion with deep learning.
Deep Voice: Real-time Neural Text-to-Speech
A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to those of native speech.
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
High-Fidelity Neural Phonetic Posteriorgrams
Extract phoneme-level timestamps from speech audio.
Aligns text (lyrics) with monophonic singing voice (audio). The algorithm uses structural segmentation to divide the audio into structural segments and then uses hidden Markov models to obtain the alignment within each segment. The final alignment for each song is the concatenation of the lyric timestamps within the segments.
Jason Riggle's chart of phonological features in JSON format + extras
Multilingual Grapheme to Phoneme
Speech recognition on the TIMIT (or any other) dataset
JS speech analyzer for fast speech analysis and labeling
PyTorch model for contextless phoneme prediction from speech audio
Python wrapper for Espeak and Mbrola, for simple local TTS
Load your phoneme files and generate the lipsync animation from your recorded audio files. No video or webcam needed!
Threlte Live – A SvelteKit + Three.js platform for live-streaming 3D VRM avatars. Features real-time chat with Google Generative AI, ElevenLabs TTS lip-sync, Mixamo animations, and Cloudflare edge deployment – perfect for creating interactive, high-performance streaming overlays.
Python concatenative Hindi TTS using a phoneme database
A statistical random word generator for constructed languages.
End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow
A Python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based on https://github.com/nawarhalabi/Arabic-Phonetiser
a piecemeal natural language processing library
XphoneBR is a Brazilian Portuguese transformer-based grapheme-to-phoneme and normalization library that leverages recent deep learning technology and is optimized for use in production systems such as TTS. In particular, the library should be accurate, fast, and easy to use.
Tajweed-aware Quranic phonemizer / grapheme to phoneme (G2P) converter
Python/numpy/pandas convenience wrapper for the TIMIT database.
Webapp for creating interactive pronunciation guides for any English word.
Given a target word and a set of words, find the word which best rhymes with the target
Hebrew - English Transliteration Engine
match strings by how they sound
Convert phonemes to graphemes for Vietnamese