"topic:phonemes" — Search

Aligns text (lyrics) with monophonic singing voice (audio). The algorithm uses structural segmentation to segment the audio into structures and then uses hidden markov models to obtain alignment within segments. The final alignment is concatenation of time stamps of lyrics within the segments for each song.

Python9322Updated 8 years ago

alignmentlyricsphonemessegmentation

anna-hope/phonemes

Jason Riggle's chart of phonological features in JSON format + extras

Python549Updated 1 year ago

computational-linguisticsipa-symbolslinguisticsphonemesphoneticsphonological-featuresphonology

jcsilva/multilingual-g2p

Multilingual Grapheme to Phoneme

Shell515Updated 10 years ago

espeakg2plexiconphonemes

matthijsvk/TIMITspeech

Speech recognition on the TIMIT (or any other) dataset

Python4411Updated 8 years ago

neural-networkphonemesspeechspeech-recognitiontheanotimit

tabahi/WebSpeechAnalyzer

JS speech analyzer for fast speech analysis and labeling

JavaScript393Updated 6 months ago

audio-analysisaudio-processingfeaturefeature-engineeringfeature-extractionformant-detectionmusicmusic-information-retrievalmusic-visualizerphonemessignal-processingspectrumspectrum-analyzerspeechspeech-analysisspeech-processingspeech-recognition

tabahi/contexless-phonemes-CUPE

pytorch model for contexless-phoneme prediction from speech audio

Python324Updated 4 months ago

allophoneslinguisticsphoneme-predictionphoneme-recognitionphonemesspeech-processingspeech-recognitionspeech-to-text

hadware/voxpopuli

Python wrapper for Espeak and Mbrola, for simple local TTS

Python3016Updated 1 year ago

audioespeaklanguagembrolaphonemespython3-6python37tts-enginesvoicevoiceswavwrapper

ReForge-Mode/UniLipSync

Load your phoneme files and generate the lipsync animation from your recorded audio files. No video or webcam needed!

C#306Updated 3 years ago

ffaunihanlipsyncphonemephoneme-extractorphoneme-predictionphonemesreforgemodeunityunity3dunivrmvroidvroid-hubvroidstudio

dexvdev/svelte-vrm-live

Threlte Live – A SvelteKit + Three.js platform for live-streaming 3D VRM avatars. Features real-time chat with Google Generative AI, ElevenLabs TTS lip-sync, Mixamo animations, and Cloudflare edge deployment – perfect for creating interactive, high-performance streaming overlays.

TypeScript267Updated 7 months ago

ai-agentsai-animationai-animeai-avatarsai-girlfriendelevenlabslip-synclipsynclivestreammixamo-animationsphoneme-conversionphonemessveltesveltejssveltekitthreejsthreltettsvrm

Raj2503/Python-Text-To-Speech-Hindi

Python Hindi Concatenative Based TTS using Phoneme Database

Python256Updated 4 years ago

concatenativehindi-languagenlpphonemesphonetic-transcriptionspython-hindipython-ttspython3text-to-speechttstts-engines

fluorine/ConlangWordGenerator

A statistical random words generator for constructed languages.

Ruby211Updated 12 years ago

conlangsconstructed-languageesperantolanguagelinguisticsphonemesruby

venusdev85/Speech-Recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Python200Updated 7 years ago

audiocnndata-processingdeep-learningend-to-endfeature-vectorlayer-normalizationlstmphonemesrnnrnn-encoder-decoderspeech-recognitiontensorflowtimit-dataset

motazsaad/ara-pronunciation-tool

A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based on https://github.com/nawarhalabi/Arabic-Phonetiser

Python166Updated 8 years ago

arabic-nlpdiacriticsphonemespronunciationpronunciation-dictionary

shnewto/ttaw

a piecemeal natural language processing library

Rust143Updated 2 months ago

alliterationcmucmudictcratesdouble-metaphonelanguagemetaphonenaturalnatural-languagenatural-language-processingnlpphonemesphonesprocessingpronouncepronounciationrhymerustsyllables

traderpedroso/xphoneBR

XphoneBR is a Brazilian portuguese transformer base grapheme-to-phoneme and normalization tool modeling library that leverages recent deep learning technology and is optimized for usage in production systems such as TTS. In particular, the library should be accurate, fast, easy to use

Python120Updated 1 year ago

g2pgraphemephonemephoneme-conversionphonemesportugueseportuguese-braziliantext-to-speechtts

Hetchy/Quranic-Phonemizer

Tajweed-aware Quranic phonemizer / grapheme to phoneme (G2P) converter

Python123Updated 5 days ago

arabicarabic-diacriticsg2pgrapheme-to-phonemeipaphoneme-conversionphonemesphonetiserqurantajweed

colinator/timit_utils

Python/numpy/pandas convenience wrapper for the TIMIT database.

Jupyter Notebook113Updated 7 years ago

audioaudio-recordingsphoneme-transcriptionsphonemespythontimittimit-databasetimit-utilstranscription

heyseth/phonemenal

Webapp for creating interactive pronunciation guides for any English word.