159 results for “topic:language-identification”
A collection of sample apps to demonstrate how to use Google's ML Kit APIs on Android and iOS
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
The most accurate natural language detection library for Go, suitable for short text and mixed-language text
The most accurate natural language detection library for Rust, suitable for short text and mixed-language text
The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, speech recognition, forced alignment, speech translation, voice isolation, language detection and more.
A SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/accents), English, code-switching, and both speech and singing ASR. FireRedVAD supports speech/singing/music in 100+ langs. FireRedLID supports 100+ langs and 20+ zh dialects. FireRedPunc supports zh and en.
Simple embedding based text classifier inspired by fastText, implemented in tensorflow
Textpipe: clean and extract metadata from text
⚡️ 80x faster Fasttext language detection out of the box | Split text by language
Vietnamese NLP Toolkit for Node
💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch
Code for the paper Language Identification Using Deep Convolutional Recurrent Neural Networks
A TensorFlow-based spoken language identification
Fast and accurate natural language detection. Detector written in Javascript. Nito-ELD, ELD.
✨ Split text by languages (e.g. 你喜欢看アニメ吗 -> 你喜欢看 | アニメ | 吗) for NLP tasks (e.g. parse, TTS). Powered by fasttext and budoux
Fast and accurate natural language detection. Detector written in PHP. Nito-ELD, ELD.
This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The text that includes words from two languages such as Hindi written in roman script, mixed with English.
End to End Dialect Identification using Convolutional Neural Network
End-to-end spoken language identification out of the box.
fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-hant)
Spoken Language Identification on Common Voice and AudioSet using Deep Learning
Babel Street Analytics Client Library for Python
Targetted language identifier, based on FastText and Hunspell.
CodeSwitch is a NLP tool, can use for language identification, pos tagging, name entity recognition, sentiment analysis of code mixed data.
AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.
Multi-Langauge Identification