71 results for “topic:indian-languages”
Resources and tools for Indian language Natural Language Processing
A collaborative catalog of NLP resources for Indic languages
Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.com/AI4Bharat/IndicBERT
indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2
A configurable engine for analysing multi-lingual and multi-modal content.
Resources to go with the Indic NLP Library
Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/IndicXlit
Towards Building Text-To-Speech Systems for the Next Billion Users - Microsoft Research Intern Work - Accepted at ICASSP 2023
Software and Resources for Mitigating Online Gender Based Violence in India
Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kannada, Malayalam, Urdu, Bengali)
Xlit-Crowd: Hindi-English Transliteration Corpus
Curated list of publicly available parallel corpora for Indian Languages
Python library for converting numbers to words for all Indian Languages.
Tooling to play around with multilingual machine translation for Indian Languages.
A Python NLP Toolkit for Gujarati (work in progress)
An LSTM-CRF classifier for NER in Telugu, an Indian language.
This repository hosts my experiments for the project I did with OffNote Labs.
Small demo showing how MuRIL (Multilingual Representations for Indian Languages: a BERT model pre-trained on 17 Indian languages) understands Indian languages better
A deep learning-based Speech Emotion Recognition (SER) model trained primarily on Indian languages. Designed for applications in call centers, sentiment analysis, and accessibility tools.
A Python client library for interacting with Bhashini services, including Automatic Speech Recognition (ASR), Neural Machine Translation (NMT), and Text-to-Speech (TTS) for 13 Indian languages.
Repository for pre-trained wav2vec 2.0 models on 7 Indian languages
Clean Indian code-mixed text before it reaches your LLM.
IndicF5: High-Quality Text-to-Speech for Indian Languages, including voice cloning
🤖 D.A.V.I.D AI - Advanced AI assistant with voice control, gesture recognition & multi-language support. Privacy-first, offline-first, 15 languages. By Nexuzy Tech Ltd.
Magento 2 module to integrate Digital India Bhashini Translation Plugin (v3) — multilingual translations for Indian languages, admin controls, and skip-translation support.
Script to collect, scrape, and clean sentences in Chhattisgarhi
Translations for Aaptaha.
This repository demonstrates usage of Amazon Bedrock Claude 3 models for Indian languages. Use cases include, but are not limited to: 1. information extraction, 2. question answering, 3. summarisation, 4. translation, and 5. transliteration of content in Indian languages such as Hindi, Telugu, Tamil, Malayalam, Marathi, and Kannada.
BHRAM-IL: A Benchmark for Hallucination Recognition and Assessment in Multiple Indian Languages
Fast transliteration for web text inputs (input, textarea, contenteditable) with a JS/WASM hybrid runtime.