82 results for “topic:amharic”
Syllable-aware BPE tokenizer for the Amharic language (አማርኛ) – fast, accurate, trainable.
Amharic English Machine Translation Corpus prepared through website crawelling and custom preprocessing.
Machine translation (MT) benchmark dataset for languages in the Horn of Africa.
An Amharic News Text classification Dataset
A toolset for Amharic Language pre-processing. Includes an Amharic Stemmer, Transliterator, Stopword remover , Lexical analyzer, Corpus indexer and Term weighter.
A Python package that can transliterate Latin characters to Geez characters and vice versa.
A JavaScript-based converter for transliterating Amharic text into Latin characters
Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities
A library for generating Ethiopic fake data such as names, addresses, and phone numbers
A Voice-First AI Companion
AmQA - The first Amharic Open Domain Question Answering Dataset
Make typing Amharic [on mobile] great [again].
simple bs4 based web crawl for a corpus in need of statistical machine translation
This is a telegram bot to translate text to amharic
notebooks to finetune `bert-small-amharic`, `bert-mini-amharic`, and `xlm-roberta-base` models using an Amharic text classification dataset and the transformers library
The set of files used for the development of the Amharic Corpus.
An obscenity filter library for the Amharic language
An Amharic keyboard to add to your website
Tech terms translation from english to amharic. The aim is to make amharic technology terms natural when used in technology.
Amharic Large Language Model
Cross-Platform Ethiopic Input Method plugin and Cross-platform Desktop Application developed with AvaloniaUi, XAML, and .NET 6
The project aims to define algorithms concepts in clear and simple language, making them accessible to anyone who can speak Amharic irrespective of their technical skill.
Quran speech recognition (ASR) with verse matching, Iqra mode, and translations in Arabic, English, Somali, Amharic & Swahili. Built with Tarteel Whisper.
Yet another way to type amharic on standard english keyboard.
Collection of Geez script fonts
A Java GUI program to teach children amharic in a simple way. [SCHOOL PROJECT]
Modular Amharic text preprocessing toolkit with composable processors and pipeline.
Just an Amharic thesaurus
Amharic-Word Embedding-Word2vec is a pre-trained distributed word representation (word embedding) which aims to provide the Amharic NLP researcher with free to use.
This repository contains implementations of various Natural Language Processing (NLP) tasks and tools specifically for the Amharic language using Java. The goal is to provide a comprehensive set of tools to facilitate NLP research and development for Amharic.