46 results for “topic:ancient-languages”
Data for the quantitative study of (Vedic) Sanskrit
Main application code for Ambuda, a breakthrough Sanskrit library (ambuda.org)
Code and data for "Summarising Historical Text in Modern Languages" (EACL 2021)
[PRL 2025, APSIPA 2022] Syllable Analysis Data Augmentation (SADA). This project introduces a glyph dictionary and a grammar-aware augmentation strategy designed to enhance Khmer palm leaf manuscript recognition. By modeling the language's grammatical structure, we support more robust OCR performance in low-resource settings.
Raw dataset for Old Persian cuneiform
Official releases of the PROIEL treebank of ancient Indo-European languages
A tool for exploring the Linear A corpus
An Ancient Greek Morphology Tagger
Semantic Dictionaries for Ancient Languages
Code and sample images described in the paper "DeepScribe: Localization and Classification of Elamite Cuneiform Signs Via Deep Learning"
The Ancient Greek dictionary for Hunspell (grc_GR for Notepad++, Google Chrome, Vivaldi, etc.).
No-nonsense simple transliteration between writing systems, mostly of Semitic origin
A metafont glyph dataset that helps people define CJK-like glyphs with their metafont scripts using machine learning
[SSDA 2023] This project explores advanced document image recognition methods tailored for low-resource historical German manuscripts.
An array of tools for Sanskrit for tasks such as noun declension and verb conjugation.
A program for creating a searchable local language dictionary based (mainly) on dumped Wiktionary data. Lets users collect definitions, which can be exported as a machine-readable flashcard file. Currently supports Latin, Ancient Greek and Old English.
Online decimal-to-Maya numeral converter.
Contains a Text-Fabric dataset of the Ugaritic corpus.
🗿: Maya Glyph
Pali Lessons in English by ChatGPT
Summary grammar and modified DVLs for OCR's Classical Greek (9-1), Latin (9-1) GCSEs, from the 2016 syllabi. Used as part of educational resources in Tiffin School and the Kingston Academy.
Corpus of texts written in cuneiform
Train a generative language model to output ancient Chinese text, using Chinese classics.
GitHub repo scaffold for a “script-writing bot” that can (a) mimic archaic styles and (b) transliterate into historical/occult alphabets (e.g., Theban / “Alphabet of Honorius,” “Celestial” alphabet) while also supporting genuine ancient languages (Latin, Koine/Attic Greek, Coptic) when possible.
Source code for the submissions to SIGTYP 2024, EvaLatin 2024, and AXOLOTL 2024 shared tasks
"BrahmiLipi" is an Android application that helps users learn the Brahmi script through the Devanagari and Latin/English scripts.
A library that provides ancient Egyptian hieroglyphs for use with a hieroglyph renderer such as Egyptian Writer.
Library for rendering Egyptian hieroglyphic texts.
A library for converting between MdC (Manuel de Codage) and GlyphX (Hieroglyph XML). Both are used for displaying Egyptian hieroglyphs.
An Android library with a custom TextView for displaying Egyptian hieroglyphs.