34 results for “topic:language-documentation”
Resources for conservation, development, and documentation of low resource (human) languages.
Yet another search platform for linguistic corpora.
R package that helps to render interlinear glossed linguistic examples in html rmarkdown documents and then semi-automatically compiles the glosses list
Script for workflow to add morphological analysis into ELAN files
Tools for the Pangloss Collection, an online archive of under-documented languages
An implementation of SileroVAD as a recognizer for ELAN
Chinese dialect pronunciation database pipeline — converts multi-format word lists into optimized SQLite databases covering 2000+ dialect locations and 6M+ entries.
A specification for formatting interlinear glossed texts in a way that is computationally parseable
A JavaScript library that converts scription text files to the Data Format for Digital Linguistics
Linguistic data on the Nuuchahnulth (Wakashan) language
The DLx portal for viewing, searching, and aggregating data
Mostly XML (TEI) markup of Mixtepec-Mixtec Language resources
This is the official repository of the Eastling editor. It is part of the Eastling suite: Easy Annotation and Synchronization Tool for linguists.
A JavaScript library for working with linguistic data in DLx format
The Lotus web app for managing linguistic data
A JavaScript library for converting linguistic data to HTML
Website for the Algonquian Components Project (Nisinoon)
No description provided.
A network visualisation of Siwu ideophones. You can view and interact with the network at:
Collection of Public Domain data in Komi-Zyrian
Scripts to automate common ffmpeg commands for processing video in language documentation.
Cross-Linguistic Data Format (CLDF) dataset for the Enggano word list from the late 19th century (c1895) based on the Holle List.
This repository hosts scripts and materials related to the Pangloss Collection. The official repository of the Pangloss Collection is at: https://github.com/CNRS/Pangloss
Digitised comparative word list in Modigliani's "L'isola delle donne" from 1894. The word list captures forms in Nias, Batak-Toba, Enggano, and Malay, with Italian reference. The Enggano forms are included in the EnoLEX database (https://doi.org/10.25446/oxford.28282169).
The Kholosi language of Iran.
A network visualisation of Japanese mimetics. You can view and interact with the network at:
A repository to track R codes in (pre-)processing the flora and fauna Google Spreadsheet. The original, lightly annotated data is now archived in Oxford SDS 👇
Digitised comparative Enggano word list from Oudemans (1889). This publication contains the unpublished Enggano word list by Francis (1870) put in comparison with those by Boewang (1854), van de Straaten & Severijn (1855), von Rosenberg (1855). View the data at https://github.com/engganolang/oudemans1889/blob/main/data/oudemans1889-long.csv
Cross-Linguistic Data Format (CLDF) dataset for The Digitised, Searchable Holle List in Stokhof (1980). The interactive version is deployed as a webpage 👇.
A collection of conversational phrases in Tlahuapa Mixtec (Oto-Manguean)