17 results for “topic:corpus-search”
An Integrated Corpus Tool With Multilingual Support for the Study of Language, Literature, and Translation
ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with diverse types of annotation.
The great textmining tool that obviates all others
Parse Corpus Query Language (CQL) into a list of JSON queries
A concordancing program for English with a GUI interface that can read .docx, .srt, and plaintext files and export concordance lines to .txt,. docx, .tsv, .xlsx, and .html.
A rugged, practical R toolkit for web scraping, stepwise NLP, and lightweight LLM pipelines.
Massive Speech Corpus Tool - Recursive (MaSCoT-R) is a Praat script for working with very large speech corpora
Predicting time-consuming CQL queries in language corpora
We designed an Information Retrieval system based on Vector Space model in python. We Also have implemented Bi gram Indices for Phrasal query search and Champion List retrieval. We also compared time of whole retrieving in our project report.
No description provided.
MaSCoT is a Praat tool developed to facilitate searching, extracting and analyzing information contained in large, richly-annotated speech corpora developed in Praat. This version is for single TextGrid/WAV pairs.
A Lucene-based corpus search service for parallel corpora, suitable for translators and lexicographers
Gets text and extracts sentences in a language from text using that language's lexicon.
A fast, small, and portable Windows application for searching large text corpora, with regex and right-to-left support.
Script to search through the EMMA corpus
For a corpus linguistics project, I created an information retrieval program called "You Are Not Alone". My phrase_finder() function searches for a self-identifying phrase in 4 large classic texts (The Souls of Black Folk, Jane Eyre, The Strange Case of Dr. Jekyll & Mr. Hyde, and Frankenstein). Standpoint: "So Matilda’s strong young mind continued to grow, nurtured by the voices of all those authors who had sent their books out into the world like ships on the sea. These books gave Matilda a hopeful and comforting message: You are not alone.” ~ from Matilda by Roald Dahl 📖
OCR-first Arabic book corpus platform with citation-grade APIs