"topic:dialect-identification" — Search

32 results for “topic:dialect-identification”

A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.

arabicarabic-dialectsdialect-identificationmorphological-analysismorphological-disambiguationmorphological-generationmorphological-reinflectionnamed-entity-recognitionnlpnlp-apisnlp-librarypos-taggingsentiment-analysisstemming

instadeepai/tunbert

TunBERT is the first release of a pre-trained BERT model for the Tunisian dialect using a Tunisian Common-Crawl-based dataset. TunBERT was applied to three NLP downstream tasks: Sentiment Analysis (SA), Tunisian Dialect Identification (TDI) and Reading Comprehension Question-Answering (RCQA)

Python13042Updated 3 years ago

bert-modelsdialect-identificationnlpquestion-answeringsentiment-analysis

iabufarha/ArSarcasm

This repository contains the Arabic sarcasm dataset (ArSarcasm)

2613Updated 5 years ago

arabic-nlpdialect-identificationsarcasm-detectionsentiment-analysis

swshon/dialectID_siam

Dialect identification using Siamese network

Jupyter Notebook154Updated 8 years ago

characterdialectdialect-identificationi-vectoridentificationlanguage-recognitionmgbmgbchallengephonemesiamesesiamese-networkwords

qcri/Arabic_speech_code_switching

The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguistic and the acoustic cues. This dataset is a potential benchmark for DCS in spontaneous speech.

151Updated 3 years ago

acousticarabicasrcodeswitchingdialect-identificationegyptianevaluationlexicalmordern-standard-arabic

iabufarha/ArSarcasm-v2

ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analysis, which is a part of WANLP 2021.

123Updated 4 years ago

arabic-nlpdialect-identificationnlpsarcasm-detectionsentiment-analysis

sinaahmadi/CORDI

Language and Speech Technology for Central Kurdish Varieties (LREC-COLING 2024)

Python112Updated 1 year ago

automatic-speech-recognitiondialect-identificationerbilkurdishkurdish-language-processinglanguage-identificationmachine-translationmahabadsanandajsoranisulaymaniyah

hb20007/greek-dialect-classifier

Classifier that identifies Greek text as Cypriot Greek or Standard Modern Greek

Jupyter Notebook103Updated 3 weeks ago

classificationclassifiercypriotdialectdialect-identificationdialectsgreekjupyterjupyter-notebooklanguage-classificationlanguage-identificationmachine-learningn-gramsnlpnlp-machine-learningnltknltk-datanltk-librarynltk3notebook

AlexYangLi/DMT

VarDial19 shared task: Discriminating between Mainland and Taiwan Variation of Mandarin Chinese (DMT)

Python61Updated 6 years ago

dialectdialect-identificationmandarinmandarin-chinese

A-

a-coles/SMS-Stylometry

A tool that predicts the dialect of English of an SMS message using recurrent neural networks supplemented with data from Google Trends.

Python62Updated 8 years ago

authorship-identificationdialect-identificationgoogle-trendslocation-detectionrnnsmsstylometry

CristianViorelPopa/transformers-dialect-identification

No description provided.

Jupyter Notebook51Updated 4 years ago

bertcomputational-linguisticsdialect-identificationmoroconlpromanianromanian-berttransformers

Cyr-Ch/german-dialect-aware-g2p

Dialect-aware grapheme-to-phoneme conversion for German using Transformer + XLM-R. Context-aware, multi-dialect support with CTC+CE training. Built with PyTorch Lightning & Hydra.

Python40Updated 4 months ago

dialect-identificationgrapheme-to-phonemetransformer

kscanne/canuint

Ríomhchlár a dhéanann aicmiú staitistiúil ar théacsanna Gaeilge de réir a gcanúint

Perl41Updated 5 years ago

classifierdialectdialect-identificationgaeilgeirishnlp

MohamedSebaie/Arabic_Dialect_Identification_NLP-AIM-Task

Arabic_Dialect_Identification_NLP-AIM-Task

Jupyter Notebook32Updated 4 years ago

arabertbert-fine-tuningdialect-identificationfarasalinearsvcnlp-machine-learningpreprocessing

abdelrahman-wael/Arabic-Dialect-Classification-Nadi-Shared-Task

using AraBert to classify different Arabic dialects. ranked fourth in WANLP2020 workshop.

Python31Updated 5 years ago

arabic-dialectsdialect-identificationnlp-machine-learning

telsahy/capstone-35

Twitter Dialect Datasets and Classifiers (GULF Arabic Corpus)

Jupyter Notebook21Updated 7 years ago

arabicarabic-nlpdialect-identificationnlp-machine-learningtopic-modelingtwitter-api

eesanoble/Arabic-Dialect-Classifier

An Arabic Tweet Dialect Classifier

Jupyter Notebook20Updated 4 years ago

arabic-nlpdialect-identificationmachine-learningnatural-language-processingnlp

Blue16-WangFudi/DialectSense

Chinese dialect identification using audio embeddings from LLMs.

Python21Updated 3 months ago

audioclassificationdialect-identificationembeddingsllmspeech

sinaahmadi/teshi

An atlas of Central Kurdish dialects + a simple game to detect dialects

HTML20Updated 1 year ago

dialect-identificationdialectskurdishkurdish-language-processinglanguage-identification

telsahy/capstone-52

Twitter Dialect Datasets and Classifiers (EG + GULF Arabic Corpus)

Jupyter Notebook21Updated 7 years ago

arabicarabic-nlpdialect-identificationnlp-machine-learningtopic-modelingtwitter-api

disooqi/MADAR-shared-task

This shared task will be the first to target a large set of dialect labels at the city and country levels. The data for the shared task is created or collected under the Multi-Arabic Dialect Applications and Resources (MADAR) project.

Jupyter Notebook10Updated 6 years ago

2019arabicdialect-identificationmadarnlpshared-task

telsahy/capstone-34

Twitter Dialect Datasets and Classifiers (EG Arabic Corpus)

Jupyter Notebook12Updated 6 years ago

arabicarabic-nlpdialect-identificationnlp-machine-learningtopic-modelingtwitter-api

Karthik-Dulam/Vaani-Dialect-Identification

Dialect Identification in Indic Languages

Python10Updated 11 months ago

asrdeep-learningdialect-identificationpytorchpytorch-lightningwav2vec2whisper

hasanhuz/Location_Analysis_Project

No description provided.