"topic:code-mixing" — Search

34 results for “topic:code-mixing”

A curated list of research papers and resources on code-switching

bilingualcode-mixedcode-mixingcode-switchcode-switchinglanguagenlppapersresearchspeech

This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalence Constant Theory and Matrix Language Theory.

Jupyter Notebook5813Updated 3 months ago

code-mixingcode-switchingdata-generationlanguage-modelinglinguisticsnatural-language-processingpython3synthetic-data-generation

microsoft/LID-tool

This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The text that includes words from two languages such as Hindi written in roman script, mixed with English.

Python5710Updated 1 month ago

code-mixingcode-switchinglanguage-identificationlanguage-tagslinguisticsmalletnatural-language-processingpython3

praatibhsurana/Hinglish_Hindi_WSD

A pipeline for transliteration, spell correction, POS tagging and word sense disambiguation of Hinglish code mixed data to Hindi Devanagari script.

Python378Updated 2 months ago

code-mixinghindi-pos-taghindi-spell-correctionhinglishhinglish-to-hindi-transliterationindic-languagesindic-nlpindic-transliterationindowordnetlesklesk-algorithmnlppos-taggingpython-3python-librarypython-packagespelloword-sense-disambiguationwsdwsd-dataset

sumanbanerjee1/Code-Mixed-Dialog

No description provided.

Python337Updated 3 years ago

code-mixinghredseq2seq

cisnlp/MaskLID

💬 MaskLID: Code-Switching Language Identification through Iterative Masking -- ACL 2024

Python123Updated 1 day ago

code-mixingcode-switchcode-switchinglanguage-identificationlanguage-identification-toolkitlanguage-identifier

lingo-iitgn/awesome-code-mixing

A curated list of resources dedicated to Code-mixed Natural Language Processing (NLP).

110Updated 1 month ago

code-mixingcode-switchingnlp

salesforce/adversarial-polyglotsArchived

Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021)

Python106Updated 10 months ago

adversarial-attacksadversarial-examplesadversarial-trainingcode-mixingmultilingualnlprobustness

aparnadutta/code-mixed-lid

Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings.

Python101Updated 8 months ago

bangla-nlpcode-mixinglanguage-identificationmachine-learningnlpword-level-language-model

ash-shar/Code-Switching-and-Swearing-Patterns-on-Twitter

Repository containing Abusive Tweet Detection, Location Detection and Gender Detection codes

Python72Updated 1 year ago

code-mixingcode-switchinggender-detectionlocation-detectionnlpsocial-network-analysisswearingtwitter

mmaguero/josa-corpus

Jopara (Guarani-dominant mixed with Spanish) sentiment analysis corpus

70Updated 5 months ago

baselinesbert-fine-tuningbilstmcode-mixingcode-switchingcorpus-linguisticsdeep-learninglow-resource-languagesmultilingualsentiment-analysissentiment-classificationtext-categorizationtext-classificationtraditional-machine-learning

LCS2-IIITD/HIT-ACL2021-Codemixed-Representation

This repo contains the source code of HIT: A Hierarchically Fused Deep Attention Network for RobustCode-mixed Language Representation (Accepted in ACL 2021)

Python65Updated 2 years ago

attention-modelcode-mixingmachine-translationnamed-entity-recognitionnlpparts-of-speechsentiment-classificationtransformer

andrianllmm/tagLID

A word-level Language Identification (LID) tool for Tagalog-English (Taglish) text

Python20Updated 2 months ago

code-mixingcode-switchingenglishlanguage-identificationlinguisticsnlptagalogtaglish

gulabpatel/Code-Mixing

will discuss code mixing algorithms evolution

Jupyter Notebook20Updated 1 year ago

code-mixinglid

ir-nlp-csui/id-en-code-mixed

Indonesian-English code-mixed Twitter dataset

20Updated 1 year ago

code-mixingenglish-languageindonesian-languagelanguage-identificationlexical-normalizationtwitter

ayanc18/PsycholinguisticCodeMixing

Psycholinguistic Analysis of Code Mixing - Speech and Natural Language Processing Term Project: CS60057. Department of Computer science and Engineering, Indian Institute of Technology Kharagpur

Python11Updated 4 years ago

code-mixingnatural-language-processingpsycholinguisticspython3

Lidan0241/language-detection

A language detection model for code-switched texts in es/en/zh

Jupyter Notebook11Updated 1 year ago

code-mixingcode-switchingidentification-languagenlp

Wei-RongRong2/RojakLanguageSentimentAnalysis

This is a machine learning project focused on analysing and classifying sentiments in code-switched and code-mixed text, specifically targeting the unique linguistic characteristics found in Malaysian conversations.

Jupyter Notebook10Updated 11 months ago

code-mixingcode-switchingdeep-learningdocker-imageflask-applicationlstmmachine-learningmalaya-librarymalaysian-languagemultilingual-nlpmultinomial-naive-bayesnamed-entity-recognitionnatural-language-processingrender-deploymentsentiment-analysissupport-vector-machine

gentaiscool/codemixqa

CodeMixQA is a benchmark with high-quality human annotations, comprising 16 diverse parallel code-switched language-pair variants that span multiple geographic regions and code-switching patterns, and include both original scripts and their transliterated forms.

Python10Updated 1 month ago

benchmarkcode-mixingcode-switchingnlp

poornagurram/code_mixing_sentiment

No description provided.

Python11Updated 5 years ago

code-mixingnlpsentiment-analysis

Bernardbyy/BahasaRojakSentimentAnalysis

Handling Bahasa Rojak (Malaysian Code Mixing Language) OOV and performing Sentiment Analysis using downstreamed XLM-R

Jupyter Notebook11Updated 5 months ago

bahasa-melayuchinese-simplifiedcode-mixingdomain-adaptationfine-tuningout-of-vocabularysentiment-analysissentiment-classificationtransfer-learningtwitterxlmroberta

jessicasaikia/multilingual-BERT-mBERT

This repository implements a Multilingual BERT (mBERT) model for performing Parts-of-Speech (POS) Tagging on Assamese-English code-mixed texts.

Python00Updated 1 year ago

assameseassamese-textcode-mixedcode-mixingenglishenglish-languagembertmultilingual-bertnlpnlp-machine-learningparts-of-speechparts-of-speech-taggingpos-taggerpos-tagging

Nexdata-AI/300-Person-Mandarin-Chinese-and-English-Bilingual-Spontaneous-Monologue-smartphone

300-Person-Mandarin-Chinese-and-English-Bilingual-Spontaneous-Monologue-smartphone

00Updated 1 year ago

asrcode-mixingspeech-to-textspontaneous-speech-recognition

vcyrot/Frenglish-Benchmark

A Centralized Frenglish Benchmark from Naturally Occurring Code-Switching and Code-Mixing

00Updated 3 years ago

code-mixingcode-switchingfrench-englishnlp

jessicasaikia/long-short-term-memory-LSTM

This repository implements a Long Short Term Memory (LSTM) for performing Parts-of-Speech (POS) Tagging on Assamese-English code-mixed texts.

Python00Updated 1 year ago

assameseassamese-textcode-mixedcode-mixingenglishenglish-languagelong-short-term-memorylong-short-term-memory-modelslstmlstm-modellstm-neural-networksnlpnlp-machine-learningpart-of-speech-taggingparts-of-speechpos-taggerpos-tagging

jessicasaikia/conditional-random-field-CRF

This repository implements a Conditional Random Field (CRF) for performing Parts-of-Speech (POS) Tagging on Assamese-English code-mixed texts.

Python00Updated 1 year ago

assameseassamese-textcode-mixedcode-mixingconditional-random-fieldcrfcrf-modelcrfsuiteenglishenglish-languagenlpnlp-machine-learningparts-of-speechparts-of-speech-taggingpos-taggerpos-tagging

kmi-linguistics/Code-mixing

No description provided.

00Updated 8 years ago

assamesecode-mixingcode-switchingcomputational-linguisticsenglishhindiindian-languageindian-languageslanguage-detectionlanguage-identificationnatural-language-processingnlpnlp-machine-learningsocial-media

jessicasaikia/bidirectional-long-short-term-memory-BiLSTM

This repository implements a Bidirectional Long Short Term Memory (BiLSTM) for performing Parts-of-Speech (POS) Tagging on Assamese-English code-mixed texts.

Python00Updated 1 year ago

assameseassamese-textbidirectional-long-short-term-memory-networkbidirectional-lstmbilstmbilstm-modelcode-mixedcode-mixingenglishenglish-languagenlpnlp-machine-learningparts-of-speechparts-of-speech-taggingpos-taggerpos-tagging

Anwarvic/truel_bilingual_nmt

The official code for the "True Bilingual NMT" paper

Python00Updated 2 years ago

bilingualcode-mixingcode-switchingmachine-translationmultilingual-translationsneural-machine-translationpretrained-modelstransformer

Mohit1053/NLP_Project

Hindi-English code-mixed text classification using TF-IDF + Logistic Regression and BERT fine-tuning

Jupyter Notebook00Updated 1 week ago

bertcode-mixingnlppythonsentiment-analysistext-classification

Page 1 of 2