#

dialect-identification

Here are 27 public repositories matching this topic...

CAMeL-Lab / camel_tools

A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.

nlp sentiment-analysis named-entity-recognition nlp-apis arabic nlp-library pos-tagging morphological-analysis stemming arabic-dialects dialect-identification morphological-generation morphological-disambiguation morphological-reinflection

Updated Sep 25, 2024
Python

instadeepai / tunbert

TunBERT is the first release of a pre-trained BERT model for the Tunisian dialect using a Tunisian Common-Crawl-based dataset. TunBERT was applied to three NLP downstream tasks: Sentiment Analysis (SA), Tunisian Dialect Identification (TDI) and Reading Comprehension Question-Answering (RCQA)

nlp sentiment-analysis question-answering dialect-identification bert-models

Updated Feb 13, 2023
Python

iabufarha / ArSarcasm

This repository contains the Arabic sarcasm dataset (ArSarcasm)

sentiment-analysis arabic-nlp sarcasm-detection dialect-identification

Updated Feb 18, 2021

swshon / dialectID_siam

Dialect identification using Siamese network

character identification words dialect language-recognition i-vector siamese phoneme siamese-network mgb mgbchallenge dialect-identification

Updated Dec 12, 2017
Jupyter Notebook

qcri / Arabic_speech_code_switching

The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguistic and the acoustic cues. This dataset is a potential benchmark for DCS in spontaneous speech.

evaluation acoustic arabic lexical asr codeswitching dialect-identification egyptian mordern-standard-arabic

Updated Apr 3, 2022

iabufarha / ArSarcasm-v2

ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analysis, which is a part of WANLP 2021.

nlp sentiment-analysis arabic-nlp sarcasm-detection dialect-identification

Updated Jan 26, 2022

CORDI

sinaahmadi / CORDI

Language and Speech Technology for Central Kurdish Varieties (LREC-COLING 2024)

machine-translation automatic-speech-recognition kurdish sorani language-identification dialect-identification kurdish-language-processing erbil sulaymaniyah mahabad sanandaj

Updated Nov 29, 2024
Python

greek-dialect-classifier

hb20007 / greek-dialect-classifier

Classifier that identifies Greek text as Cypriot Greek or Standard Modern Greek

Updated Jun 12, 2024
Jupyter Notebook

AlexYangLi / DMT

VarDial19 shared task: Discriminating between Mainland and Taiwan Variation of Mandarin Chinese (DMT)

dialect mandarin dialect-identification mandarin-chinese

Updated Apr 10, 2019
Python

CristianViorelPopa / transformers-dialect-identification

nlp transformers romanian computational-linguistics bert dialect-identification romanian-bert moroco

Updated Apr 26, 2021
Jupyter Notebook

a-coles / SMS-Stylometry

A tool that predicts the dialect of English of an SMS message using recurrent neural networks supplemented with data from Google Trends.

sms rnn stylometry google-trends authorship-identification dialect-identification location-detection

Updated Dec 19, 2017
Python

kscanne / canuint

Ríomhchlár a dhéanann aicmiú staitistiúil ar théacsanna Gaeilge de réir a gcanúint

nlp classifier irish dialect gaeilge dialect-identification

Updated May 22, 2020
Perl

MohamedSebaie / Arabic_Dialect_Identification_NLP-AIM-Task

Arabic_Dialect_Identification_NLP-AIM-Task

preprocessing nlp-machine-learning dialect-identification linearsvc farasa bert-fine-tuning arabert

Updated Mar 16, 2022
Jupyter Notebook

abdelrahman-wael / Arabic-Dialect-Classification-Nadi-Shared-Task

using AraBert to classify different Arabic dialects. ranked fourth in WANLP2020 workshop.

nlp-machine-learning arabic-dialects dialect-identification

Updated Feb 26, 2021
Python

telsahy / capstone-35

Twitter Dialect Datasets and Classifiers (GULF Arabic Corpus)

twitter-api topic-modeling arabic nlp-machine-learning arabic-nlp dialect-identification

Updated Jun 28, 2018
Jupyter Notebook

teshi

sinaahmadi / teshi

An atlas of Central Kurdish dialects + a simple game to detect dialects

kurdish language-identification dialects dialect-identification kurdish-language-processing

Updated Dec 18, 2024
HTML

Arabic-Dialect-Classifier

eesanoble / Arabic-Dialect-Classifier

An Arabic Tweet Dialect Classifier

nlp machine-learning natural-language-processing arabic-nlp dialect-identification

Updated Feb 8, 2022
Jupyter Notebook

telsahy / capstone-52

Twitter Dialect Datasets and Classifiers (EG + GULF Arabic Corpus)

twitter-api topic-modeling arabic nlp-machine-learning arabic-nlp dialect-identification

Updated Jun 28, 2018
Jupyter Notebook

telsahy / capstone-34

Twitter Dialect Datasets and Classifiers (EG Arabic Corpus)

twitter-api topic-modeling arabic nlp-machine-learning arabic-nlp dialect-identification

Updated Feb 3, 2020
Jupyter Notebook

hasanhuz / Location_Analysis_Project

twitter word2vec location geopy dialect-identification location-analysis

Updated Dec 16, 2018
Python

Improve this page

Add a description, image, and links to the dialect-identification topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the dialect-identification topic, visit your repo's landing page and select "manage topics."