Sarcasm-detection-over-Reddit-Corpus

Course Project for DSE 407 - Natural Language Processing

by Dr. Tanmay Basu and Dr. Jasabanta Patro

The task was to develop an NLP method that identifies sarcastic comments as accurately as possible by learning from a labelled dataset. We represented the comments with bag-of-words and TF-IDF vectorizers and trained three classifiers on them: Multinomial Naive Bayes, logistic regression, and a support vector machine. We also applied preprocessing techniques such as stop-word removal, lemmatization, and stemming to the dataset to measure their effect on prediction accuracy. Finally, we used the best-performing model to predict the class ('sarcastic' or 'non-sarcastic') of each comment in the given test dataset.
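As a rough illustration of one branch of this pipeline (stop-word removal, bag-of-words counts, and a Multinomial Naive Bayes classifier), here is a minimal sketch using only the Python standard library. It is not the project's code: the names `STOP_WORDS`, `tokenize`, and `BagOfWordsNaiveBayes`, the tiny stop-word list, and the six training comments are all made up for this example. TF-IDF weighting, lemmatization, stemming, and the other two classifiers are not covered here.

```python
import math
from collections import Counter, defaultdict

# Tiny, hypothetical stop-word list; a real project would use a fuller one.
STOP_WORDS = {"a", "an", "the", "is", "was", "to", "of", "i", "it", "that", "this"}


def tokenize(text):
    """Lowercase, split on whitespace, strip punctuation, drop stop words."""
    tokens = (tok.strip(".,!?\"'") for tok in text.lower().split())
    return [tok for tok in tokens if tok and tok not in STOP_WORDS]


class BagOfWordsNaiveBayes:
    """Multinomial Naive Bayes over bag-of-words counts, add-one smoothing."""

    def fit(self, texts, labels):
        self.word_counts = defaultdict(Counter)  # label -> word -> count
        self.doc_counts = Counter(labels)        # label -> number of comments
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = {w for counts in self.word_counts.values() for w in counts}
        return self

    def log_scores(self, text):
        """Return the unnormalized log-probability of each label for `text`."""
        total_docs = sum(self.doc_counts.values())
        # Words never seen in training carry no information and are skipped.
        tokens = [tok for tok in tokenize(text) if tok in self.vocab]
        scores = {}
        for label, n_docs in self.doc_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)  # log prior
            for tok in tokens:
                score += math.log((counts[tok] + 1) / denom)  # smoothed likelihood
            scores[label] = score
        return scores

    def predict(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


# Made-up toy data, only to show the shape of the inputs.
train_texts = [
    "Oh great, another Monday morning meeting",
    "Yeah right, because that always works so well",
    "Wow, what a totally brilliant plan",
    "The meeting starts at ten on Monday",
    "This library works well for parsing dates",
    "Here is the plan for the next release",
]
train_labels = ["sarcastic"] * 3 + ["non-sarcastic"] * 3

model = BagOfWordsNaiveBayes().fit(train_texts, train_labels)
print(model.predict("Oh great, what a brilliant idea"))  # sarcastic
print(model.predict("The release starts on Monday"))     # non-sarcastic
```

With six training comments the predictions only show the mechanics. Any comparison of vectorizers, preprocessing steps, or classifiers needs the full labelled corpus and a held-out evaluation split.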

Khodak, Mikhail; Saunshi, Nikunj; Vodrahalli, Kiran. [A Large Self-Annotated Corpus for Sarcasm](https://doi.org/10.48550/arXiv.1704.05579). Proceedings of the Linguistic Resource and Evaluation Conference (LREC), 2018.
