Skip to content

Commit

Permalink
Final commit
Browse files Browse the repository at this point in the history
  • Loading branch information
alberto-paparella committed Jul 17, 2023
1 parent 0c905d4 commit abcf91a
Show file tree
Hide file tree
Showing 12 changed files with 10 additions and 1 deletion.
7 changes: 7 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -20,3 +20,10 @@ The designed model must predict a probability of each type of toxicity for each
- test_labels.csv - labels for the test data; value of -1 indicates it was not used for scoring

The dataset is under [CC0](https://creativecommons.org/share-your-work/public-domain/cc0/), with the underlying comment text being governed by [Wikipedia's CC-SA-3.0](https://creativecommons.org/licenses/by-sa/3.0/).

## Python notebooks

The project is divided into 3 notebooks:
- `toxic_comment_classifier` contains the core kernel, featuring data analysis and machine learning algorithms as logistic regression and naive-bayes
- `lstm-tcc` contains the LSTM kernel
- `bert-tcc` contains the BERT fine-tuning kernel
Binary file added Toxic_Comment_Classifier.pdf
Binary file not shown.
1 change: 1 addition & 0 deletions bert-tcc.ipynb

Large diffs are not rendered by default.

Binary file removed dataset/test.zip
Binary file not shown.
Binary file removed dataset/test_labels.zip
Binary file not shown.
Binary file removed dataset/train.zip
Binary file not shown.
Binary file added images/ROC AUC score.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/correlations.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/text_length.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added images/words_distribution.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
1 change: 1 addition & 0 deletions lstm-tcc.ipynb

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion toxic_comment_classifier.ipynb

Large diffs are not rendered by default.

0 comments on commit abcf91a

Please sign in to comment.