Binary classification on torchtext.datasets.SST
Test set accuracies on the default train/test split, without cross-validation:
~83% acc - Wang and Manning (https://www.aclweb.org/anthology/P12-2018.pdf) report 83.45% acc on RT-2k with MNB-uni
~78% acc
~77% acc
- >80% to 81.5% acc with Adadelta, dropout_rate = 0.25, weight_decay = 0
- similarly with weight_decay = 0.01
- Kim 2014 reports >81% acc (https://github.com/yoonkim/CNN_sentence/blob/master/README.md) with Adadelta, dropout = 0.5 and l2 regularization
- more on dropout and l2: https://arxiv.org/pdf/1510.03820.pdf
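
A minimal sketch of how the Adadelta / dropout / weight_decay settings listed above could be wired up in PyTorch. The notes do not say which architecture was used, so TinyClassifier, make_optimizer, the vocabulary size and the random batch are all hypothetical placeholders, not the original model or data.

```python
import torch
from torch import nn


class TinyClassifier(nn.Module):
    """Hypothetical stand-in model; the notes do not specify the architecture."""

    def __init__(self, vocab_size=1000, embed_dim=32, dropout_rate=0.25):
        super().__init__()
        # EmbeddingBag mean-pools the token embeddings of each sequence.
        self.embedding = nn.EmbeddingBag(vocab_size, embed_dim)
        self.dropout = nn.Dropout(p=dropout_rate)
        # Single logit for binary classification.
        self.fc = nn.Linear(embed_dim, 1)

    def forward(self, token_ids):
        # token_ids: (batch, seq_len) integer tensor
        pooled = self.embedding(token_ids)
        return self.fc(self.dropout(pooled)).squeeze(-1)


def make_optimizer(model, weight_decay=0.0):
    # In torch.optim, weight_decay is the L2 penalty coefficient.
    return torch.optim.Adadelta(model.parameters(), weight_decay=weight_decay)


model = TinyClassifier(dropout_rate=0.25)
optimizer = make_optimizer(model, weight_decay=0.0)  # or weight_decay=0.01

# One training step on random placeholder data (assumption: not SST).
token_ids = torch.randint(0, 1000, (8, 20))
labels = torch.randint(0, 2, (8,)).float()
loss = nn.BCEWithLogitsLoss()(model(token_ids), labels)
optimizer.zero_grad()
loss.backward()
optimizer.step()
```

Switching between the two runs above only requires changing the weight_decay argument; a Kim-style setup would use dropout_rate=0.5 instead.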