-
Notifications
You must be signed in to change notification settings - Fork 71
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Have anyone reproduced the result? #13
Comments
I also got a similar result, with Kappa scores ranging from 0.5 to 0.6. fold == 0, prompt == 1, all other parameters are default values. |
How did you get the final result? As I saw in the source code, the author run 50 epochs on one fold and get the final dev score and test score. Should I run 50 epochs individually in each fold, and average their test results as the final experimental result? |
I am also trying to replicate the results and getting similar outcomes. I've also tried varying the seed as described in the paper, to no avail. |
I was able to replicate the results with QWKs in the
@kavehtp Perhaps you want to update the FAQ to reflect this? Thanks for making this repo available! |
Also, my understanding from the paper is that the best results used a combination of CNN & RNN (LSTM). When I was able to replicate, I passed |
What versions of python,theano,keras and tensorflow did you use? I am facing issues with tensorflow. |
Even with --cnndim 50 and the embeddings file, I still get the highest QWK with 0.556 for prompt 1, fold_0. Did you use any parameters? And how did you deal with words tagged "<unk> <num> <pad>"? Thank you so much! |
Did anyone get the same result as mentioned in the paper? |
I have tried to reproduce the result, by got QWK much less than that in the paper. Here is my log for prompt 1, fold_0:
log.txt
Did i do something wrong?
The text was updated successfully, but these errors were encountered: