Have anyone reproduced the result? #13

qiaopt · 2019-01-10T02:51:56Z

I have tried to reproduce the result, by got QWK much less than that in the paper. Here is my log for prompt 1, fold_0:

Did i do something wrong?

DamonCC · 2019-09-05T11:39:47Z

I also got a similar result, with Kappa scores ranging from 0.5 to 0.6. fold == 0, prompt == 1, all other parameters are default values.

ghost · 2019-09-14T14:34:35Z

How did you get the final result? As I saw in the source code, the author run 50 epochs on one fold and get the final dev score and test score. Should I run 50 epochs individually in each fold, and average their test results as the final experimental result?

jkdufair · 2019-09-17T15:02:12Z

I am also trying to replicate the results and getting similar outcomes. I've also tried varying the seed as described in the paper, to no avail.

jkdufair · 2019-09-17T19:47:56Z

I was able to replicate the results with QWKs in the .8 range. You'll need to utilize the embeddings file, as described here. Additionally, when you download the file from the link in the FAQ, the embeddings values are separated by commas but this repo expects them to be separated by spaces. I was able to accomplish this with

sed -ri ':a;s/(\ [^,]*),/\1 /;ta' embeddings.w2v.txt

@kavehtp Perhaps you want to update the FAQ to reflect this?

Thanks for making this repo available!

jkdufair · 2019-09-18T16:06:10Z

Also, my understanding from the paper is that the best results used a combination of CNN & RNN (LSTM). When I was able to replicate, I passed --cnndim 50 as well. I do not believe CNN is defaulted in parameters.

nahos · 2020-03-11T19:03:36Z

What versions of python,theano,keras and tensorflow did you use? I am facing issues with tensorflow.

NNNNNaaaaaa · 2020-08-10T07:51:35Z

Also, my understanding from the paper is that the best results used a combination of CNN & RNN (LSTM). When I was able to replicate, I passed --cnndim 50 as well. I do not believe CNN is defaulted in parameters.

Even with --cnndim 50 and the embeddings file, I still get the highest QWK with 0.556 for prompt 1, fold_0. Did you use any parameters? And how did you deal with words tagged "<unk> <num> <pad>"? Thank you so much!

philiphaddad97 · 2023-04-26T23:24:36Z

Did anyone get the same result as mentioned in the paper?

This was referenced Sep 19, 2019

Error while using embeddings from Glove2Vec #12

Open

UnicodeDecodeError: 'utf-8' codec can't decode byte 0x92 in position 1539: invalid start byte #11

Open

bourne-3 mentioned this issue Aug 6, 2020

What versions of python,theano,keras and tensorflow did you use? I am facing issues with tensorflow. #17

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Have anyone reproduced the result? #13

Have anyone reproduced the result? #13

qiaopt commented Jan 10, 2019

DamonCC commented Sep 5, 2019

ghost commented Sep 14, 2019

jkdufair commented Sep 17, 2019

jkdufair commented Sep 17, 2019

jkdufair commented Sep 18, 2019

nahos commented Mar 11, 2020

NNNNNaaaaaa commented Aug 10, 2020 •

edited

Loading

philiphaddad97 commented Apr 26, 2023

Have anyone reproduced the result? #13

Have anyone reproduced the result? #13

Comments

qiaopt commented Jan 10, 2019

DamonCC commented Sep 5, 2019

ghost commented Sep 14, 2019

jkdufair commented Sep 17, 2019

jkdufair commented Sep 17, 2019

jkdufair commented Sep 18, 2019

nahos commented Mar 11, 2020

NNNNNaaaaaa commented Aug 10, 2020 • edited Loading

philiphaddad97 commented Apr 26, 2023

NNNNNaaaaaa commented Aug 10, 2020 •

edited

Loading