
Eval results aren't matching the paper #29

Open
sia-cerebras opened this issue Jan 24, 2024 · 1 comment

@sia-cerebras

I'm not able to match the 3-shot eval results reported in the paper for the pretrained model.
I downloaded the Meditron-7b model from HF.
For example, for MedQA I get 0.353, while the paper reports 0.287±0.008.
My command was: ./inference_pipeline.sh -b medqa4 -c meditron-7b -s 3 -m 0 -out_dir out_dir

On PubMedQA, I got 0.486, but the paper reports 0.693±0.151.
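For reference, here's a minimal sketch (not the repo's evaluation code) of how I'm computing accuracy and the mean ± std across seeds when comparing against the paper's numbers. The output file names and JSON fields below are assumptions about my own prediction dumps, not the pipeline's format.

```python
# Minimal sketch, assuming each run writes one JSONL file with
# "prediction" and "gold" fields per example (hypothetical format).
import json
import statistics

def accuracy(pred_file):
    """Fraction of examples where the predicted answer matches the gold label."""
    with open(pred_file) as f:
        records = [json.loads(line) for line in f]
    correct = sum(r["prediction"] == r["gold"] for r in records)
    return correct / len(records)

# One accuracy value per seed / run of the pipeline.
scores = [accuracy(f"out_dir/medqa4_seed{s}.jsonl") for s in (0, 1, 2)]
print(f"{statistics.mean(scores):.3f} ± {statistics.stdev(scores):.3f}")
```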

@jfernandrezj

In case anybody is interested, I ran my own eval and it performs far worse than Mistral:7b on TNM coding!
