
add UltraLM-13b-V2.0/UltraLM-13b-V2.0-best-of-16/UltraLM-13b-best-of-16 to AlpacaEval #139

Merged
merged 3 commits into from
Sep 30, 2023

Conversation

lifan-yuan
Contributor

No description provided.

```yaml
ultralm-13b:
  prompt_template: "ultralm-13b-best-of-16/prompt.txt"
  fn_completions: "huggingface_local_completions"
  completions_kwargs:
```
Collaborator

did you not have to change the completion function to do best-of-16?

Contributor Author

Hi Yann,
I kept `fn_completions` the same as for other models, since there is no instruction on what options are allowed in this field. Can I add a comment on that line, such as `fn_completions: "huggingface_local_completions" # best-of-16 sampling`? Or can I change `huggingface_local_completions` to something like `huggingface_best_of_16_completions`?

Collaborator

This field is meant to let others replicate your results. All the options can be found here: https://github.com/tatsu-lab/alpaca_eval/blob/main/src/alpaca_eval/decoders/__init__.py

In your case, it seems that you cannot replicate your results with our code. Are all the `completions_kwargs` parameters correct, then? Given that you can't replicate your results with our code, I'd suggest just removing `fn_completions` and `completions_kwargs`, and adding a comment in the yml that says the results can't be reproduced in alpaca_eval because they require best-of-n sampling.
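The suggested change might look something like this (a minimal sketch; the key names follow the snippet above, but the comment wording and the exact final layout of the merged config are illustrative assumptions, not the actual merged file):

```yaml
ultralm-13b:
  prompt_template: "ultralm-13b-best-of-16/prompt.txt"
  # NOTE (illustrative): results cannot be reproduced with alpaca_eval's
  # built-in decoders because they require best-of-16 sampling, which is
  # not implemented in huggingface_local_completions. fn_completions and
  # completions_kwargs are therefore omitted; see the uploaded model
  # outputs instead.
```

Dropping `fn_completions` entirely, rather than pointing at a decoder that cannot actually reproduce the outputs, makes it explicit to readers of the config that replication requires custom sampling code.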

@YannDubs
Collaborator

Hi @lifan-yuan, can you upload the model outputs also?

@lifan-yuan
Contributor Author

> Hi @lifan-yuan, can you upload the model outputs also?

Sorry for the missing files. I will create a new commit adding the output files once we settle on the right way to represent the best-of-16 sampling.

@lifan-yuan
Contributor Author

Hi @YannDubs, I've added a comment in the yml file and explained how the results can be reproduced. The model_outputs are updated as well.

@YannDubs YannDubs merged commit 59a9cab into tatsu-lab:main Sep 30, 2023
@YannDubs
Collaborator

LGTM! Great job!
