Results on WSC and WIC datasets cannot be reproduced on OPT-13B with MeZO #15

Open
MathIsAll opened this issue Jul 24, 2023 · 5 comments

@MathIsAll

Hello,

Thank you for your fantastic work. When I run mezo.sh for WSC and WIC on OPT-13B with MeZO, I cannot reproduce the results reported in the paper.

I ran mezo.sh on 4 x A100 GPUs, with per_device_batch = 4, lr = 1e-6, eps = 1e-3.

Could you share more details about the training settings needed to reproduce them?

Thanks!

@gaotianyu1350
Member

Hi,

Can you provide more details on your run? For example, what results did you get?

BTW, in our experiments, we only used one A100 for OPT-13B.

@gaotianyu1350
Member

Hi, I just realized that for WSC you should use LR=1e-7 / EPS=1e-3. Note that for all OPT-13B MeZO experiments we do a grid search over two learning rates, LR=1e-6 and LR=1e-7.

@gaotianyu1350
Member

gaotianyu1350 commented Aug 2, 2023

FYI, here are the results I got (settings given as LR/EPS):

WSC 1e-6/1e-3: "accuracy": 0.5384615384615384
WSC 1e-7/1e-3: "accuracy": 0.6346153846153846
WIC 1e-6/1e-3: "accuracy": 0.6112852664576802
WIC 1e-7/1e-3: "accuracy": 0.5736677115987461
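
For readers comparing their own runs, here is a minimal Python sketch of picking the better learning rate per task from the accuracies reported above. The dictionary layout and the helper name `best_lr_per_task` are illustrative assumptions and not part of the MeZO code; the thread also does not say which data split the grid search selects on, so this simply takes the higher reported accuracy.

```python
# Accuracies copied from the comment above, keyed by (task, learning rate).
# EPS is 1e-3 in all four runs.
reported = {
    ("WSC", 1e-6): 0.5384615384615384,
    ("WSC", 1e-7): 0.6346153846153846,
    ("WIC", 1e-6): 0.6112852664576802,
    ("WIC", 1e-7): 0.5736677115987461,
}


def best_lr_per_task(results):
    """Hypothetical helper: return {task: (lr, accuracy)} with the highest accuracy per task."""
    best = {}
    for (task, lr), acc in results.items():
        if task not in best or acc > best[task][1]:
            best[task] = (lr, acc)
    return best


best = best_lr_per_task(reported)
for task, (lr, acc) in sorted(best.items()):
    print(f"{task}: lr={lr:g}, accuracy={acc:.1%}")
```

On these numbers the selection picks LR=1e-7 for WSC and LR=1e-6 for WIC, so a run fixed at LR=1e-6 would be expected to fall short on WSC.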

@sglucas

sglucas commented Sep 27, 2023

Hi, may I ask which seed was used for WSC and WIC? I see that the default seed is 0, but I cannot reproduce the WIC result of 61.1.

@gaotianyu1350
Member

The seed we used for the experiment is 0. Note that different hardware may lead to slightly different results, even if the random seeds are the same.
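
One common reason a fixed seed does not guarantee identical numbers is that floating-point addition is not associative, so the same values summed in a different order can give a slightly different result. The Python sketch below shows only that general effect; it is an illustration, not a diagnosis of the discrepancy in this thread, and whether it is the cause here is an assumption.

```python
# The same three numbers, added in two different orders.
left_to_right = (0.1 + 0.2) + 0.3
right_to_left = 0.1 + (0.2 + 0.3)

# The two sums differ in the last bits even though the inputs are identical.
print(left_to_right == right_to_left)
print(abs(left_to_right - right_to_left))
```

Differences this small can accumulate over many training steps, which is consistent with results that are close but not bit-identical across machines.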
