Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to use MeZO in training a simple CIFAR-10 model #27

Open
Cascol-Chen opened this issue Jan 3, 2024 · 3 comments
Open

How to use MeZO in training a simple CIFAR-10 model #27

Cascol-Chen opened this issue Jan 3, 2024 · 3 comments

Comments

@Cascol-Chen
Copy link

Cascol-Chen commented Jan 3, 2024

Hi, thanks you for sharing such an amazing work.
To use MeZO more easily, could you provide a minimum demo to show how can we use MeZO as an optimizer to train a CIFAR model?

@gaotianyu1350
Copy link
Member

Hi,

Please refer to the section "How to add MeZO to my own code?" here. You can find our highlighted MeZO code in this file and it should be fairly easy to copy it to whatever training code that you are using. The logic of MeZO is quite simple.

@Cascol-Chen
Copy link
Author

Hi,

Sorry for the late reply. Although there are comments to highlight MeZO code, the trainer.py comprises thousand lines of code with complicated dependancy and it's hard to make modification. Therefore, I sincerely recommend a minimum demo on CIFAR to make the great work easier to be followed.

@gaotianyu1350
Copy link
Member

Hi,

If you search "MeZO added" in the file you will find all the related code in trainer.py.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants