
**Training script** #23

Open · 8 of 9 tasks
arampacha opened this issue Jul 6, 2021 · 5 comments

arampacha (Collaborator) commented Jul 6, 2021

- add bf16 support
- check if training with bf16 weights works fine
- add resuming from checkpoint
- add wandb tracking
- complete Adafactor option
- figure out how to best utilize the profiler for training loop optimization
- add gradient accumulation
- support iterable datasets and a max_steps argument
- prefetch generator for the dataloader (see the sketch after this list)
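
For the prefetch item, here is a minimal sketch of a background-thread prefetch generator; `batch_iterator` and `buffer_size` are placeholders, not the repo's actual dataloader interface:

```python
# A background-thread prefetch generator: keeps up to `buffer_size` batches
# ready while the accelerator is busy with the current step.
import queue
import threading


def prefetch(batch_iterator, buffer_size=2):
    """Yield batches from `batch_iterator`, preparing up to `buffer_size`
    batches ahead of time in a background thread."""
    buf = queue.Queue(maxsize=buffer_size)
    sentinel = object()  # marks the end of the iterator

    def producer():
        for batch in batch_iterator:
            buf.put(batch)  # blocks while the buffer is full
        buf.put(sentinel)

    threading.Thread(target=producer, daemon=True).start()

    while True:
        batch = buf.get()
        if batch is sentinel:
            return
        yield batch
```

For prefetching sharded batches directly onto TPU devices, `flax.jax_utils.prefetch_to_device` covers similar ground.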
@arampacha (Collaborator, Author)

Casting the weights to bf16 is not recommended, so it has been removed for now.
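
For context, a minimal sketch of what is kept instead: running the computation in bfloat16 while the stored parameters stay in float32. The model name and the HF Flax API usage below are assumptions for illustration, not necessarily how the training script sets this up.

```python
# Sketch: computation dtype vs. weight dtype in Flax (model name is a
# placeholder).
import jax.numpy as jnp
from transformers import FlaxGPTNeoForCausalLM

# `dtype` only controls the dtype of the forward/backward computation;
# the parameters in `model.params` remain float32, which is what the
# optimizer should keep updating.
model = FlaxGPTNeoForCausalLM.from_pretrained(
    "EleutherAI/gpt-neo-125M", dtype=jnp.bfloat16
)

# What is being avoided: model.to_bf16(model.params) would additionally
# cast the stored weights themselves to bfloat16.
```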

shpotes (Member) commented Jul 8, 2021

Here's the gradient accumulation helper from the vision_transformer codebase:
https://github.com/google-research/vision_transformer/blob/ba9a85bdc430daf4da7b9da67b486a4e0f5bb278/vit_jax/hyper.py#L77

And here's a small example
https://github.com/google-research/vision_transformer/blob/ba9a85bdc430daf4da7b9da67b486a4e0f5bb278/vit_jax/train.py#L63-L66
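
For reference, a minimal sketch of the same idea in plain JAX (the linked hyper.py helper is the canonical version; `loss_fn` and the batch layout here are assumptions):

```python
import jax
import jax.numpy as jnp


def accumulated_grads(loss_fn, params, batch, accum_steps):
    """Average gradients over `accum_steps` micro-batches of `batch`.

    `loss_fn(params, micro_batch)` must return a scalar loss; the leading
    batch axis must be divisible by `accum_steps`.
    """
    # Reshape the leading axis into (accum_steps, micro_batch_size, ...).
    micro_batches = jax.tree_util.tree_map(
        lambda x: x.reshape((accum_steps, -1) + x.shape[1:]), batch
    )
    grad_fn = jax.grad(loss_fn)

    def step(accum, micro_batch):
        grads = grad_fn(params, micro_batch)
        return jax.tree_util.tree_map(jnp.add, accum, grads), None

    zeros = jax.tree_util.tree_map(jnp.zeros_like, params)
    total, _ = jax.lax.scan(step, zeros, micro_batches)
    # A single optimizer update is then applied with these averaged grads.
    return jax.tree_util.tree_map(lambda g: g / accum_steps, total)
```

If optax is already in use, its `MultiSteps` wrapper provides a similar accumulate-then-update mechanism.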

mrinal18 (Collaborator) commented Jul 8, 2021

For gradient accumulation I have opened a PR: #29.
Let me know if we can sync up on this.

@celsofranssa

Hello,
what are the minimum hardware requirements to run the training script?

@arampacha (Collaborator, Author)

Hi @celsofranssa, the hyperparameters in the HF model cards (for example here) are tuned for a TPU v3-8. You can still run the script on a GPU by adjusting the batch size accordingly and possibly switching the dtype from bfloat16 to float16, depending on your hardware. I'm not sure what the exact minimum requirement would be. You can also consider decreasing block_size if you run out of memory.
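
As a rough illustration of the dtype switch (a sketch only; `pick_dtype` is a made-up helper, not part of the training script):

```python
import jax
import jax.numpy as jnp


def pick_dtype():
    """Choose a computation dtype based on the available accelerator."""
    platform = jax.devices()[0].platform  # "tpu", "gpu", or "cpu"
    if platform == "tpu":
        return jnp.bfloat16  # TPUs have native bfloat16 support
    if platform == "gpu":
        return jnp.float16   # half precision is the safer default on most GPUs
    return jnp.float32       # CPU: stay in full precision


print(pick_dtype())
```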
