
Question about training time cost #12

Open
DachunKai opened this issue Aug 30, 2024 · 3 comments

Comments

@DachunKai

Hello,

Could you please specify the training time cost for the IART model on REDS, such as the IART_REDS_BI_N16.pth? I'm interested in knowing what type of GPU was used, the number of GPUs involved, and approximately how many days the training took.

Thank you!

@PhuTran1005

> Hello,
>
> Could you please specify the training time cost for the IART model on REDS, such as the IART_REDS_BI_N16.pth? I'm interested in knowing what type of GPU was used, the number of GPUs involved, and approximately how many days the training took.
>
> Thank you!

@kai422
Owner

kai422 commented Oct 21, 2024

Training the model IART_REDS_BI_N16.pth on V100 GPUs took about four weeks. Since I only had 8× 16 GB of VRAM available, I enabled gradient checkpointing, a memory-saving technique; the trade-off is that the forward activations are recomputed during backpropagation. If you have more modern GPUs with larger VRAM, such as an A100 80G, you can disable gradient checkpointing, and I estimate the training could be completed within a week.

Regards,
Kai
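
For context, here is a minimal sketch of the memory/compute trade-off described above, using PyTorch's `torch.utils.checkpoint`. The names `ResidualBlock`, `Backbone`, and `use_checkpoint` are illustrative assumptions, not the identifiers used in the IART codebase, and the repository's own checkpointing toggle may differ.

```python
# Minimal sketch of gradient checkpointing with torch.utils.checkpoint.
# `ResidualBlock`, `Backbone`, and `use_checkpoint` are illustrative names,
# not the identifiers used in the IART codebase.
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint


class ResidualBlock(nn.Module):
    def __init__(self, channels=64):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1)
        self.relu = nn.ReLU()

    def forward(self, x):
        return x + self.conv2(self.relu(self.conv1(x)))


class Backbone(nn.Module):
    def __init__(self, num_blocks=30, use_checkpoint=True):
        super().__init__()
        self.blocks = nn.ModuleList([ResidualBlock() for _ in range(num_blocks)])
        self.use_checkpoint = use_checkpoint

    def forward(self, x):
        for block in self.blocks:
            if self.use_checkpoint and self.training:
                # Do not store this block's intermediate activations;
                # recompute them during the backward pass. This saves VRAM
                # at the cost of roughly one extra forward pass per block.
                x = checkpoint(block, x, use_reentrant=False)
            else:
                x = block(x)
        return x


# Toy training step: with use_checkpoint=True, peak memory is lower, but
# each backward pass re-runs the checkpointed forward computations.
model = Backbone(use_checkpoint=True)
frames = torch.randn(1, 64, 64, 64, requires_grad=True)
loss = model(frames).mean()
loss.backward()
```

Disabling the flag (or checkpointing only the heaviest blocks) avoids the recomputation overhead when enough VRAM is available, which is the speed-up described above for larger-memory GPUs.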

@PhuTran1005

Thank you for your detailed response.
