copy flat data maybe faster #3942

wiyr · 2023-07-13T03:45:11Z

https://github.com/microsoft/DeepSpeed/blob/master/deepspeed/runtime/fp16/fused_optimizer.py#L298C1-L301C37

Since fp16_groups shares memory with fp16_groups_flat, and fp16_groups_flat has the same layout as fp32_groups_flat, can we just copy fp32_groups_flat directly into fp16_groups_flat, without unflattening first? That might be faster.
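To make the idea concrete, here is a minimal standalone sketch in plain PyTorch. It is not the DeepSpeed code: the shapes and the `views_of` helper are made up for illustration, and it assumes the per-parameter fp16 tensors really are views into one contiguous flat buffer whose element order matches the fp32 flat buffer.

```python
import torch

# Hypothetical parameter shapes standing in for one parameter group.
shapes = [(4, 3), (5,), (2, 2)]
total = sum(torch.Size(s).numel() for s in shapes)


def views_of(flat, shapes):
    """Return per-parameter views that share storage with `flat` (illustrative helper)."""
    views, offset = [], 0
    for shape in shapes:
        n = torch.Size(shape).numel()
        views.append(flat.narrow(0, offset, n).view(shape))
        offset += n
    return views


# Stand-in for the fp32 master weights of the group, stored flat.
fp32_flat = torch.randn(total, dtype=torch.float32)

# Path A: unflatten the fp32 buffer, then copy parameter by parameter.
fp16_flat_a = torch.zeros(total, dtype=torch.float16)
fp16_params_a = views_of(fp16_flat_a, shapes)
for p, q in zip(fp16_params_a, views_of(fp32_flat, shapes)):
    p.copy_(q)  # copy_ casts fp32 -> fp16

# Path B: one copy over the whole flat buffer.
fp16_flat_b = torch.zeros(total, dtype=torch.float16)
fp16_params_b = views_of(fp16_flat_b, shapes)
fp16_flat_b.copy_(fp32_flat)  # the views see the new values, no unflatten needed

same = all(torch.equal(a, b) for a, b in zip(fp16_params_a, fp16_params_b))
print("per-parameter results identical:", same)
```

Under those assumptions both paths write the same values, and path B does it with a single copy instead of one per parameter. Whether that holds at the linked lines depends on the fp16 parameters still aliasing the flat buffer at that point, which I have not verified for every code path.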

Replies: 0 comments
