
[Paddle Backend] Add mixed precision training for se_e2_a_mixed_prec #3030

Closed — wanted to merge 56 commits

Conversation

@HydrogenSulfate (Contributor) commented Dec 4, 2023

Add a mixed-precision training mode for se_e2_a_mixed_prec. Time & memory cost below:

|  | Time | Mem |
| --- | --- | --- |
| fp32 | ~3.0 s / 100 batches | ~1.9 GB |
| mixed precision (fp16+fp32, manual) | ~3.5 s / 100 batches | ~1.5 GB |
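The "manual" mixed-precision scheme referred to above typically keeps fp32 master weights, runs forward/backward in fp16, and applies loss scaling so small fp16 gradients do not underflow. The following is a minimal self-contained sketch of that pattern in numpy; the names (`SCALE`, `sgd_step_mixed`, the toy linear model) are illustrative, not code from this PR or from Paddle's AMP API.

```python
import numpy as np

SCALE = 2.0 ** 10  # static loss scale (assumed; real AMP often scales dynamically)

def sgd_step_mixed(master_w, x, y, lr=0.1):
    """One SGD step on a toy 1-D linear model y ~ w*x, computed in fp16.

    master_w stays in fp32; the fp16 copy is used only for compute.
    """
    w16 = np.float16(master_w)            # cast weights down for compute
    x16 = x.astype(np.float16)
    y16 = y.astype(np.float16)
    err = w16 * x16 - y16                 # fp16 forward + error
    # Scaled fp16 backward: d(0.5*err^2)/dw = err * x, multiplied by SCALE
    # so tiny gradients survive the limited fp16 range.
    grad16 = (err * x16) * np.float16(SCALE)
    # Unscale and apply the update in fp32 on the master copy.
    grad32 = grad16.astype(np.float32).mean() / SCALE
    return np.float32(master_w - lr * grad32)

w = np.float32(0.0)
x = np.array([1.0, 2.0], dtype=np.float32)
y = np.array([2.0, 4.0], dtype=np.float32)  # true weight is 2
for _ in range(200):
    w = sgd_step_mixed(w, x, y)
```

After the loop, `w` sits close to the true value 2.0, within fp16 rounding noise; the fp32 master copy is what keeps the accumulated updates from being swallowed by fp16's coarse precision.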
  • (GPU) Training metric with water (se_e2_a_mixed_prec): [image]

    Reference (fp64): [image]
zhwesky2010 and others added 30 commits April 10, 2021 09:51
Support virial forward run on gpu with cpu kernel
DeePMD example in Paddle, covering just the 'water_se_a' model
jim19930609 and others added 25 commits July 20, 2021 03:32
Following issues fixed:
1. Removed @paddle.jit.to_static decorator. Model will be converted to
static graph at save time.

2. Manually set InputSpec for "Ener" model with "se_a" descriptor

3. Due to the lack of support for the "double" datatype at inference time,
the default training precision was set to float (low precision)
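The trade-off in item 3 — falling back from double to single precision — can be illustrated with a quick numpy check of what each dtype can represent (illustrative only, not DeePMD or Paddle code):

```python
import numpy as np

# float32 carries roughly 7 decimal digits, float64 roughly 16.
# A 1e-10 perturbation on 1.0 is below float32's resolution,
# so it is rounded away in fp32 but preserved in fp64.
v = 1.0 + 1e-10
lost_in_fp32 = np.float32(v) == np.float32(1.0)   # True: increment rounded off
kept_in_fp64 = np.float64(v) != np.float64(1.0)   # True: increment survives
```

This is why defaulting to float is a real precision concession for a force-field model, not just a storage detail.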
[Paddle] Fixed model save issues with Ener model
A functional regression was detected between CUDA 10.1 and CUDA 11.2.

Therefore the three custom ops "env_mat", "force_se_a", and "virial_se_a" are forced to fall back to the CPU.

Minor changes to save_model function in "trainer.py" to suppress dynamic-to-static warnings
Force env_mat force_se_a virial_se_a to fallback on CPU
Revert "Force env_mat force_se_a virial_se_a to fallback on CPU"
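The per-op CPU fallback described in the commits above (and later reverted) amounts to a denylist consulted at dispatch time: listed ops always run their CPU kernel even when a GPU device is requested. A minimal sketch of that dispatch pattern, with hypothetical op names and a toy kernel table (Paddle's actual op registration works differently):

```python
# Ops forced onto CPU regardless of the requested device (names from the PR).
CPU_FALLBACK = {"env_mat", "force_se_a", "virial_se_a"}

# Toy kernel table: each kernel reports which device it ran on.
KERNELS = {
    ("env_mat", "cpu"): lambda x: ("cpu", x),
    ("env_mat", "gpu"): lambda x: ("gpu", x),
    ("prod_force", "cpu"): lambda x: ("cpu", x),
    ("prod_force", "gpu"): lambda x: ("gpu", x),
}

def run_op(name, device, x):
    """Dispatch op `name`, redirecting denylisted ops to their CPU kernel."""
    if name in CPU_FALLBACK:
        device = "cpu"
    return KERNELS[(name, device)](x)

env_result = run_op("env_mat", "gpu", 1)      # redirected: runs on "cpu"
other_result = run_op("prod_force", "gpu", 1)  # not listed: stays on "gpu"
```

The appeal of this shape is that the workaround is one table entry per affected op, which is also why it was easy to revert once the CUDA regression no longer applied.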
@njzjz njzjz changed the base branch from paddle to paddle2 December 21, 2023 03:14
njzjz pushed a commit that referenced this pull request Dec 30, 2023
…revert code format) (#3096)

Code formatting for #3030 

---------

Signed-off-by: HydrogenSulfate <[email protected]>
Co-authored-by: zhouwei25 <[email protected]>
Co-authored-by: JiabinYang <[email protected]>
Co-authored-by: Han Wang <[email protected]>
Co-authored-by: Zhanlue Yang <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
@njzjz njzjz closed this Dec 30, 2023
@HydrogenSulfate HydrogenSulfate deleted the add_amp branch September 3, 2024 14:35
6 participants