Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pd: support dpa2 #4418

Merged
merged 81 commits into from
Dec 25, 2024
Merged
Show file tree
Hide file tree
Changes from 1 commit
Commits
Show all changes
81 commits
Select commit Hold shift + click to select a range
48f77f3
add core modules of paddle backend and water/se_e2_a example
HydrogenSulfate Nov 2, 2024
2082a59
add paddle code in consistent test
HydrogenSulfate Nov 2, 2024
2ae45b8
clean env and training
HydrogenSulfate Nov 2, 2024
7f03a04
add more test files
HydrogenSulfate Nov 2, 2024
4d1c44c
Merge branch 'devel' into add_paddle_backend_core_and_water_se_e2_a
HydrogenSulfate Nov 2, 2024
72c9b4e
fix pt->pd
HydrogenSulfate Nov 2, 2024
3b1c348
update test_python.yml
HydrogenSulfate Nov 2, 2024
a46dcb5
restore .pre-commit-config.yaml
HydrogenSulfate Nov 3, 2024
90f9ff9
remove redundant file
HydrogenSulfate Nov 3, 2024
0a6baa6
Skip bfloat16 for some cases
HydrogenSulfate Nov 3, 2024
4b77e55
enable prim by default in unitest
HydrogenSulfate Nov 3, 2024
6e139a2
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Nov 3, 2024
9437957
fix env code
HydrogenSulfate Nov 5, 2024
f1d762f
Merge branch 'devel' into add_paddle_backend_core_and_water_se_e2_a
HydrogenSulfate Nov 5, 2024
8534597
Merge branch 'devel' into add_paddle_backend_core_and_water_se_e2_a
HydrogenSulfate Nov 6, 2024
c22b45d
update test_ener.py
HydrogenSulfate Nov 6, 2024
39842ff
add missing pd_class
HydrogenSulfate Nov 6, 2024
07cd98e
use paddle Tensor instead of numpy array in pd/test_auto_batch_size.p…
HydrogenSulfate Nov 6, 2024
bb2d547
add training test and remove ase_calc.py
HydrogenSulfate Nov 7, 2024
5fb6d8e
add training test and remove ase_calc.py
HydrogenSulfate Nov 7, 2024
91066f8
Merge branch 'devel' into add_paddle_backend_core_and_water_se_e2_a
HydrogenSulfate Nov 7, 2024
90c9c03
upload missing json
HydrogenSulfate Nov 7, 2024
eb7384e
restore pt/test_auto_batch_size.py
HydrogenSulfate Nov 7, 2024
9faf54f
rerun CI for network problem
HydrogenSulfate Nov 7, 2024
4e3a121
add multitask unitest
HydrogenSulfate Nov 7, 2024
18333ab
add more unitest
HydrogenSulfate Nov 7, 2024
f9c6da8
Merge branch 'devel' into add_paddle_backend_core_and_water_se_e2_a
HydrogenSulfate Nov 7, 2024
3fd979d
remove redundant file and fix typo
HydrogenSulfate Nov 7, 2024
5922e84
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Nov 7, 2024
f5a17a9
update unitest
HydrogenSulfate Nov 8, 2024
8bea1bf
delete record
HydrogenSulfate Nov 8, 2024
8a7875f
remove more unused code and files
HydrogenSulfate Nov 8, 2024
df9f887
Merge branch 'devel' into add_paddle_backend_core_and_water_se_e2_a
HydrogenSulfate Nov 8, 2024
67b81e1
Merge branch 'devel' into add_paddle_backend_core_and_water_se_e2_a
HydrogenSulfate Nov 8, 2024
b0bf733
Merge branch 'add_paddle_backend_core_and_water_se_e2_a' of https://g…
HydrogenSulfate Nov 8, 2024
71a3c0a
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Nov 8, 2024
ede5047
remove redundant annotations
HydrogenSulfate Nov 8, 2024
d11bf4d
Merge branch 'devel' into add_paddle_backend_core_and_water_se_e2_a
HydrogenSulfate Nov 8, 2024
b7a8cec
add nvtx profiler code in training, which is more accurate and detailed
HydrogenSulfate Nov 8, 2024
7567cf8
Merge branch 'devel' into add_paddle_backend_core_and_water_se_e2_a
HydrogenSulfate Nov 8, 2024
416fec8
update code as devel and fix typo
HydrogenSulfate Nov 9, 2024
1c0161c
fix pth -> json
HydrogenSulfate Nov 9, 2024
02a6f84
Merge branch 'devel' into add_paddle_backend_core_and_water_se_e2_a
HydrogenSulfate Nov 9, 2024
3354e5c
update unitest and training
HydrogenSulfate Nov 9, 2024
0d3f8cf
install paddle when test_cuda
HydrogenSulfate Nov 9, 2024
18215ff
Merge branch 'devel' into add_paddle_backend_core_and_water_se_e2_a
HydrogenSulfate Nov 9, 2024
859b94d
fix unitest
HydrogenSulfate Nov 9, 2024
74ee1c2
add eta in logging message for convenient
HydrogenSulfate Nov 9, 2024
f176309
remove hybrid code and enable one unitest
HydrogenSulfate Nov 9, 2024
4935e7b
Merge branch 'devel' into add_paddle_backend_core_and_water_se_e2_a
HydrogenSulfate Nov 9, 2024
fac51d3
add pd/__init__.py
HydrogenSulfate Nov 11, 2024
d3ca1f0
Merge branch 'devel' into add_paddle_backend_core_and_water_se_e2_a
HydrogenSulfate Nov 11, 2024
db1cd76
fix enable_prim
HydrogenSulfate Nov 11, 2024
36512fd
remove unused layernorm
HydrogenSulfate Nov 11, 2024
351bf7a
update dpa1 code
HydrogenSulfate Nov 25, 2024
01d4179
Merge branch 'devel' into add_dpa1
HydrogenSulfate Nov 25, 2024
701926a
update dpa2 code
HydrogenSulfate Nov 25, 2024
bc1cb38
update code of dpa1
HydrogenSulfate Nov 28, 2024
b4bc9db
Merge branch 'devel' into add_dpa1
HydrogenSulfate Nov 28, 2024
c944b82
restore decomp to paddle function
HydrogenSulfate Nov 28, 2024
4c925f9
remove redundant files
HydrogenSulfate Nov 28, 2024
dd3191a
update unitest and codes
HydrogenSulfate Nov 29, 2024
7df0e2f
fix
HydrogenSulfate Nov 29, 2024
ac479ed
update code
HydrogenSulfate Nov 29, 2024
3e64196
Merge branch 'devel' into add_dpa1
HydrogenSulfate Nov 29, 2024
56e079c
update typos
HydrogenSulfate Nov 29, 2024
3d70e7c
update code
HydrogenSulfate Nov 29, 2024
8b5b4a8
fix coverage
HydrogenSulfate Nov 29, 2024
e74d272
update consistent check of dpa1
HydrogenSulfate Nov 30, 2024
1a79a06
Merge branch 'add_dpa1' into add_dpa2_v2
HydrogenSulfate Dec 1, 2024
4520e97
add unittests code of dpa2 and replace several decomp. API to paddle.…
HydrogenSulfate Dec 1, 2024
79ed0f0
fix typos
HydrogenSulfate Dec 1, 2024
ed80c6d
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Dec 2, 2024
89d82f9
update UT code in test_se_t_tebd
HydrogenSulfate Dec 2, 2024
984b242
Merge branch 'add_dpa2_v22' into add_dpa2_v2
HydrogenSulfate Dec 2, 2024
617a258
update __init__
HydrogenSulfate Dec 2, 2024
6e5ebb3
solve code QL
HydrogenSulfate Dec 3, 2024
3cfb90f
Merge branch 'devel' into add_dpa2_v2
HydrogenSulfate Dec 18, 2024
575726a
fix unitest and typo
HydrogenSulfate Dec 19, 2024
befc111
Merge branch 'devel' into add_dpa2_v2
HydrogenSulfate Dec 23, 2024
abeae6d
update reference in dpa2.py
HydrogenSulfate Dec 23, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
add training test and remove ase_calc.py
  • Loading branch information
HydrogenSulfate committed Nov 7, 2024
commit bb2d54709c07da3c6ae0a74033c454603714a73d
52 changes: 26 additions & 26 deletions .pre-commit-config.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -65,13 +65,13 @@ repos:
- id: clang-format
exclude: ^(source/3rdparty|source/lib/src/gpu/cudart/.+\.inc|.+\.ipynb$)
# markdown, yaml, CSS, javascript
- repo: https://github.com/pre-commit/mirrors-prettier
rev: v4.0.0-alpha.8
hooks:
- id: prettier
types_or: [markdown, yaml, css]
# workflow files cannot be modified by pre-commit.ci
exclude: ^(source/3rdparty|\.github/workflows|\.clang-format)
# - repo: https://github.com/pre-commit/mirrors-prettier
# rev: v4.0.0-alpha.8
# hooks:
# - id: prettier
# types_or: [markdown, yaml, css]
# # workflow files cannot be modified by pre-commit.ci
# exclude: ^(source/3rdparty|\.github/workflows|\.clang-format)
# Shell
- repo: https://github.com/scop/pre-commit-shfmt
rev: v3.10.0-1
Expand All @@ -83,25 +83,25 @@ repos:
hooks:
- id: cmake-format
#- id: cmake-lint
- repo: https://github.com/njzjz/mirrors-bibtex-tidy
rev: v1.13.0
hooks:
- id: bibtex-tidy
args:
- --curly
- --numeric
- --align=13
- --blank-lines
# disable sort: the order of keys and fields has explict meanings
#- --sort=key
- --duplicates=key,doi,citation,abstract
- --merge=combine
#- --sort-fields
#- --strip-comments
- --trailing-commas
- --encode-urls
- --remove-empty-fields
- --wrap=80
# - repo: https://github.com/njzjz/mirrors-bibtex-tidy
# rev: v1.13.0
# hooks:
# - id: bibtex-tidy
# args:
# - --curly
# - --numeric
# - --align=13
# - --blank-lines
# # disable sort: the order of keys and fields has explict meanings
# #- --sort=key
# - --duplicates=key,doi,citation,abstract
# - --merge=combine
# #- --sort-fields
# #- --strip-comments
# - --trailing-commas
# - --encode-urls
# - --remove-empty-fields
# - --wrap=80
# license header
- repo: https://github.com/Lucas-C/pre-commit-hooks
rev: v1.5.5
Expand Down
6 changes: 0 additions & 6 deletions deepmd/pd/utils/ase_calc.py

This file was deleted.

176 changes: 176 additions & 0 deletions source/tests/pd/test_training.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,176 @@
# SPDX-License-Identifier: LGPL-3.0-or-later
import json
import os
import shutil
import unittest
from copy import (
deepcopy,
)
from pathlib import (
Path,
)

import numpy as np

from deepmd.pd.entrypoints.main import (
get_trainer,
)
from deepmd.pd.utils.finetune import (
get_finetune_rules,
)

from .model.test_permutation import (
model_se_e2_a,
)


class DPTrainTest:
def test_dp_train(self):
# test training from scratch
trainer = get_trainer(deepcopy(self.config))
trainer.run()
state_dict_trained = trainer.wrapper.model.state_dict()

# test fine-tuning using same input
finetune_model = self.config["training"].get("save_ckpt", "model.ckpt") + ".pd"
self.config["model"], finetune_links = get_finetune_rules(
finetune_model,
self.config["model"],
)
trainer_finetune = get_trainer(
deepcopy(self.config),
finetune_model=finetune_model,
finetune_links=finetune_links,
)

# test fine-tuning using empty input
self.config_empty = deepcopy(self.config)
if "descriptor" in self.config_empty["model"]:
self.config_empty["model"]["descriptor"] = {}
if "fitting_net" in self.config_empty["model"]:
self.config_empty["model"]["fitting_net"] = {}
self.config_empty["model"], finetune_links = get_finetune_rules(
finetune_model,
self.config_empty["model"],
change_model_params=True,
)
trainer_finetune_empty = get_trainer(
deepcopy(self.config_empty),
finetune_model=finetune_model,
finetune_links=finetune_links,
)

# test fine-tuning using random fitting
self.config["model"], finetune_links = get_finetune_rules(
finetune_model, self.config["model"], model_branch="RANDOM"
)
trainer_finetune_random = get_trainer(
deepcopy(self.config_empty),
finetune_model=finetune_model,
finetune_links=finetune_links,
)

# check parameters
state_dict_finetuned = trainer_finetune.wrapper.model.state_dict()
state_dict_finetuned_empty = trainer_finetune_empty.wrapper.model.state_dict()
state_dict_finetuned_random = trainer_finetune_random.wrapper.model.state_dict()
for state_key in state_dict_finetuned:
if "out_bias" not in state_key and "out_std" not in state_key:
np.testing.assert_allclose(
state_dict_trained[state_key].numpy(),
state_dict_finetuned[state_key].numpy(),
)
np.testing.assert_allclose(
state_dict_trained[state_key].numpy(),
state_dict_finetuned_empty[state_key].numpy(),
)
if "fitting_net" not in state_key:
np.testing.assert_allclose(
state_dict_trained[state_key].numpy(),
state_dict_finetuned_random[state_key].numpy(),
)

# check running
trainer_finetune.run()
trainer_finetune_empty.run()
trainer_finetune_random.run()

def test_trainable(self):
fix_params = deepcopy(self.config)
fix_params["model"]["descriptor"]["trainable"] = False
fix_params["model"]["fitting_net"]["trainable"] = False
free_descriptor = hasattr(self, "not_all_grad") and self.not_all_grad
if free_descriptor:
# can not set requires_grad false for all parameters,
# because the input coord has no grad, thus the loss if all set to false
# we only check trainable for fitting net
fix_params["model"]["descriptor"]["trainable"] = True
trainer_fix = get_trainer(fix_params)
model_dict_before_training = deepcopy(
trainer_fix.model.get_fitting_net().state_dict()
)
trainer_fix.run()
model_dict_after_training = deepcopy(
trainer_fix.model.get_fitting_net().state_dict()
)
else:
trainer_fix = get_trainer(fix_params)
model_dict_before_training = deepcopy(trainer_fix.model.state_dict())
trainer_fix.run()
model_dict_after_training = deepcopy(trainer_fix.model.state_dict())
for key in model_dict_before_training:
np.testing.assert_allclose(
model_dict_before_training[key].numpy(),
model_dict_after_training[key].numpy(),
)

def tearDown(self):
for f in os.listdir("."):
if f.startswith("model") and f.endswith(".pd"):
os.remove(f)
if f in ["lcurve.out"]:
os.remove(f)
if f in ["stat_files"]:
shutil.rmtree(f)


class TestEnergyModelSeA(unittest.TestCase, DPTrainTest):
def setUp(self):
input_json = str(Path(__file__).parent / "water/se_atten.json")
with open(input_json) as f:
self.config = json.load(f)
data_file = [str(Path(__file__).parent / "water/data/data_0")]
self.config["training"]["training_data"]["systems"] = data_file
self.config["training"]["validation_data"]["systems"] = data_file
self.config["model"] = deepcopy(model_se_e2_a)
self.config["training"]["numb_steps"] = 1
self.config["training"]["save_freq"] = 1

def tearDown(self) -> None:
DPTrainTest.tearDown(self)


class TestFparam(unittest.TestCase, DPTrainTest):
"""Test if `fparam` can be loaded correctly."""

def setUp(self):
input_json = str(Path(__file__).parent / "water/se_atten.json")
with open(input_json) as f:
self.config = json.load(f)
data_file = [str(Path(__file__).parent / "water/data/data_0")]
self.config["training"]["training_data"]["systems"] = data_file
self.config["training"]["validation_data"]["systems"] = data_file
self.config["model"] = deepcopy(model_se_e2_a)
self.config["model"]["fitting_net"]["numb_fparam"] = 1
self.config["training"]["numb_steps"] = 1
self.config["training"]["save_freq"] = 1
self.set_path = Path(__file__).parent / "water/data/data_0" / "set.000"
shutil.copyfile(self.set_path / "energy.npy", self.set_path / "fparam.npy")

def tearDown(self) -> None:
(self.set_path / "fparam.npy").unlink(missing_ok=True)
DPTrainTest.tearDown(self)


if __name__ == "__main__":
unittest.main()
23 changes: 12 additions & 11 deletions source/tests/pt/test_auto_batch_size.py
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,7 @@
import unittest

import numpy as np
import torch

from deepmd.pt.utils.auto_batch_size import (
AutoBatchSize,
Expand All @@ -10,28 +11,28 @@

class TestAutoBatchSize(unittest.TestCase):
def test_execute_all(self):
dd0 = np.zeros((10000, 2, 1, 3, 4))
dd1 = np.ones((10000, 2, 1, 3, 4))
dd0 = torch.zeros((10000, 2, 1, 3, 4))
dd1 = torch.ones((10000, 2, 1, 3, 4))
auto_batch_size = AutoBatchSize(256, 2.0)

def func(dd1):
return np.zeros_like(dd1), np.ones_like(dd1)
return torch.zeros_like(dd1), torch.ones_like(dd1)

dd2 = auto_batch_size.execute_all(func, 10000, 2, dd1)
np.testing.assert_equal(dd0, dd2[0])
np.testing.assert_equal(dd1, dd2[1])
np.testing.assert_equal(dd0.cpu().numpy(), dd2[0].cpu().numpy())
np.testing.assert_equal(dd1.cpu().numpy(), dd2[1].cpu().numpy())

def test_execute_all_dict(self):
dd0 = np.zeros((10000, 2, 1, 3, 4))
dd1 = np.ones((10000, 2, 1, 3, 4))
dd0 = torch.zeros((10000, 2, 1, 3, 4))
dd1 = torch.ones((10000, 2, 1, 3, 4))
auto_batch_size = AutoBatchSize(256, 2.0)

def func(dd1):
return {
"foo": np.zeros_like(dd1),
"bar": np.ones_like(dd1),
"foo": torch.zeros_like(dd1),
"bar": torch.ones_like(dd1),
}

dd2 = auto_batch_size.execute_all(func, 10000, 2, dd1)
np.testing.assert_equal(dd0, dd2["foo"])
np.testing.assert_equal(dd1, dd2["bar"])
np.testing.assert_equal(dd0.cpu().numpy(), dd2["foo"].cpu().numpy())
np.testing.assert_equal(dd1.cpu().numpy(), dd2["bar"].cpu().numpy())