Feat: add model format for dpa1 #3211

iProzd · 2024-02-01T10:07:33Z

This PR add model format for DPA1 model:

Add torch reformat implementation for DPA1 model
Add numpy implementation for DPA1 model without attention layer
Align the torch and numpy implementations

TODO:

Add numpy implementation for DPA1 model with attention layer
Align the TF and numpy implementations
Align the smoothness implementations
Make filter_layers._networks in torch be accessable from outside

for more information, see https://pre-commit.ci

deepmd/pt/model/descriptor/dpa1.py

deepmd/model_format/dpa1.py

+        atype_embd = atype_embd_ext[:, :nloc, :]
+        # nf x nloc x nnei x tebd_dim
+        atype_embd_nnei = np.tile(atype_embd[:, :, np.newaxis, :], (1, 1, nnei, 1))
+        nlist_mask = nlist != -1


deepmd/model_format/dpa1.py

source/tests/pt/test_dpa1.py

+        ):
+            dtype = PRECISION_DICT[prec]
+            rtol, atol = get_tols(prec)
+            err_msg = f"idt={idt} prec={prec}"


source/tests/pt/test_dpa1.py

+            dd0.se_atten.mean = torch.tensor(davg, dtype=dtype, device=env.DEVICE)
+            dd0.se_atten.dstd = torch.tensor(dstd, dtype=dtype, device=env.DEVICE)
+            # dd1 = DescrptDPA1.deserialize(dd0.serialize())
+            model = torch.jit.script(dd0)


deepmd/model_format/network.py

+            resnet=False,
+            precision=precision,
+        )
+        self.w = self.w.squeeze(0)  # keep the weight shape to be [num_in]


deepmd/model_format/network.py

+        )
+        self.w = self.w.squeeze(0)  # keep the weight shape to be [num_in]
+        if self.uni_init:
+            self.w = 1.0


deepmd/model_format/network.py

+        self.w = self.w.squeeze(0)  # keep the weight shape to be [num_in]
+        if self.uni_init:
+            self.w = 1.0
+            self.b = 0.0


codecov · 2024-02-01T10:14:57Z

Codecov Report

Attention: 529 lines in your changes are missing coverage. Please review.

Comparison is base (afb440a) 74.39% compared to head (a96cab0) 20.72%.
Report is 2 commits behind head on devel.

Files	Patch %	Lines
deepmd/pt/model/descriptor/se_atten.py	0.00%	200 Missing ⚠️
deepmd/model_format/dpa1.py	0.00%	117 Missing ⚠️
deepmd/model_format/network.py	0.00%	109 Missing ⚠️
deepmd/pt/model/network/mlp.py	0.00%	64 Missing ⚠️
deepmd/pt/model/descriptor/dpa1.py	0.00%	36 Missing ⚠️
deepmd/model_format/__init__.py	0.00%	1 Missing ⚠️
deepmd/pt/model/descriptor/se_a.py	0.00%	1 Missing ⚠️
deepmd/pt/model/task/ener.py	0.00%	1 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##            devel    #3211       +/-   ##
===========================================
- Coverage   74.39%   20.72%   -53.68%     
===========================================
  Files         345      346        +1     
  Lines       31981    32509      +528     
  Branches     1592     1594        +2     
===========================================
- Hits        23791     6736    -17055     
- Misses       7265    25075    +17810     
+ Partials      925      698      -227

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

deepmd/pt/model/descriptor/dpa1.py

+        embeddings = data.pop("embeddings")
+        type_embedding = data.pop("type_embedding")
+        attention_layers = data.pop("attention_layers")
+        env_mat = data.pop("env_mat")


wanghan-iapcm

The serialize and de-serialize of the model_format/dpa1 should be tested.

njzjz · 2024-02-03T14:03:43Z

deepmd/model_format/dpa1.py

+        variables = data.pop("@variables")
+        embeddings = data.pop("embeddings")
+        type_embedding = data.pop("type_embedding")
+        attention_layers = data.pop("attention_layers", None)


Why is it pop and not used?

source/tests/pt/test_dpa1.py

+                dd0_state_dict = dd0.se_atten.state_dict()
+                dd4_state_dict = dd4.se_atten.state_dict()
+
+                dd0_state_dict_attn = dd0.se_atten.dpa1_attention.state_dict()


source/tests/pt/test_dpa1.py

+                dd4_state_dict = dd4.se_atten.state_dict()
+
+                dd0_state_dict_attn = dd0.se_atten.dpa1_attention.state_dict()
+                dd4_state_dict_attn = dd4.se_atten.dpa1_attention.state_dict()


deepmd/model_format/dpa1.py

+        data = copy.deepcopy(data)
+        variables = data.pop("@variables")
+        embeddings = data.pop("embeddings")
+        type_embedding = data.pop("type_embedding")


deepmd/model_format/dpa1.py

+        variables = data.pop("@variables")
+        embeddings = data.pop("embeddings")
+        type_embedding = data.pop("type_embedding")
+        attention_layers = data.pop("attention_layers", None)


njzjz · 2024-02-03T14:00:59Z

deepmd/model_format/dpa1.py

+    Then the scaled dot-product attention method is adopted:
+
+    .. math::
+    A(\mathcal{Q}^{i,l}, \mathcal{K}^{i,l}, \mathcal{V}^{i,l}, \mathcal{R}^{i,l})=\varphi\left(\mathcal{Q}^{i,l}, \mathcal{K}^{i,l},\mathcal{R}^{i,l}\right)\mathcal{V}^{i,l},


Need indents, otherwise, it cannot be rendered correctly. See https://deepmodeling--3211.org.readthedocs.build/projects/deepmd/en/3211/api_py/deepmd.model_format.html#deepmd.model_format.DescrptDPA1

njzjz · 2024-02-03T14:03:43Z

deepmd/model_format/dpa1.py

+        variables = data.pop("@variables")
+        embeddings = data.pop("embeddings")
+        type_embedding = data.pop("type_embedding")
+        attention_layers = data.pop("attention_layers", None)


Why is it pop and not used?

njzjz · 2024-02-03T14:04:20Z

deepmd/model_format/network.py

+    w : np.ndarray, optional
+        The embedding weights of the layer.


Mismatch the actual parameters.

njzjz · 2024-02-03T14:05:18Z

deepmd/model_format/network.py

+    w : np.ndarray, optional
+        The learnable weights of the normalization scale in the layer.
+    b : np.ndarray, optional
+        The learnable biases of the normalization shift in the layer.


Mismatch the actual parameters.

iProzd · 2024-04-21T09:18:10Z

This PR is merged into #3696

Feat: add model format for dpa1

51643af

github-actions bot added the Python label Feb 1, 2024

iProzd requested review from njzjz and wanghan-iapcm February 1, 2024 10:07

[pre-commit.ci] auto fixes from pre-commit.com hooks

8abff4c

for more information, see https://pre-commit.ci

github-advanced-security bot found potential problems Feb 1, 2024

View reviewed changes

fix pre-commit

b4770de

github-advanced-security bot found potential problems Feb 1, 2024

View reviewed changes

deepmd/pt/model/descriptor/dpa1.py

embeddings = data.pop("embeddings")

type_embedding = data.pop("type_embedding")

attention_layers = data.pop("attention_layers")

env_mat = data.pop("env_mat")

Check notice

Code scanning / CodeQL

Unused local variable Note

Variable env_mat is not used.

fix uts

8fec4cb

wanghan-iapcm reviewed Feb 1, 2024

View reviewed changes

iProzd added 3 commits February 1, 2024 22:38

fix uts

42c75de

fix uts

603128a

Add dp impl serialization test

a96cab0

github-advanced-security bot found potential problems Feb 1, 2024

View reviewed changes

njzjz added the Test CUDA Trigger test CUDA workflow label Feb 2, 2024

github-actions bot removed the Test CUDA Trigger test CUDA workflow label Feb 2, 2024

njzjz reviewed Feb 3, 2024

View reviewed changes

njzjz linked an issue Mar 19, 2024 that may be closed by this pull request

[Feature Request] pt: refactor DPA-1 in the PyTorch backend #3506

Closed

iProzd closed this Apr 21, 2024

iProzd deleted the rf_dpa1 branch April 24, 2024 09:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat: add model format for dpa1 #3211

Feat: add model format for dpa1 #3211

iProzd commented Feb 1, 2024

codecov bot commented Feb 1, 2024 •

edited

Loading

wanghan-iapcm left a comment

njzjz Feb 3, 2024

njzjz Feb 3, 2024

njzjz Feb 3, 2024

njzjz Feb 3, 2024

njzjz Feb 3, 2024

iProzd commented Apr 21, 2024

Feat: add model format for dpa1 #3211

Feat: add model format for dpa1 #3211

Conversation

iProzd commented Feb 1, 2024

codecov bot commented Feb 1, 2024 • edited Loading

Codecov Report

wanghan-iapcm left a comment

Choose a reason for hiding this comment

njzjz Feb 3, 2024

Choose a reason for hiding this comment

njzjz Feb 3, 2024

Choose a reason for hiding this comment

njzjz Feb 3, 2024

Choose a reason for hiding this comment

njzjz Feb 3, 2024

Choose a reason for hiding this comment

njzjz Feb 3, 2024

Choose a reason for hiding this comment

iProzd commented Apr 21, 2024

codecov bot commented Feb 1, 2024 •

edited

Loading