
[BUG] Finetuned model has wrong type_map #3455

Closed
iProzd opened this issue Mar 13, 2024 · 8 comments · Fixed by #3803
iProzd (Collaborator) commented Mar 13, 2024

Bug summary

When doing fine-tuning, the user-defined type_map (e.g. ['H', 'O']) will be overridden by the type_map in the pretrained model (e.g. the whole periodic table), which is confusing for users.
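
For illustration, here is a minimal sketch of the mismatch (hypothetical file names and values; sections of the input are omitted):

```python
# Minimal sketch of the situation (hypothetical values, sections omitted).
# The user's fine-tuning input restricts the model to two elements:
finetune_input = {
    "model": {
        "type_map": ["H", "O"],  # what the user asked for
        # ... descriptor / fitting_net sections omitted ...
    },
    # ... training / loss sections omitted ...
}

# The pre-trained checkpoint, however, was trained with a much larger
# type_map (e.g. the whole periodic table):
pretrained_type_map = ["H", "He", "Li", "Be", "B", "C", "N", "O"]  # ... truncated

# Observed bug: after fine-tuning (roughly
#   dp --pt train finetune.json --finetune pretrained.pt
# ), the saved model reports the pretrained type_map instead of ["H", "O"].
```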

DeePMD-kit Version

3.0.0a

TensorFlow Version

2.6.0

How did you download the software?

Built from source

Input Files, Running Commands, Error Log, etc.

See above.

Steps to Reproduce

See above.

Further Information, Files, and Links

No response

@iProzd iProzd added the bug label Mar 13, 2024
@iProzd iProzd self-assigned this Mar 13, 2024
njzjz (Member) commented Mar 13, 2024

Idea: the easiest way is to add a virtual Model, "adapt type map model", which just adapts the input atom_type from the outer model type_map to the inner model type_map and forwards everything else like #3450.
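
A rough sketch of the index remapping such a wrapper could perform (the names below are illustrative, not the actual DeePMD-kit API):

```python
import numpy as np

def build_type_mapping(outer_type_map, inner_type_map):
    """Map type indices of the outer (user-facing) type_map to the
    indices used by the wrapped (pre-trained) model."""
    inner_index = {elem: i for i, elem in enumerate(inner_type_map)}
    # Every outer type is assumed to exist in the inner model's type_map.
    return np.array([inner_index[elem] for elem in outer_type_map], dtype=np.int64)

# Example: the user works with ["H", "O"], the pre-trained model knows more types.
outer = ["H", "O"]
inner = ["H", "He", "Li", "Be", "B", "C", "N", "O"]  # truncated for brevity
mapping = build_type_mapping(outer, inner)           # -> array([0, 7])

# Inside the wrapper's forward pass, atom types given in the outer convention
# would be translated before being handed to the inner model:
atype_outer = np.array([0, 1, 1, 0])                 # H, O, O, H
atype_inner = mapping[atype_outer]                   # -> array([0, 7, 7, 0])
```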

iProzd (Collaborator, Author) commented Mar 13, 2024

> Idea: the easiest way is to add a virtual Model, "adapt type map model", which just adapts the input atom_type from the outer model type_map to the inner model type_map and forwards everything else like #3450.

I see, but this will cause the model to be wrapped repeatedly each time it is fine-tuned, as discussed with @wanghan-iapcm and @anyangml.

njzjz (Member) commented Mar 13, 2024

> repeatedly each time it is fine-tuned

Indeed, I don't understand why it needs to change the type map each time it is fine-tuned...

anyangml (Collaborator) commented

> repeatedly each time it is fine-tuned

> Indeed, I don't understand why it needs to change the type map each time it is fine-tuned...

For the LinearModel, let's say we have two pre-trained models: model A with ["H", "O", "Na"], model B with ["H", "O", "K"]. Now if we want to finetune a LinearModel with this, the new type map becomes ["H", "O"].

njzjz (Member) commented Mar 13, 2024

> repeatedly each time it is fine-tuned

> Indeed, I don't understand why it needs to change the type map each time it is fine-tuned...

> For the LinearModel, let's say we have two pre-trained models: model A with ["H", "O", "Na"], model B with ["H", "O", "K"]. Now if we want to finetune a LinearModel with this, the new type map becomes ["H", "O"].

This is not correct. The type map should be the union of two models.
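
For the LinearModel discussion above, a hedged sketch of what taking the union would look like (illustrative only, not the DeePMD-kit implementation):

```python
def union_type_map(type_map_a, type_map_b):
    """Union of two type maps: keep the order of the first and append
    the extra types of the second."""
    merged = list(type_map_a)
    merged += [t for t in type_map_b if t not in merged]
    return merged

tm_a = ["H", "O", "Na"]
tm_b = ["H", "O", "K"]
combined = union_type_map(tm_a, tm_b)  # -> ["H", "O", "Na", "K"]

# Each sub-model then needs an index mapping from the combined type_map to
# its own type_map; types it has never seen can be flagged (here with -1):
map_to_a = [tm_a.index(t) if t in tm_a else -1 for t in combined]  # [0, 1, 2, -1]
map_to_b = [tm_b.index(t) if t in tm_b else -1 for t in combined]  # [0, 1, -1, 2]
```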

anyangml (Collaborator) commented

> repeatedly each time it is fine-tuned

> Indeed, I don't understand why it needs to change the type map each time it is fine-tuned...

> For the LinearModel, let's say we have two pre-trained models: model A with ["H", "O", "Na"], model B with ["H", "O", "K"]. Now if we want to finetune a LinearModel with this, the new type map becomes ["H", "O"].

> This is not correct. The type map should be the union of two models.

I think the combined model should only handle the common types. If the new type map is the union of the two, there will be unseen types for each individual model.

njzjz (Member) commented Mar 14, 2024

> repeatedly each time it is fine-tuned

> Indeed, I don't understand why it needs to change the type map each time it is fine-tuned...

> For the LinearModel, let's say we have two pre-trained models: model A with ["H", "O", "Na"], model B with ["H", "O", "K"]. Now if we want to finetune a LinearModel with this, the new type map becomes ["H", "O"].

> This is not correct. The type map should be the union of two models.

> I think the combined model should only handle the common types. If the new type map is the union of the two, there will be unseen types for each individual model.

A model doesn't need to evaluate all types. A typical example is DPLR. Pairwise potentials may also target only certain types.

wanghan-iapcm (Collaborator) commented

> repeatedly each time it is fine-tuned

> Indeed, I don't understand why it needs to change the type map each time it is fine-tuned...

Because the user may provide a new type_map that is not consistent with the model's type_map.

@njzjz njzjz added this to the v3.0.0 milestone Mar 16, 2024
github-merge-queue bot pushed a commit that referenced this issue Mar 22, 2024
This PR:
1. Merges `change_energy_bias` into `compute_output_stats` and reformats it as `change_out_bias` at the `model` level.
2. Supports single-task/multi-task fine-tuning from a single-task/multi-task pretrained model.

Needs fixing in a future PR:
1. The fine-tuned model still has its `type_map` overridden by the pretrained model. (Once fixed, the `change_out_bias` function will no longer need the input params `origin_type_map` and `full_type_map`.) See also #3455.
2. `change_out_bias` support for other models (e.g. Spin, ZBL, Polar, Dipole and DOS).
@iProzd iProzd moved this to Backlog in DeePMD-3.0.0 beta release Apr 30, 2024
@njzjz njzjz linked a pull request Jun 6, 2024 that will close this issue
github-merge-queue bot pushed a commit that referenced this issue Jun 13, 2024
Fix #3747. Fix #3455.

- Fine-tuning is now consistent with init-model. In the PyTorch backend (pt), fine-tuning includes three steps:
  1. Change model params (for multitask fine-tuning, random fitting and type-related params),
  2. Init-model,
  3. Change bias.

- By default, the input script uses the user's input during fine-tuning instead of being overwritten by the one in the pre-trained model. When `--use-pretrain-script` is added, the user can use the model parameters from the pre-trained model (see the example commands after this commit message).

- `type_map` now uses the one in the user input instead of being overwritten by the one in the pre-trained model.

Note:
1. After discussion with @wanghan-iapcm, **the behavior of fine-tuning in TF is kept as before**. If needed in the future, it can be implemented then.
2. Fine-tuning using DOSModel in PT needs to be fixed (an issue will be opened; it may be fixed in another PR, cc @anyangml).

## Summary by CodeRabbit

- **New Features**
- Added support for using model parameters from a pretrained model
script.
- Introduced new methods to handle type-related parameters and
fine-tuning configurations.

- **Documentation**
- Updated documentation to clarify the model section requirements and
the new `--use-pretrain-script` option for fine-tuning.

- **Refactor**
- Simplified and improved the readability of key functions related to
model training and fine-tuning.

- **Tests**
- Added new test methods and utility functions to ensure consistency of
type mapping and parameter updates.

---------

Signed-off-by: Duo <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Han Wang <[email protected]>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
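
A hedged example of the two fine-tuning modes described in the commit message above (placeholder file names; consult the DeePMD-kit documentation for the authoritative syntax):

```python
import subprocess

# 1. Default behavior after this fix: the model section of input.json,
#    including `type_map`, is taken from the user's input.
subprocess.run(
    ["dp", "--pt", "train", "input.json", "--finetune", "pretrained.pt"],
    check=True,
)

# 2. With --use-pretrain-script: the model parameters are taken from the
#    script stored in the pre-trained model instead of input.json.
subprocess.run(
    ["dp", "--pt", "train", "input.json", "--finetune", "pretrained.pt",
     "--use-pretrain-script"],
    check=True,
)
```
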
@iProzd iProzd closed this as completed Jun 13, 2024
@github-project-automation github-project-automation bot moved this from Backlog to Done in DeePMD-3.0.0 beta release Jun 13, 2024
mtaillefumier pushed a commit to mtaillefumier/deepmd-kit that referenced this issue Sep 18, 2024