How to establish a reasonable training set when the system charged #693
-
Dear developers, I am trying to fine-tuning Foundation model, MACE-mp-0b. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
(1) I think it is generally true the more configurations you have that are relevant to what you want to study, the better your model will be. How many you need, is hard to say without knowing more precisely what you are trying to achieve. When you fine-tune, often a few dozen configurations are enough to get a meaningful improvement in accuracy for specific systems. (2) yes, I think higher energy structures like the ones you describe will be informative. But the best structures are those that you directly care about, so if you want to study crystals at high temperatures, do simulations with the foundation model of that system, collect configurations, and fine tune on those. One thing you might run into is that the foundation model might cause your structure to melt at a temperature that is below the real melting point. This should be improved with just a few dozen configs collected at high temperature (both before and after the melting), so if you then continue to collect configurations with your find-tuned model, you should be able to simulate your high temperature solid without melting it at too low a temperature. |
Beta Was this translation helpful? Give feedback.
the current version of mace does not do explicit electrostatic long range interactions. But this doesn't mean that you can't study protons in insulators, especially ones which have very good screening (e.g. water), so that the long range contribution is small. if your DFT data contains protons, mace will learn them. in practice I think it does help if your overall cell is neutral. we do quite well on proton hopping in water for example. you don't need to indicate in lammps that your H is a proton, its behaviour will be inferred from its local environment.
there are plans to treat charges, fermi levels, long range electrostatics more properly, stay tuned.