RTFS DeepMD #4

markcoletti · 2023-10-24T18:12:42Z

Take a deep dive into the DeepMD code-base. We need to understand fundamentally how it works.

source
- https://github.com/minnervva/deepmd-kit (Original repo: https://github.com/deepmodeling/deepmd-kit)
- new things in version 2: https://pubs.aip.org/aip/jcp/article/159/5/054801/2904916/DeePMD-kit-v2-A-software-package-for-deep
people
- Mark
- Ada
- Wael
find where tensorflow is being invoked
find the dataloader

asedova · 2023-10-24T19:42:27Z

Found the dataloader in https://github.com/deepmodeling/deepmd-kit/blob/master/deepmd/train/trainer.py. It uses https://github.com/deepmodeling/deepmd-kit/tree/master/deepmd/utils/random.py and data_system.py in that same utils dir. random.py is just a wrapper around an older numpy random function (RandomState) which is technically deprecated, but there is a seed set that is passed in from the input json file that should work ok. Otherwise the frames are just chosen using this RNG (which is also strange since you would think you would want to train on ALL the frames, not just a random subset, that could potentially have repetitions?). But anyway, it does seem like at this DeePMD level, the data loading should be deterministic. We still may have some type of streaming happening at the TF or Horovod level though.

asedova · 2023-10-24T19:43:49Z

Need to next check the TF/horovod levels of distributed training to see if there may be some task stealing or asynchronous data streaming or something.

markcoletti assigned elwasif and markcoletti and unassigned elwasif Oct 24, 2023

markcoletti mentioned this issue Oct 24, 2023

Fork DeePMD repo #12

Closed

asedova assigned asedova and markcoletti and unassigned markcoletti and asedova Oct 24, 2023

asedova added the question Further information is requested label Oct 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RTFS DeepMD #4

RTFS DeepMD #4

markcoletti commented Oct 24, 2023 •

edited

Loading

asedova commented Oct 24, 2023 •

edited

Loading

asedova commented Oct 24, 2023

RTFS DeepMD #4

RTFS DeepMD #4

Comments

markcoletti commented Oct 24, 2023 • edited Loading

asedova commented Oct 24, 2023 • edited Loading

asedova commented Oct 24, 2023

markcoletti commented Oct 24, 2023 •

edited

Loading

asedova commented Oct 24, 2023 •

edited

Loading