[feature request] OpenMP support for LAMMPS ML-PACE #57
Dear @bernstei, I'm afraid it is not so trivial. The core method is […]. I could imagine two strategies: […]

What would be your thoughts about it?
Per-thread arrays sound simpler, as long as they're not so big that their memory usage becomes an issue. I guess it'd also depend on exactly how those global arrays are used: are they just filled in once at the beginning and then used for each atom? Are there contributions from each atom that have to be gathered somehow? That's easy if they involve writing to different array locations, but harder if contributions from different atoms have to be summed. I can look at how those global arrays are used and see if I have any specific thoughts. Of those 3 versions of […]
These arrays are used for every given current central atom. The logic behind all three evaluators is to work on an individual atom at any given moment, so there is no need to collect anything over atoms in […]. The default evaluator in LAMMPS is the recursive c-tilde, but it can be too complicated. The B-evaluator is used for extrapolation grade calculations. The product c-tilde is simpler than the recursive c-tilde. I would suggest […]
Thanks. I'll take a look, probably next week.
I'm also wondering about adding OpenMP support to the existing Kokkos version. I'll investigate that too. [added] I emailed Stan Moore (cc'ing you) for clarification on the existing Kokkos implementation and why it's GPU-only.
It would be very useful for me if we could run PACE potentials with LAMMPS OpenMP support, either directly or via Kokkos (in general for small systems where domain decomposition is limited, or in my case mainly because I'm parallelizing over a set of small configurations with MPI, and having LAMMPS's MPI domain decomposition coexist with that would be hard).
I have no idea what that would entail, but I do have some experience coding OpenMP-parallelized interatomic potentials (although in Fortran, not C++, so fewer pointers). If it seems vaguely feasible to anyone who knows the code well, I'd be happy to discuss it and try to put something together. The basic idea we've used before is to have each thread loop over a subset of the atoms, then accumulate the energy and force contributions. I looked briefly at `pair_pace.cpp`, but since I'm not sure about the internals of `aceimpl` and its `ace` attribute, I'm not sure how one would go about sharing them across threads in the correct way (read-only bits shared, thread-specific bits private or threadprivate).

[edited] I just realized that this may be the wrong repo for this feature request, and if so, I apologize. Feel free to tell me, and I'll move it wherever it belongs best.