pair_e3gnn

We support a LAMMPS pair style, d3, that implements Grimme's D3 dispersion (van der Waals) correction scheme accelerated with CUDA; it can be used within LAMMPS in conjunction with SevenNet.

PLEASE NOTE: This D3 code does not yet support multi-GPU parallelism, so it can only be run on a single GPU.

About Grimme's D3 code accelerated with CUDA

This is a LAMMPS implementation of Grimme's D3 method. We ported the original Fortran code to a LAMMPS pair style written in CUDA/C++.

While the D3 method is significantly faster than DFT, existing CPU implementations were slower than SevenNet. To address this, we adopted CUDA and single-precision (FP32) operations to accelerate the code.

Installation for LAMMPS

Simply run,

sevenn_patch_lammps ./lammps_sevenn --d3

You can follow the remaining installation steps in the SevenNet documentation.
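
The remaining steps are the standard LAMMPS CMake build described in the SevenNet documentation. As a rough sketch (the build directory, CMake options, and job count below are illustrative and may differ from your setup):

cd lammps_sevenn
mkdir build && cd build
# point CMake at the LibTorch shipped with your PyTorch installation
cmake ../cmake -D CMAKE_PREFIX_PATH=$(python -c 'import torch; print(torch.utils.cmake_prefix_path)')
make -j 4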

Also, this code requires a GPU with a compute capability of at least 6.0. If you try to compile it for compute capability 5.0, you may encounter an atomicAdd error.

The target compute capabilities of this code follow the LibTorch settings used by SevenNet, except that 5.0 is excluded.

You can manually select the target capability using the TORCH_CUDA_ARCH_LIST environment variable. For example, you can use: export TORCH_CUDA_ARCH_LIST="6.1;7.0;8.0;8.6;8.9;9.0".
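
If you are unsure of your GPU's compute capability, you can query it through PyTorch (assuming PyTorch is installed in the active environment):

python -c "import torch; print(torch.cuda.get_device_capability())"

This prints a tuple such as (8, 6), which corresponds to 8.6 in the TORCH_CUDA_ARCH_LIST notation.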

Usage for LAMMPS

You can use the D3 dispersion correction in LAMMPS together with SevenNet through the hybrid/overlay pair style:

pair_style hybrid/overlay e3gnn d3 {cutoff_d3_r} {cutoff_d3_cn} {type_of_damping} {name_of_functional}
pair_coeff * * e3gnn {path_to_serial_model} {space_separated_chemical_species}
pair_coeff * * d3 {space_separated_chemical_species}

for example,

pair_style hybrid/overlay e3gnn d3 9000 1600 damp_bj pbe
pair_coeff * * e3gnn ./deployed_serial.pt C H O
pair_coeff * * d3 C H O
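
Embedded in a minimal input script, the same pair settings might look as follows. This is only a sketch: the data file name, thermostat parameters, and run length are illustrative, and the SevenNet documentation covers any additional settings (e.g., atom_modify) required by its pair styles.

units metal
boundary p p p
atom_style atomic
read_data structure.data    # hypothetical data file with C, H, O atom types

pair_style hybrid/overlay e3gnn d3 9000 1600 damp_bj pbe
pair_coeff * * e3gnn ./deployed_serial.pt C H O
pair_coeff * * d3 C H O

timestep 0.001
fix 1 all nvt temp 300.0 300.0 0.1
run 1000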

cutoff_d3_r and cutoff_d3_cn are the squares of the cutoff radii for energy/force and coordination number, respectively, in units of the Bohr radius squared (1 Bohr radius = 0.52917721 Å). The default values are 9000 and 1600, respectively; these are also the default values used in VASP.1
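
For reference, √9000 ≈ 94.87 Bohr ≈ 50.20 Å and √1600 = 40 Bohr ≈ 21.17 Å, matching the VASP OUTCAR defaults noted in the footnote below.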

The available type_of_damping options are as follows:

  • damp_zero: Zero damping
  • damp_bj: Becke-Johnson damping

Available name_of_functional options are the same as in the original Fortran code. SevenNet-0 is trained on the PBE functional, so you should specify 'pbe' in the script when using it. For other supported functionals, check the 'List of parametrized functionals' here.

Features

  • Selective (or no) periodic boundary conditions: implemented. However, only full-PBC/no-PBC results can be checked against the original Fortran code; selective PBC cannot.
  • Three-body term and n > 8 terms: not implemented (as in VASP).
  • Modified versions of zero and BJ damping

Cautions

  • It can be slower than a CPU implementation for small numbers of atoms.
  • The maximum number of atoms that can be calculated is 46,340 (a 32-bit integer overflow issue).
  • Small numerical errors can occur:
    • The use of some FP32 operations can lead to minor numerical errors, particularly in pressure calculations, but these are generally smaller than those seen with SevenNet.
    • If the error is too large, ensure that the fmad=false option in patch_lammps.sh is correctly applied during the build.

To do

  • Remove atom_modify / compute virial dependency.
  • Add support for ASE as a calculator interface.
  • Add support for multiple GPUs (with e3gnn/parallel).
  • Implement without Unified Memory.
  • Make threadsPerBlock (currently fixed at 128) configurable.
  • Unroll the repetition loop over k (for small numbers of atoms).

Contributors

  • Hyungmin An: Ported the original Fortran D3 code to C++ with OpenMP and MPI.
  • Gijin Kim: Accelerated the C++ D3 code with OpenACC2 and CUDA, and currently maintains it.

Footnotes

  1. On the VASP DFT-D3 page, VDW_RADIUS and VDW_CNRADIUS are listed as 50.2 and 20.0 Å, respectively. However, when running VASP 6.3.2 with D3 using zero damping (BJ damping does not print such a log), the default values reported in the OUTCAR file are 50.2022 and 21.1671 Å. These values are the same as our defaults.

  2. Since OpenACC is not compatible with LibTorch, we chose to use CUDA.