Skip to content

Commit

Permalink
Merge pull request #140 from bc118/more_paper_editor_suggestsions
Browse files Browse the repository at this point in the history
Additional editor suggestions added to paper on the equations
  • Loading branch information
bc118 authored Dec 6, 2024
2 parents bfead16 + 424d6db commit 33dc0b5
Showing 1 changed file with 29 additions and 41 deletions.
70 changes: 29 additions & 41 deletions paper/paper.md
Original file line number Diff line number Diff line change
Expand Up @@ -53,7 +53,7 @@ affiliations:
- name: School of Engineering and Physical Sciences, Heriot-Watt University, Edinburgh EH14 4AS, United Kingdom
index: 6

date: 30 November 2024
date: 6 December 2024
bibliography: paper.bib

---
Expand All @@ -62,61 +62,49 @@ bibliography: paper.bib

Molecular Mechanics (MM) simulations (e.g., molecular dynamics and Monte Carlo) provide a third method of scientific discovery, adding to the traditional theoretical and experimental scientific methods [@Mielke:2019; @Siegfried:2014]. Experimental methods measure the data under set conditions (e.g., temperature and pressure), whereas the traditional theoretical methods are based on analytical equations, and sometimes their constants are fitted to experimental data. The MM simulations are deterministic and stochastic, and their models, commonly known as "force fields", can be optimized to match experimental data, similar to analytical theory-based methods [@Allen:2017; @Frekel:2002; @Jorgensen:1996; @Martin:1998; @Weiner:1984; @Weiner:1986; @Potoff:2009; @Hemmen:2015; @Errington:1999]. In larger, more complex systems, the stochastic simulation's molecules can jump large energy barriers that deterministic simulations may not be able to overcome in a reasonable timeframe, even with modern computing capabilities [@Allen:2017; @Frekel:2002]. However, deterministic and stochastic systems that provide adequate sampling for calculating a given property can provide critical insights into the system's phase space, which are not obtainable via traditional theoretical and experimental methods. Additionally, molecular simulations provide critical insights from visualizations and by obtaining chemical or material properties that do not currently exist, are not easily attainable (e.g., too expensive or dangerous) by traditional theoretical and experimental methods [@Hollingsworth:2018; @Hirst:2014], or require hard-to-achieve conditions, such as very high pressures and temperatures [@Yu:2023; @Koneru:2022; @Swai:2020; @Kumar:2022; @Louie:2021]. However, the force field parameters are ideally determined from Quantum Mechanics (QM) simulations or other methods, including the vibrational spectrum and machine learning methods [@Kania:2021; @Friederich:2018; @Vermeyen:2023; @Mayne:2013; @Schmid:2011; @Vanommeslaeghe:2014]. The MM proper dihedrals (i.e., dihedrals) are challenging to obtain if they do not currently exist for the chosen force field, inaccurately scale-up in larger molecules, or misbehave with other moiety combinations, provided some were separately derived using small molecules [@Kania:2021; @Mayne:2013]. While the same QM simulations can fit the dihedrals in most force field types, these dihedrals are not easily transferable between force fields due to the different parameters and formulas, including the combining rules and 1-4 scaling factors. [@Huang:2013; @Vanommeslaeghe:2010; @Vanommeslaeghe:2014; @Chen:2015].

The `MoSDeF-Dihedral-Fit` [@Crawford:2023b] library allows users to quickly calculate the MM dihedrals directly from the QM simulations for several force fields (OPLS, TraPPE, AMBER, Mie, and Exp6) [@Jorgensen:1996; @Martin:1998; @Weiner:1984; @Weiner:1986; @Potoff:2009; @Hemmen:2015; @Errington:1999]. The user simply has to generate or use an existing Molecular Simulation Design Framework (MoSDeF) force field `.xml` file [@Cummings:2021; @Summers:2020; @GMSO:2019; @forcefield-utilities:2022], provide Gaussian 16 `.log` or Gaussian-style QM simulation files that cover the dihedral rotation (typically between 0-360 degrees), and provide the molecular structure information in a `.mol2` format [@Gaussian16:2016]. The `MoSDeF-Dihedral-Fit` software uses the QM and MM data to produce the dihedral for the specific force field, fitting the constants for the OPLS dihedral form (equation \ref{eqn:oplseqn}) and then analytically converting them to the Ryckaert-Bellemans torsion (equation \ref{eqn:RBeqn}) and the periodic dihedral forms (equation \ref{eqn:periodiceqn}). This analytical conversion from the OPLS dihedral form requires setting the specified parameters in the Ryckaert-Bellemans torsion and periodic dihedral forms (see equations \ref{eqn:RBeqn} and \ref{eqn:periodiceqn}). The software outputs the calculated MM dihedral points, enabling users to fit unsupported dihedral forms, provided the force fields are supported by the MoSDeF, GPU Optimized Monte Carlo (GOMC), MoSDeF-GOMC [@Crawford:2023a; @Crawford:2022; @Crawford:2023b; @Nejahi:2019; @Nejahi:2021], and vmd-python [@vmd-python:2016] software (a derivative of the VMD software [@Humphrey:1996; @Stone:2001]).


# Statement of need

While many of these MM force field parameters can be transferred between force fields, such as bonds, angles, and improper dihedrals (often referred to as "impropers"), the proper dihedrals (dihedrals) can not be easily transferred due to the different combining rules (arithmetic and geometric) and 1-4 scaling factors (i.e., between the 1st and 4th bonded atoms) that were used in the development of the original parameters [@Berthelot:1898; @Good:1970; @Lorentz:1881]. The accuracy of these dihedral parameters is critical in obtaining the correct molecular conformations and configurations, which are required for understanding and analyzing the system's microstructure and physical properties (e.g., free energies, viscosities, adsorption loading, diffusion constants, and many more).

Some integrated dihedral fitting software currently exists for AMBER [@Horton:2022] or CHARMM-style force fields [@Mayne:2013], and other software will fit the dihedral constants to the final MM and QM energies, which need to be calculated by other means [@Guvench:2008]. However, there is a need for a simple, generalized software package that supports multiple potential functions, imports QM and MM files, automatically reads and organizes the QM data, calculates the MM energies, auto-corrects the dihedral fit to account for multiple instances of the dihedral, and automatically removes the unusable cosine power series combinations due to this symmetry. The `MoSDeF-dihedral-fit` software accomplishes all this and automatically accounts for any of the common combining rules and the 1-4 scaling factors specified via the MoSDeF `.xml` (i.e., force field) files [@Cummings:2021; @Summers:2020; @GMSO:2019; @forcefield-utilities:2022]. By allowing the user to set any other dihedral in the molecule to zero, this software avoids forcing one dihedral fit to correct the inaccurate forces of another dihedral, resulting in a problematic or bad cosine series fit; thus, providing a more flexible and accurate fit by combining multiple dihedral conformational energies in a single dihedral, a strategy used in the original and modern OPLS force fields [@Jorgensen:1996; @Chao:2021]. For example, a carboxylic acid with an alkyl tail has two dihedrals in the same rotation cycle; the C-C-C-O: (O: = oxygen without hydrogen) dihedral is set to zero while the C-C-O-H dihedral is fit [@Jorgensen:1996; @Chao:2021; @Ganesh:2004]. The `MoSDeF-dihedral-fit` [@Crawford:2023b] API fills the missing gap by providing a generalized and easy solution to fitting dihedrals in their commonly used forms and outputting the MM dihedral data points so users can fit other custom dihedral forms.

# Acknowledgements

This research was partially supported by the National Science Foundation (grants OAC-1835713, OAC-1835874, and CBET 2052438). Atomfold LLC also donated research and development time and computational resources for this research and software. Wayne State University Grid provided some of the computational resources used in this work.


# Mathematics

**Proper dihedral (dihedral) forms**

<u>OPLS dihedral</u>:

$$ U_{OPLS} = \frac{k_0}{2} $$

$$+ \frac{k_1}{2} * (1 + cos(\theta)) + \frac{k_2}{2} * (1-cos(2 * \theta))$$
The `MoSDeF-Dihedral-Fit` [@Crawford:2023b] library allows users to quickly calculate the MM dihedrals directly from the QM simulations for several force fields (OPLS, TraPPE, AMBER, Mie, and Exp6) [@Jorgensen:1996; @Martin:1998; @Weiner:1984; @Weiner:1986; @Potoff:2009; @Hemmen:2015; @Errington:1999]. The user simply has to generate or use an existing Molecular Simulation Design Framework (MoSDeF) force field `.xml` file [@Cummings:2021; @Summers:2020; @GMSO:2019; @forcefield-utilities:2022], provide Gaussian 16 `.log` or Gaussian-style QM simulation files that cover the dihedral rotation (typically between 0-360 degrees), and provide the molecular structure information in a `.mol2` format [@Gaussian16:2016]. The `MoSDeF-Dihedral-Fit` software uses the QM and MM data to produce the dihedral for the specific force field, fitting the constants for the OPLS dihedral form (equation \ref{eqn:oplseqn})

$$ U_{\text{OPLS}} = \frac{k_0}{2} $$
$$+ \frac{k_1}{2} * (1 + \cos(\theta)) + \frac{k_2}{2} * (1-\cos(2 * \theta))$$
\begin{equation}
+ \frac{k_3}{2} * (1 + cos(3 * \theta)) + \frac{k_4}{2} *(1-cos(4 * \theta))
+ \frac{k_3}{2} * (1 + \cos(3 * \theta)) + \frac{k_4}{2} *(1-\cos(4 * \theta))
\label{eqn:oplseqn}
\end{equation}

<u>Ryckaert-Bellemans torsions</u>:

$$U_{Ryckaert-Bellemans} = C_0$$
and then analytically converting them to the Ryckaert-Bellemans torsion (equation \ref{eqn:RBeqn})

$$+ C_1 * cos(\psi) + C_2 * cos(\psi)^2$$
$$U_{\text{Ryckaert-Bellemans}} = C_0$$
$$+ C_1 * \cos(\psi) + C_2 * \cos(\psi)^2$$
$$+ C_3 * \cos(\psi)^3 + C_4 * \cos(\psi)^4$$
\begin{equation}
\text{where: } \psi = \theta - 180^o
\label{eqn:RBeqn}
\end{equation}

$$+ C_3 * cos(\psi)^3 + C_4 * cos(\psi)^4$$
and the periodic dihedral forms (equation \ref{eqn:periodiceqn}).

$$U_{\text{Periodic}} = K_0 * (1 + \cos(n_0*\theta - d_0))$$
$$+ K_1 * (1 + \cos(n_1*\theta - d_1)) + K_2 * (1 + \cos(n_2*\theta - d_2))$$
$$+ K_3 * (1 + \cos(n_3*\theta - d_3)) + K_4 * (1 + \cos(n_4*\theta) - d_4)$$
$$\text{where: } n_0 = 0 ; n_1 = 1 ; n_2 = 2 ; n_3 = 3 ; n_4 = 4 $$
\begin{equation}
\psi = \theta - 180^o
\label{eqn:RBeqn}
d_0 = 90^o ; d_1 = 180^o ; d_2 = 0^o ; d_3 = 180^o ; d_4 = 0^o
\label{eqn:periodiceqn}
\end{equation}

<u>Periodic dihedral</u>:
This analytical conversion from the OPLS dihedral form requires setting the specified parameters in the Ryckaert-Bellemans torsion and periodic dihedral forms (see equations \ref{eqn:RBeqn} and \ref{eqn:periodiceqn}). The software outputs the calculated MM dihedral points, enabling users to fit unsupported dihedral forms, provided the force fields are supported by the MoSDeF, GPU Optimized Monte Carlo (GOMC), MoSDeF-GOMC [@Crawford:2023a; @Crawford:2022; @Crawford:2023b; @Nejahi:2019; @Nejahi:2021], and vmd-python [@vmd-python:2016] software (a derivative of the VMD software [@Humphrey:1996; @Stone:2001]).


# Statement of need

While many of these MM force field parameters can be transferred between force fields, such as bonds, angles, and improper dihedrals (often referred to as "impropers"), the proper dihedrals (dihedrals) can not be easily transferred due to the different combining rules (arithmetic and geometric) and 1-4 scaling factors (i.e., between the 1st and 4th bonded atoms) that were used in the development of the original parameters [@Berthelot:1898; @Good:1970; @Lorentz:1881]. The accuracy of these dihedral parameters is critical in obtaining the correct molecular conformations and configurations, which are required for understanding and analyzing the system's microstructure and physical properties (e.g., free energies, viscosities, adsorption loading, diffusion constants, and many more).

$$U_{Periodic} = K_0 * (1 + cos(n_0*\theta - d_0))$$
Some integrated dihedral fitting software currently exists for AMBER [@Horton:2022] or CHARMM-style force fields [@Mayne:2013], and other software will fit the dihedral constants to the final MM and QM energies, which need to be calculated by other means [@Guvench:2008]. However, there is a need for a simple, generalized software package that supports multiple potential functions, imports QM and MM files, automatically reads and organizes the QM data, calculates the MM energies, auto-corrects the dihedral fit to account for multiple instances of the dihedral, and automatically removes the unusable cosine power series combinations due to this symmetry. The `MoSDeF-dihedral-fit` software accomplishes all this and automatically accounts for any of the common combining rules and the 1-4 scaling factors specified via the MoSDeF `.xml` (i.e., force field) files [@Cummings:2021; @Summers:2020; @GMSO:2019; @forcefield-utilities:2022]. By allowing the user to set any other dihedral in the molecule to zero, this software avoids forcing one dihedral fit to correct the inaccurate forces of another dihedral, resulting in a problematic or bad cosine series fit; thus, providing a more flexible and accurate fit by combining multiple dihedral conformational energies in a single dihedral, a strategy used in the original and modern OPLS force fields [@Jorgensen:1996; @Chao:2021]. For example, a carboxylic acid with an alkyl tail has two dihedrals in the same rotation cycle; the C-C-C-O: (O: = oxygen without hydrogen) dihedral is set to zero while the C-C-O-H dihedral is fit [@Jorgensen:1996; @Chao:2021; @Ganesh:2004]. The `MoSDeF-dihedral-fit` [@Crawford:2023b] API fills the missing gap by providing a generalized and easy solution to fitting dihedrals in their commonly used forms and outputting the MM dihedral data points so users can fit other custom dihedral forms.

$$+ K_1 * (1 + cos(n_1*\theta - d_1)) + K_2 * (1 + cos(n_2*\theta - d_2))$$

$$+ K_3 * (1 + cos(n_3*\theta - d_3)) + K_4 * (1 + cos(n_4*\theta) - d_4)$$
# Acknowledgements

$$where: n_0 = 0 ; n_1 = 1 ; n_2 = 2 ; n_3 = 3 ; n_4 = 4 $$
This research was partially supported by the National Science Foundation (grants OAC-1835713, OAC-1835874, and CBET 2052438). Atomfold LLC also donated research and development time and computational resources for this research and software. Wayne State University Grid provided some of the computational resources used in this work.

\begin{equation}
d_0 = 90^o ; d_1 = 180^o ; d_2 = 0^o ; d_3 = 180^o ; d_4 = 0^o
\label{eqn:periodiceqn}
\end{equation}

# References

0 comments on commit 33dc0b5

Please sign in to comment.