-
Notifications
You must be signed in to change notification settings - Fork 57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Optimize the fit gradients #729
Conversation
09def4b
to
2da467b
Compare
@HanatoK We don't currently have a CI job with Intel, but that compiler uses |
Thanks! I will try using the same way to apply __restrict. |
Awesome! I added an Intel oneAPI CI job in the meantime, which is overkill for just this PR but definitely was a missing feature otherwise. |
643ed9e
to
3de535c
Compare
I think it is better to document how the fit gradients are computed so I tried my best to improve the doxygen documentation in 2d353eb. I know the SI of the new Colvars paper includes the computation but that is too simplified and not quite informative. |
Wikipedia says "__restrict" is recognized by all three compilers. Let me try it.
2d353eb
to
8e0b9a2
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice optimization I rebased and made tiny cosmetic changes, and I have one question. Apart from that, this is ready to merge IMO.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great, apart from the speedup the code is also getting more readable.
@@ -1420,7 +1420,7 @@ FORMULA_TRANSPARENT = YES | |||
# The default value is: NO. | |||
# This tag requires that the tag GENERATE_HTML is set to YES. | |||
|
|||
USE_MATHJAX = NO | |||
USE_MATHJAX = YES |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Awesome, thanks!
calc_fit_forces_impl
;cvm::quaternion::position_derivative_inner
.This PR should be only marked as ready after #713 is merged.