Polaris thread test failing on Perlmutter-CPU with Intel #6515
Comments
@mark-petersen and @hyungyukang, do you have any ideas about why #6035 could be causing threading to be non-BFB? My hunch would be that it's some weird order of operations difference or something like that. While I know things are busy with Omega, this seems worth tracking down sooner or later because this could also affect production runs with E3SM on Perlmutter with Intel. I will run some thread testing with E3SM to see.
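For readers unfamiliar with how an order-of-operations difference can break BFB, here is a minimal Python sketch. It is not MPAS-Ocean code: the function `chunked_sum` and the input values are hypothetical, chosen only to show that splitting a floating-point sum into per-thread partial sums can change the last bit of the result.

```python
def chunked_sum(values, n_chunks):
    """Sum `values` the way a simple threaded reduction might:
    each chunk is accumulated left to right, then the partial
    sums are combined left to right. (Hypothetical helper.)"""
    chunk_size = -(-len(values) // n_chunks)  # ceiling division
    partials = []
    for start in range(0, len(values), chunk_size):
        partial = 0.0
        # Explicit loop on purpose: the built-in sum() may use
        # compensated summation and hide the effect.
        for v in values[start:start + chunk_size]:
            partial += v
        partials.append(partial)
    total = 0.0
    for p in partials:
        total += p
    return total


# Illustrative values only: three terms each smaller than half an ulp of 1.0.
values = [1.0, 1e-16, 1e-16, 1e-16]

one_thread = chunked_sum(values, 1)   # small terms are absorbed one at a time
two_threads = chunked_sum(values, 2)  # two small terms are added together first

print(one_thread == two_threads)      # False
print(abs(one_thread - two_threads))  # one ulp of 1.0, about 2.2e-16
```

Both results are equally valid roundings of the same mathematical sum, which is why a difference like this shows up at machine precision rather than as a visible error.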
@xylar, I'll take a look and run some tests. I agree with your intuition, but just to be sure, I was wondering if you had a chance to run the same tests with the GNU (or NVIDIA, Cray) compiler.
The test passes with GNU. NVIDIA isn't yet supported.
I tried
I'm seeing the same issue in Polaris on Compy with Intel and Intel-MPI (so it seems to be a problem with Intel, but not with Intel on every machine).
As reported in E3SM-Project/polaris#205, we are seeing failures in the Polaris test:
when running on Perlmutter-CPU with Intel. Differences between runs with 1 and 2 threads are at machine precision but not zero.
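To make "at machine precision but not zero" concrete, here is a small Python sketch of a bit-for-bit comparison. It is not how Polaris performs its check: `compare_bfb` is a hypothetical helper and the two lists are made-up stand-ins for a field from the 1-thread and 2-thread runs.

```python
import struct


def float_bits(x):
    """Return the raw 64-bit pattern of a double as an integer."""
    return struct.unpack("<Q", struct.pack("<d", x))[0]


def compare_bfb(a, b):
    """Compare two equal-length sequences of floats bit for bit.

    Returns (number of entries whose bits differ, largest relative
    difference among them). Hypothetical helper for illustration;
    NaN handling is deliberately left out.
    """
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    n_diff = 0
    max_rel = 0.0
    for x, y in zip(a, b):
        if float_bits(x) != float_bits(y):
            n_diff += 1
            scale = max(abs(x), abs(y))
            if scale > 0.0:
                max_rel = max(max_rel, abs(x - y) / scale)
    return n_diff, max_rel


# Made-up data: the last entry differs by one unit in the last place.
one_thread = [1.0, 0.5, 0.6]
two_threads = [1.0, 0.5, 0.6000000000000001]

n_diff, max_rel = compare_bfb(one_thread, two_threads)
print(n_diff, max_rel)  # 1 entry differs, relative difference near 2e-16
```

A result with `n_diff > 0` fails a BFB test even though `max_rel` is on the order of double-precision round-off.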
I used git bisect to determine that the PR that causes this to emerge is #6035, which set new weights for the split-explicit time stepping. This thread test still passes with the previous PR, #5989, which introduced the Adams-Bashforth 2nd order time stepping scheme.