-
Notifications
You must be signed in to change notification settings - Fork 102
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
site unroll not supported for nSpin = 2 nColor = 32
in coarse-grid-deflated MG
#1378
Comments
Note that everything works fine when I disable coarse-grid deflation. The failure seems to occur during the launch of the eigensolver:
|
This was meant to read Note :) |
Some more context:
and the setup process seems to work fine:
where I used |
|
Increasing the level of verbosity step-by-step, in particular increasing the verbosity on level 1, reveals some more details on where this is failing (likely because of buffers being flushed more frequently):
At least from the last few lines here it appears that the issue occurs in the eigensolver, running with a global |
After adding a manual debug statement I've figured out that the issue comes from here: Lines 152 to 175 in 68d7d20
In particular, the issue seems to be with
Lines 40 to 58 in 68d7d20
as far as I can tell. I'm wondering if the check in Lines 72 to 77 in 68d7d20
is warranted if the eigensolver (and hence
and there are no changes in multigrid.cpp which seem to suggest that anything was changed... |
Hi @kostrzewa. This issue looks like a precision one I think: I don't think we should ever be using half precision on the coarse grids here. Can you enable FWIW: the "site unrolling" refers to the fact that the entire site (all spin and color for a given site in spacetime) is handled by a single thread. |
Thanks for this hint! Setting all I'm aware of course that the Wiki page will be three years old in two weeks, so it might well have grown inconsistent. For example,
Will do and report back, hopefully soon. |
Thanks. How come this is being done on the coarsest grid? |
very useful, will certainly use backward-cpp in the future!
|
This just looks like the wiki pages have grown stale: the eigenvector precision option was added after they were written. So we have five precisions to worry about now:
So in general one would want to use a I will update the wiki to fix this deficit, and apologies for this incongruity between the wiki and the code. Glad you find the |
This is an issue which we encountered already quite some time ago but haven't had time to report yet.
When running coarse-grid-deflated MG from within tmLQCD using a relatively "recent" commit of QUDA's develop branch (32bb266) I encounter:
I know that switching to much older commits "solves" this, so that's something we can explore if necessary (I don't know how compatible the current version of our interface is with these older QUDA versions).
I'm testing with higher verbosity to see what's going on but perhaps you might already have a change in mind from the past couple of months which could have caused this?
The text was updated successfully, but these errors were encountered: