Instance <DbAuthInfo> is not bound to a Session error #6024
Comments
I just found that I encountered this before in #4596, and it was also reported by @sphuber in #1292. EDIT: According to what I reported in #4596, I need to restart not only the daemon but also the DB services. In any case, this is a very annoying issue that prevents me from running "real" high-throughput calculations: I have to use a submission control script to make sure no more than 10 workchains run at the same time. |
I am pretty sure you only need to reset the daemon, not the DB service. But I agree, this needs to be fixed. Let's continue the discussion in the other issue. |
The problem is I do |
You do |
@sphuber, I encountered it again and restarted the daemon cleanly; all the processes are back and working fine. Thanks! I guess you are correct: I probably had not made sure the daemon was fully restarted. |
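For anyone who needs the same workaround in the meantime, the submission control mentioned in the comments above could look roughly like the sketch below. This is not the script used in this thread. The names submit_throttled, submit_one and count_active are hypothetical, and the two callables stand in for however you submit one workchain and count the currently active ones in your own setup.

```python
import time
from typing import Callable, Iterable, List, TypeVar

T = TypeVar("T")
R = TypeVar("R")


def submit_throttled(
    items: Iterable[T],
    submit_one: Callable[[T], R],      # hypothetical: submits one workchain
    count_active: Callable[[], int],   # hypothetical: number of unfinished workchains
    max_active: int = 10,              # cap mentioned in the comment above
    poll_interval: float = 30.0,       # seconds between checks (assumed value)
    sleep: Callable[[float], None] = time.sleep,
) -> List[R]:
    """Submit items one by one, never exceeding max_active running at once."""
    results: List[R] = []
    for item in items:
        # Block until a slot frees up before submitting the next item.
        while count_active() >= max_active:
            sleep(poll_interval)
        results.append(submit_one(item))
    return results
```

The sleep function is passed in as a parameter so the loop can be exercised without actually waiting; in real use the default time.sleep is enough.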
Describe the bug
When there are more than 500 calcjobs in the process list, some processes quickly run into the exception from the title (Instance <DbAuthInfo> is not bound to a Session). Running verdi process play -a does not help.
Steps to reproduce
Steps to reproduce the behavior:
This only happened when I submitted 40 of my pseudopotential workchains, each of which spawns 100 small pw.x calculations. It is therefore not easy to reproduce from scratch. Interestingly, since my process list contains many processes in the paused state after the maximum of 5 attempts, I can reproduce it with and submit 10 of my workchains.
Expected behavior
Your environment
Other relevant software versions, e.g. Postgres & RabbitMQ
Additional context