-
Notifications
You must be signed in to change notification settings - Fork 728
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cmdLineTester_criu_jitserverPostRestore zLinux assertion JITServerPersistentCHTable.cpp:172 #17474
Comments
https://openj9-jenkins.osuosl.org/job/Test_openjdk17_j9_sanity.functional_s390x_linux_OMR_testList_1/345
|
@dsouzai ^^ |
Because of this message:
It looks like the crash is on the JITserver instance, and not on the restored JVM (a crash in the restored JVM wouldn't bring down the jitserver instance). @mpirvu do you have any ideas as to what could be going on here? |
I have seen that message when the client was not generating a new clientUID after restore. From the diagnostic data I cannot tell the UID of the client before the checkpoint and after the restore to determine that they are different. |
Why would that cause the JITServer instance to go down? Shouldn't it just reject the client? |
The server thinks it's the same client and that it encountered a bug. Rather than reject the client and hide the bug, we prefer to fatal assert. |
Because this message
is coming from the jitserver, if it did fatal assert because the client failed to generate a new UID, we should see that assert in the output. However, maybe what I'll do is update the tests to have better separation of the the various things that get |
https://openj9-jenkins.osuosl.org/job/Test_openjdk17_j9_sanity.functional_s390x_linux_Nightly_testList_0/509
|
JDK21 s390x_linux(
|
https://openj9-jenkins.osuosl.org/job/Test_openjdk22_j9_sanity.functional_s390x_linux_Release_testList_1/4
|
JDK17 s390x_linux(
|
https://openj9-jenkins.osuosl.org/job/Test_openjdk22_j9_sanity.functional_aarch64_linux_Release_testList_0/11 |
@dsouzai Is this something we're looking at for the 0.48 release? |
No; I spent some time a few months ago looking at this but couldn't make much progress. It's fairly intermittent and seems to only happen with JITServer + CRIU. |
https://openj9-jenkins.osuosl.org/job/Test_openjdk11_j9_sanity.functional_s390x_linux_Release_testList_0/122
|
https://openj9-jenkins.osuosl.org/job/Test_openjdk23_j9_sanity.functional_ppc64le_linux_Nightly_testList_0/41/
|
https://openj9-jenkins.osuosl.org/job/Test_openjdk21_j9_sanity.functional_ppc64le_linux_OMR_testList_1/85\
|
@dsouzai, should this move out to the 0.51 release? |
Yes, in my opinion. |
Yeah I agree, it's an issue that only seems to happen on zLinux with JITServer + CRIU. I've tried looking into it a while back but it's so intermittent that I haven't been able to make much progress. |
Failure link
From an internal build(
rhel7s390x-4-1
):Rerun in Grinder - Change TARGET to run only the failed test targets.
Optional info
Failure output (captured from console output)
50x internal grinder - all passed
FYI @tajila
The text was updated successfully, but these errors were encountered: