Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CKB Process Occasionally Fails to Terminate #4607

Closed
sunchengzhu opened this issue Aug 23, 2024 · 3 comments
Closed

CKB Process Occasionally Fails to Terminate #4607

sunchengzhu opened this issue Aug 23, 2024 · 3 comments
Labels
t:bug Type: This doesn't seem right.

Comments

@sunchengzhu
Copy link

sunchengzhu commented Aug 23, 2024

Bug Report

Current Behavior

I restart CKB every two hours and I've noticed that there are occasional issues with missing metrics data.

Terminate command:

killckb() {
    PIDS=$(sudo lsof -ti:8114)
    for i in $PIDS; do
        echo "killed the ckb $i"
        sudo kill $i
    done
}

killckb
image image ### Expected Behavior metrics data is not lost

Environment

  • CKB version: ckb 0.118.0-rc1 (63ae338 2024-08-19)
  • Chain: mainnet
  • Operating system: Ubuntu 20.04.6 LTS
  • Arch: x64

Additional context/Screenshots

My ckb.toml configuration:

[metrics.exporter.prometheus]
target = { type = "prometheus", listen_address = "0.0.0.0:8100" }

# # Experimental: Monitor memory changes.
[memory_tracker]
# # Seconds between checking the process, 0 is disable, default is 0.
interval = 5
@sunchengzhu sunchengzhu added the t:bug Type: This doesn't seem right. label Aug 23, 2024
@15168316096
Copy link
Contributor

This is an intermittent issue.

The situation you described indicates that other nodes running ckb v118 did not experience missing metrics during the same time period. If the data were not collected from the source, it would suggest that the ckb process was terminated. However, based on the ckb logs we reviewed earlier, it shows that the service restarted successfully and continued to sync normally.

Therefore, it is more likely that the issue occurred when Prometheus was scraping data from ckb.

@sunchengzhu sunchengzhu changed the title Metrics data may be lost after CKB restarts CKB Process Occasionally Fails to Terminate Aug 27, 2024
@sunchengzhu
Copy link
Author

From the run.log, it can be seen that the CKB process did not exit promptly:

  1. Received the Ctrl-C signal at 15:20.
  2. At 15:25, an attempt to start CKB resulted in an ERROR.
  3. The CKB shutdown log was not seen until 15:26.

@sunchengzhu
Copy link
Author

This issue has been verified and resolved in PR 4615.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
t:bug Type: This doesn't seem right.
Projects
None yet
Development

No branches or pull requests

2 participants