Add a default systemReserved configuration to the kubelet-config.json #1490

reegnz · 2023-10-26T10:37:02Z

What would you like to be added:

The default kubelet should configure dedicated systemReserved cpu and memory.

Something like the following should be added to the kubelet-config.json:

"systemReserved": {
  "cpu": "50m",
  "memory": "128Mi"
}

Why is this needed:

We are experiencing nodes going from Ready to NotReady when the node has high memory pressure caused by pods with unset memory limits.
Expectation is that the kubelet kills the pods if they over-allocate, or if other pods arrive with requests and the over-committed pods should then get evicted by the kubelet.
Instead in these high memory pressure cases the entire node seems to die when using default EKS AMI configuration. Kubelet doesn't report back to the API server, and we also cannot connect to the nodes with SSM or Instance Connect.

Adding systemReserved configuration through --kubelet-extra-args might be an acceptable workaround, but it seems like something that should be configured by default on the nodes, so that even if the kubelet becomes unresponsive, services in the system.slice keep working so one can go and troubleshoot the node.

The text was updated successfully, but these errors were encountered:

cartermckinnon linked a pull request May 16, 2024 that will close this issue

[WIP] feat(nodeadm): Add default system reserved resources #1808

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a default systemReserved configuration to the kubelet-config.json #1490

Add a default systemReserved configuration to the kubelet-config.json #1490

reegnz commented Oct 26, 2023 •

edited

Loading

Add a default systemReserved configuration to the kubelet-config.json #1490

Add a default systemReserved configuration to the kubelet-config.json #1490

Comments

reegnz commented Oct 26, 2023 • edited Loading

reegnz commented Oct 26, 2023 •

edited

Loading