79659532

Date: 2025-06-09 21:13:07
Score: 2.5
Natty:
Report link

βœ… Best Fix:
In many cases, the issue is due to missing essential EKS add-ons. The first thing to try:

πŸ’‘ Install these EKS add-ons via AWS Console or CLI:

This resolves common causes like NetworkPluginNotReady or nodes stuck in NotReady state.


πŸ§ͺ Still facing issues? Here's a structured troubleshooting guide:

1. Check Node Status

Run:

kubectl get nodes

Look for nodes in states like NotReady.


2. Inspect Node Conditions

For detailed info on an unhealthy node:

kubectl describe node <node-name>

Check for messages like CNI plugin failures, disk pressure, or kubelet issues.


3. Investigate Pod Status on That Node

kubectl get pods --all-namespaces -o wide --field-selector spec.nodeName=<node-name>

This helps pinpoint pods causing resource issues or crashes.


4. Review Node Logs

If needed, SSH into the EC2 instance and check logs:


5. Check Resource Utilization

High CPU, memory, or disk usage can make nodes unhealthy:

kubectl top nodes

6. Common Root Causes

a) Resource Pressure

b) CNI/Networking Issues

c) Kubelet or Runtime Failures

d) IAM Role Misconfigurations

Ensure your NodeGroup has:


7. Consider Node Replacement

If the node doesn't recover:


8. Update Node Group AMI

If you're using a newer EKS version (e.g., v1.30), use the compatible Amazon Linux 2 AMI. Amazon Linux 2023 often causes issues with CNI and kubelet.


9. Enable Auto Repair (Optional)

Enable Node Auto Repair in your EKS settings for future automatic recovery.


πŸ“Œ Bonus: What helps most is context.

If you’re still stuck, please share:


πŸ”— References:

Reasons:
  • RegEx Blacklisted phrase (2.5): please share
  • Long answer (-1):
  • Has code block (-0.5):
  • Contains question mark (0.5):
  • Low reputation (1):
Posted by: Tariq's Hacks