feat: core service ready/liveness probes to be configurable #1855
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Recently ran into an issue on our cluster that has a lot of churn (we use karpenter and leverage spot instances pretty heavily). For some reason there are points when a new node in the cluster comes up and the AWS VPC CNI resyncs we see a bit of latency. I couldn't quite figure out why. The 1 second timeout on the readiness probe procs fails quite a bit causing noise in our events monitoring as well as sometimes restarting the container when it seems otherwise healthy.
The only way I found to solve the issue is to increase the
timeoutSeconds
.Let me know what you all think.
Cheers