Comments (4)
Are you able to update the description to supply more context about the specific load balancer/health check setup that is required to cause this behaviour in an EKS context?
Personally, in my setup, we don't observe this behaviour while using the tool, but we use external NLBs with basic TCP checks on NodePort Services for the ingress, so perhaps that is where things differ. Does this require use of LoadBalancer Services to cause an issue?
This will help provide more useful feedback on the PR, especially regarding documentation that might indicate to users which strategy is the right one for them to use.
from eks-rolling-update.
I observed this issue when using an ELB, because the service controller removes the cordoned nodes from the load balancer. However, this may not cause downtime, as the script scales up the ASG before cordoning the nodes, and routes to pods are updated on all the new nodes.
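To make that concrete, here is a rough Python model of the behaviour described in this comment. It is only a sketch, not code from eks-rolling-update or the Kubernetes service controller: the `Node` type and `lb_backends` function are hypothetical names, and it assumes the controller drops every cordoned (unschedulable) node from the load balancer.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    unschedulable: bool = False  # True once the node has been cordoned


def lb_backends(nodes):
    """Names of nodes the load balancer still targets.

    Assumption: every cordoned node is removed from the load balancer.
    """
    return [n.name for n in nodes if not n.unschedulable]


old = [Node("old-1"), Node("old-2")]
new = [Node("new-1"), Node("new-2")]

# Cordon every outdated node at once.
for node in old:
    node.unschedulable = True

# Without a prior scale-up, no backend is left.
print(lb_backends(old))

# With the ASG scaled up first, the new nodes keep serving.
print(lb_backends(old + new))
```

In this model, cordoning only hurts when no uncordoned node is registered yet, which matches the point that scaling up the ASG first may avoid downtime.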
We experienced this same issue. It's especially acute if you use RUN_MODE=2 and externalTrafficPolicy: Local on your services. You sometimes get lucky, since traffic gets routed to all nodes if none of the nodes are available. But if you have a lot of traffic and only one node in service, you can saturate that node very quickly.
It's much better to use run modes that only cordon one AZ at a time.
The proposed fix seems like a good one though, so I'm for some variant of #49 being merged.
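The saturation scenario in this comment can be sketched in a few lines of Python. This is a simplified model under stated assumptions, not the real load balancer logic: with the `Local` traffic policy a node is assumed to pass health checks only if it is uncordoned and runs at least one pod of the service, and the load balancer is assumed to fail open (send traffic to every node) when no node is healthy. The function names and the node data are hypothetical.

```python
def serving_nodes(nodes):
    """nodes maps node name -> (cordoned, local_pod_count)."""
    healthy = [
        name
        for name, (cordoned, pods) in nodes.items()
        if not cordoned and pods > 0
    ]
    # Assumed fail-open behaviour: with no healthy node, all nodes get traffic.
    return healthy if healthy else list(nodes)


def requests_per_node(nodes, total_rps):
    """Split total_rps evenly across the nodes that receive traffic."""
    targets = serving_nodes(nodes)
    return {name: total_rps / len(targets) for name in targets}


# All old nodes cordoned at once; only one new node has a pod so far.
mid_rollout = {
    "old-1": (True, 2),
    "old-2": (True, 2),
    "new-1": (False, 1),
    "new-2": (False, 0),
}
print(requests_per_node(mid_rollout, 900))

# Every node cordoned: the "lucky" case where traffic goes to all nodes.
all_cordoned = {"old-1": (True, 2), "old-2": (True, 2)}
print(requests_per_node(all_cordoned, 900))
```

In the first case the single in-service node absorbs the whole load, while in the second the fail-open assumption spreads it across both cordoned nodes.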
Resolved by #49