You reached a point in kubergrunt that should not happen and is almost certainly a bug

Thanks for reporting this bug. Can you share: Are you using th

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

This should be fixed in <a href="https://github.com/gruntwork-io/kubergrunt/releases/t

ELB_HAS_UNEXPECTED_NUMBER_OF_TARGET_GROUPS about kubergrunt HOT 7 CLOSED

gruntwork-io commented on May 28, 2024

ELB_HAS_UNEXPECTED_NUMBER_OF_TARGET_GROUPS

from kubergrunt.

Comments (7)

yorinasub17 commented on May 28, 2024 1

Thanks for the info! This should be enough for me to try to repro this and resolve this. I should be able to get working on this today if not tomorrow, and hopefully have a fix soon after.

from kubergrunt.

yorinasub17 commented on May 28, 2024

Thanks for reporting this bug. Can you share:

Are you using the kubernetes in-tree controller (type nlb) or the AWS LoadBalancer controller (type external) for the LoadBalancer Service?
Are you using instance mode (default) or ip mode?
Do you have more than one LoadBalancer Service using NLBs?

Once I have this information, I'll start working on it to repro and investigate a fix.

from kubergrunt.

mraslam commented on May 28, 2024

@yorinasub17

Thanks for the quick follow up.

Are you using the kubernetes in-tree controller (type nlb) or the AWS LoadBalancer controller (type external) for the LoadBalancer Service?
Internal NLB
Are you using instance mode (default) or ip mode?
Default
Do you have more than one LoadBalancer Service using NLBs?
Yes

from kubergrunt.

mraslam commented on May 28, 2024

FYI I tried manually increasing the desired ec2 count and draining the node but faced two issues. Could you please provide your insight on this.

We have pvc for some of our apps and it throwing an error saying volume affinity not matched since we do not control the AZ when new instances are created by auto scalar. Ex: New node is assigned in us-east-1b but the drained node and its pvc is in us-east-1a.
Some of the pods wont delete due to pod disruption budget.

from kubergrunt.

yorinasub17 commented on May 28, 2024

We have pvc for some of our apps and it throwing an error saying volume affinity not matched since we do not control the AZ when new instances are created by auto scalar. Ex: New node is assigned in us-east-1b but the drained node and its pvc is in us-east-1a.

Ah this is unfortunately a known issue from the k8s community with using EBS based PVs and ASGs. You need to actually make sure your ASGs are isolated to a single AZ to resolve this. See the note in cluster-autoscaler that highlights this (pasted below for convenience)

If you’re using Persistent Volumes, your deployment needs to run in the same AZ as where the EBS volume is, otherwise the pod scheduling could fail if it is scheduled in a different AZ and cannot find the EBS volume. To overcome this, either use a single AZ ASG for this use case, or an ASG-per-AZ while enabling --balance-similar-node-groups. Alternately, and depending on your use-case, you might be able to switch from using EBS to using shared storage that is available across AZs (for each pod in its respective AZ). Consider AWS services like Amazon EFS or Amazon FSx for Lustre.

from kubergrunt.

yorinasub17 commented on May 28, 2024

Some of the pods wont delete due to pod disruption budget.

Is the PDB set above or equal to the number of replicas in the deployment? If so then yes there is no way to drain the nodes. You need to either add more replicas in the deployment so it is above the PDB, or decrease the PDB.

If not, then usually you can resolve this by doing a rolling update (draining 1 node at a time for small clusters, or N nodes for larger clusters where N is less than the number of nodes currently active).

from kubergrunt.

yorinasub17 commented on May 28, 2024

This should be fixed in https://github.com/gruntwork-io/kubergrunt/releases/tag/v0.7.5 (binaries should show up in 15-30 mins).

from kubergrunt.

ELB_HAS_UNEXPECTED_NUMBER_OF_TARGET_GROUPS about kubergrunt HOT 7 CLOSED

Comments (7)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent