Comments (8)
I generated temporary credentials manually using AssumeRole, and I believe it is working now.
Previously my node's role permissions included the following as per this policy:
"cloudwatch:GetMetricData"
"sqs:GetQueueAttributes"
"sqs:ReceiveMessage"
It was resolved by granting all read permissions on SQS:
"cloudwatch:GetMetricData"
"sqs:GetQueueAttributes"
"sqs:GetQueueUrl"
"sqs:ListDeadLetterSourceQueues"
"sqs:ListQueueTags"
"sqs:ListQueues"
"sqs:ReceiveMessage"
My restarted WPA no longer logs any errors.
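For anyone comparing the two permission sets above, here is a minimal sketch that treats them as plain data. The action names come from this comment; the helper names and the `Resource: "*"` value are assumptions for illustration and are not taken from the WPA docs.

```python
import json

# Actions the node role had before (from the comment above).
PREVIOUS_ACTIONS = [
    "cloudwatch:GetMetricData",
    "sqs:GetQueueAttributes",
    "sqs:ReceiveMessage",
]

# Actions after granting all SQS read permissions (from the comment above).
WORKING_ACTIONS = [
    "cloudwatch:GetMetricData",
    "sqs:GetQueueAttributes",
    "sqs:GetQueueUrl",
    "sqs:ListDeadLetterSourceQueues",
    "sqs:ListQueueTags",
    "sqs:ListQueues",
    "sqs:ReceiveMessage",
]


def added_actions(before, after):
    """Return the actions present in `after` but not in `before`, sorted."""
    return sorted(set(after) - set(before))


def build_policy(actions, resource="*"):
    """Build an IAM policy document; resource="*" is an assumption."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {"Effect": "Allow", "Action": list(actions), "Resource": resource}
        ],
    }


print(added_actions(PREVIOUS_ACTIONS, WORKING_ACTIONS))
print(json.dumps(build_policy(WORKING_ACTIONS), indent=2))
```

This only shows which actions were added. It does not tell you which of them the WPA actually needed.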
from k8s-worker-pod-autoscaler.
Possible to share the complete log?
from k8s-worker-pod-autoscaler.
Hi @alok87: please see the example included in my issue. The WPA starts spitting out "Unable to fetch no of messages" errors as soon as the container starts. There are no other kinds of log messages.
Note that I have anonymized the account number and queue name.
E1104 12:42:26.926463 1 sqs.go:406] Unable to fetch no of messages to the queue "queue", Client not found for queue: https://sqs.us-east-1.amazonaws.com/<account>/queue
.
E1104 12:42:26.926476 1 sqs.go:406] Unable to fetch no of messages to the queue "queue", Client not found for queue: https://sqs.us-east-1.amazonaws.com/<account>/queue
.
E1104 12:42:26.926498 1 sqs.go:406] Unable to fetch no of messages to the queue "queue", Client not found for queue: https://sqs.us-east-1.amazonaws.com/<account>/queue
.
E1104 12:42:26.926513 1 sqs.go:406] Unable to fetch no of messages to the queue "queue", Client not found for queue: https://sqs.us-east-1.amazonaws.com/<account>/queue
.
E1104 12:42:26.926527 1 sqs.go:406] Unable to fetch no of messages to the queue "queue", Client not found for queue: https://sqs.us-east-1.amazonaws.com/<account>/queue
.
E1104 12:42:26.926540 1 sqs.go:406] Unable to fetch no of messages to the queue "queue", Client not found for queue: https://sqs.us-east-1.amazonaws.com/<account>/queue
.
E1104 12:42:26.926552 1 sqs.go:406] Unable to fetch no of messages to the queue "queue", Client not found for queue: https://sqs.us-east-1.amazonaws.com/<account>/queue
from k8s-worker-pod-autoscaler.
Does the queue exist in SQS? Could you try using an SQS client with the same credentials and see whether data comes back?
I just want to rule out the possibility of a configuration issue first.
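One way to run that check is sketched below with boto3. This assumes boto3 is installed and that the default credential chain resolves to the same role the WPA uses; the function names are hypothetical, and the queue name and region are placeholders based on the anonymized log above.

```python
def approximate_message_count(sqs_client, queue_name):
    """Look up the queue URL, then read its approximate visible message count."""
    url = sqs_client.get_queue_url(QueueName=queue_name)["QueueUrl"]
    attrs = sqs_client.get_queue_attributes(
        QueueUrl=url,
        AttributeNames=["ApproximateNumberOfMessages"],
    )["Attributes"]
    # SQS returns attribute values as strings.
    return int(attrs["ApproximateNumberOfMessages"])


def make_client(region="us-east-1"):
    """Create an SQS client from the default credential chain (needs boto3)."""
    import boto3  # imported lazily so the helper above can be tested without it

    return boto3.client("sqs", region_name=region)


# Example (requires network access and valid credentials):
#   print(approximate_message_count(make_client(), "queue"))
```

If this call fails with an access error, the role's permissions are the likely cause. If it returns a count, the credentials can read the queue and the problem is more likely in the WPA configuration.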
from k8s-worker-pod-autoscaler.
Can we close this?
from k8s-worker-pod-autoscaler.
Do you think we should update the policy section of the docs here? https://github.com/practo/k8s-worker-pod-autoscaler#install
from k8s-worker-pod-autoscaler.
I feel like there may be something else missing.
Even though I get no permissions errors, I am unable to trigger a scaling operation on the deployment. Any ideas?
I have 10000+ messages in the queue and only one deployment pod running.
k get pods
NAME READY STATUS RESTARTS AGE
example-deployment-795d868d4-8nzfv 1/1 Running 0 7m19s
Does the WPA require some kind of write or tag attributes?
I can submit a PR for the documentation once I confirm this is working.
from k8s-worker-pod-autoscaler.
WPA supports log verbosity; maybe try that with -v=4.
- Also share the output of the WPA YAML:
k get wpa -o yaml <wpa_object>
- Check whether the deployment replicas changed with the queue length.
- Check whether the queue length in AWS shows the 1000 messages. A screenshot of the SQS metrics posted here would help.
from k8s-worker-pod-autoscaler.