Comments (9)
The CMO image on quay.io is updated after each merge to master. At the time of writing, the image with the tag latest
was updated 3 hours ago.
Source: https://quay.io/repository/openshift/origin-cluster-monitoring-operator?tab=tags
from cluster-monitoring-operator.
Yes, exactly. The operator will reconcile every few seconds and override those changes. What would you like to change for the taint to be recognized?
from cluster-monitoring-operator.
I was thinking you would just add it to the cluster-monitoring-config under the node-exporter section, similar to the node selector sections, but for tolerations.
For example:
prometheusOperator:
  baseImage: registry.redhat.io/openshift3/ose-prometheus-operator
  prometheusConfigReloaderBaseImage: registry.redhat.io/openshift3/ose-prometheus-config-reloader
  configReloaderBaseImage: registry.redhat.io/openshift3/ose-configmap-reloader
  nodeSelector:
    node-role.kubernetes.io/infra: "true"
prometheusK8s:
  baseImage: registry.redhat.io/openshift3/prometheus
  nodeSelector:
    node-role.kubernetes.io/infra: "true"
  externalLabels:
    cluster: console-a.openshift.example.com
alertmanagerMain:
  baseImage: registry.redhat.io/openshift3/prometheus-alertmanager
  nodeSelector:
    node-role.kubernetes.io/infra: "true"
nodeExporter:
  baseImage: registry.redhat.io/openshift3/prometheus-node-exporter
  tolerations:
  - key: "example-taint"
    operator: "Equal"
    value: "true"
    effect: "NoSchedule"
grafana:
  baseImage: registry.redhat.io/openshift3/grafana
  nodeSelector:
    node-role.kubernetes.io/infra: "true"
kubeStateMetrics:
  baseImage: registry.redhat.io/openshift3/ose-kube-state-metrics
  nodeSelector:
    node-role.kubernetes.io/infra: "true"
kubeRbacProxy:
  baseImage: registry.redhat.io/openshift3/ose-kube-rbac-proxy
auth:
  baseImage: registry.redhat.io/openshift3/oauth-proxy
Or ignore taints altogether for the node-exporter, if that's somehow easily achievable and won't cause problems (this could be a bad idea, though; I'm not sure).
from cluster-monitoring-operator.
I believe node-exporter has to tolerate any taints:
tolerations:
- operator: "Exists"
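To illustrate why a bare `operator: "Exists"` toleration covers every tainted node while the keyed `Equal` example above covers only one taint, here is a minimal Python sketch of the toleration matching rules as described in the Kubernetes documentation. The `Taint`, `Toleration`, and `tolerates` names are hypothetical helpers written for this illustration; they are not part of cluster-monitoring-operator or any Kubernetes client library.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Taint:
    """A node taint (illustrative type, not a real client class)."""
    key: str
    value: str = ""
    effect: str = "NoSchedule"


@dataclass(frozen=True)
class Toleration:
    """A pod toleration; empty key or effect acts as a wildcard."""
    key: str = ""
    operator: str = "Equal"
    value: str = ""
    effect: str = ""


def tolerates(toleration: Toleration, taint: Taint) -> bool:
    """Return True if the toleration matches the taint."""
    # An empty effect on the toleration matches any taint effect.
    if toleration.effect and toleration.effect != taint.effect:
        return False
    # An empty key (only meaningful with Exists) matches any taint key.
    if toleration.key and toleration.key != taint.key:
        return False
    if toleration.operator == "Exists":
        return True
    if toleration.operator == "Equal":
        return toleration.value == taint.value
    return False


# The keyed toleration from the config example earlier in the thread.
keyed = Toleration(key="example-taint", operator="Equal",
                   value="true", effect="NoSchedule")
# The catch-all toleration suggested for node-exporter.
catch_all = Toleration(operator="Exists")

# A hypothetical taint that the keyed toleration does not know about.
other = Taint(key="dedicated", value="gpu", effect="NoExecute")

print(tolerates(keyed, Taint("example-taint", "true", "NoSchedule")))
print(tolerates(keyed, other))
print(tolerates(catch_all, other))
```

Under these assumptions the keyed toleration matches only its own taint, whereas the catch-all matches the unrelated `dedicated` taint too, which is the behavior a DaemonSet like node-exporter needs if it is to run on every node.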
from cluster-monitoring-operator.
I just ran into this issue with tainted nodes as well: the node-exporter pod is not scheduled
on them, so no metrics are available for those nodes.
Is there any workaround that you know of?
from cluster-monitoring-operator.
It seems this was fixed in the master and release-3.11 branches roughly three weeks ago (file: daemonset.yaml): fe4ef56 and 3b4ebe6
However, the last release on quay.io seems to be 9 months old.
Is there a possibility of creating a new release of this and pushing it to Quay?
from cluster-monitoring-operator.
Closing, as this is already fixed on the master branch.
from cluster-monitoring-operator.
@paulfantom Is the 3.11 image not being updated from the release-3.11 branch?
The labels on the quay.io image for 3.11 point to commit 10e4317, which is from February this year.
from cluster-monitoring-operator.
@padyx Yes, that's a bug and we are working on a solution.
from cluster-monitoring-operator.