Giter VIP home page Giter VIP logo

Comments (17)

geerlingguy avatar geerlingguy commented on May 25, 2024

I have the node_exporter role just about up and running. Doing some final testing on it now.

from turing-pi-cluster.

geerlingguy avatar geerlingguy commented on May 25, 2024

Grafana example deployment (I may just use this): https://github.com/kubernetes/ingress-nginx/tree/master/deploy/grafana

from turing-pi-cluster.

geerlingguy avatar geerlingguy commented on May 25, 2024

Trying to get it running with kube-prometheus, but I've run into prometheus-operator/kube-prometheus#545

from turing-pi-cluster.

geerlingguy avatar geerlingguy commented on May 25, 2024

It looks like there are actually a lot of different bits and pieces that are not compatible with ARM/Pi, so next step is trying out https://github.com/carlosedp/cluster-monitoring instead. See related blog post: Creating a full monitoring solution for ARM Kubernetes Cluster.

from turing-pi-cluster.

geerlingguy avatar geerlingguy commented on May 25, 2024

I hit an issue with Getting /bin/jb: not found when I try running make vendor. But resolved it by setting GOPATH: /home/pirate/go in the make command environment differently.

from turing-pi-cluster.

geerlingguy avatar geerlingguy commented on May 25, 2024

New issue: During 'make', I get 'gojsontoyaml: not found'.

from turing-pi-cluster.

geerlingguy avatar geerlingguy commented on May 25, 2024

Another new issue: Prometheus operator giving RBAC error with latest commit.

from turing-pi-cluster.

geerlingguy avatar geerlingguy commented on May 25, 2024

And: CPU Temperature alert giving 'No data' on Raspberry Pi CM3+.

from turing-pi-cluster.

geerlingguy avatar geerlingguy commented on May 25, 2024

All those issues are fixed, though now I'm not getting any node metrics :D

from turing-pi-cluster.

geerlingguy avatar geerlingguy commented on May 25, 2024

And one more: 'arm-exporter' isn't running on master node, only workers.

from turing-pi-cluster.

geerlingguy avatar geerlingguy commented on May 25, 2024

I have this all working now:

Screenshot_2020-05-23 Kubernetes cluster monitoring (via Prometheus) - Grafana

Two open PRs are in the works or the upstream repo. The two changes that are required currently until carlosedp/cluster-monitoring#41 and carlosedp/cluster-monitoring#42 are merged:

Add the following in the armexporter-daemonset.yaml DaemonSet spec:

    spec:
      affinity:                                                                           
        nodeAffinity:                                                  
          requiredDuringSchedulingIgnoredDuringExecution:              
            nodeSelectorTerms:                                         
            - matchExpressions:                                        
              - key: kubernetes.io/arch                                
                operator: In                                           
                values:                                                
                - arm                                                  
                - arm64 
      tolerations:
      - operator: Exists

And remove the nodeSelector from the same spec.

from turing-pi-cluster.

geerlingguy avatar geerlingguy commented on May 25, 2024

Another issue, just an annoyance more than anything: CPU Temperature monitor giving Pod IPs instead of node IPs, so DNS names don't display.

from turing-pi-cluster.

geerlingguy avatar geerlingguy commented on May 25, 2024

Affinity PR was fixed up and merged, woohoo!

from turing-pi-cluster.

geerlingguy avatar geerlingguy commented on May 25, 2024

One other note: It seems that Prometheus dies when the worker on which it's scheduled gets any extra load, and starts swapping like mad. Swap is soooooo slow on the Pi's eMMC or microSD, so load goes north of 9-10, then the node becomes unresponsive.

I opened a support issue How to change Prometheus' scrapeInterval?, to see if maybe there's a way to easily work around this by scraping less often.

from turing-pi-cluster.

geerlingguy avatar geerlingguy commented on May 25, 2024

Sheesh. Every little bug, fixed in a couple days. Thanks so much to @carlosedp for helping out!

from turing-pi-cluster.

geerlingguy avatar geerlingguy commented on May 25, 2024

Testing a final rebuild on the latest commit, if it works, I can close this out, woohoo!

from turing-pi-cluster.

geerlingguy avatar geerlingguy commented on May 25, 2024

It's all working quite well now.

Screen Shot 2020-05-25 at 8 02 35 PM

from turing-pi-cluster.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.