Giter VIP home page Giter VIP logo

Comments (5)

syncronize-issues-to-jira avatar syncronize-issues-to-jira commented on September 18, 2024

Thank you for reporting us your feedback!

The internal ticket has been created: https://warthogs.atlassian.net/browse/KF-5487.

This message was autogenerated

from bundle-kubeflow.

orfeas-k avatar orfeas-k commented on September 18, 2024

admission-webhook

Code doesn't implement any metrics although there are some references in its go.mod and .sum files.

argo-controller

dex

envoy

istio-gateway

istio-pilot

jupyter-controller

jupyter-ui

Code doesn't implement any metrics.

katib-controller

katib-db-manager

Code doesn't implement any prometheus metrics.

katib-ui

Code doesn't implement any metrics.

kfp-api

Pipeline steps don't expose metrics by default. Feature requests:

kfp-metadata-writer

Code doesn't implement anything related to metrics https://github.com/kubeflow/pipelines/tree/master/backend/metadata_writer

kfp-persistence

Code doesn't implement any metrics. The only reference to metrics is about "metrics" provided from the application for exposing artifacts in the UI.

kfp-profile-controller

Code doesn't implement any metrics.

kfp-schedwf

Code doesn't implement any prometheus metrics.

kfp-ui

Code doesn't implement any prometheus metrics.

kfp-viewer

Code doesn't implement any prometheus metrics.

kfp-viz

Code doesn't implement any prometheus metrics.

knative-eventing & knative-serving

knative-operator

kserve-controller

kubeflow-dashboard

  • implements metrics but are only accessible in the case of a deployment in GCE. When deploying CKF, that's what the dashboard logs:
     2024-04-02T08:03:21.120Z [serve] > [email protected] serve
     2024-04-02T08:03:21.120Z [serve] > node dist/server.js
     2024-04-02T08:03:21.120Z [serve] 
     2024-04-02T08:03:23.555Z [serve] Initializing Kubernetes configuration
     2024-04-02T08:03:23.611Z [serve] Unable to fetch Application information: 404 page not found
     2024-04-02T08:03:23.611Z [serve] 
     2024-04-02T08:03:23.637Z [serve] "other" is not a supported platform for Metrics
     2024-04-02T08:03:23.638Z [serve] Using Profiles service at http://kubeflow-profiles.kubeflow:8081/kfam
     2024-04-02T08:03:23.645Z [serve] Server listening on port http://localhost:8082 (in production mode)
    
  • Curling the pod's endpoint <pod-ip>:80802/api/metrics returns
     {"error":"Operation not supported"}%
    

Thus, metrics are not available from upstream in our case.

kubeflow-profiles

kfam

profiles

kubeflow-roles

There isn't an upstream app for this charm.

kubeflow-volumes

Code doesn't implement any metrics.

metacontroller

minio

mlmd

Code doesn't implement any metrics.

oidc-gatekeeper

There are some references to prometheus packages in go.mod and .sum files but nothing is implemented in its code.

pvcviewer-operator

  • Thus, no text file containing exposed metrics here.

seldon-controller-manager

tensorboard-controller

tensorboard-web-app

Code doesn't implement any metrics.

training-operator

from bundle-kubeflow.

orfeas-k avatar orfeas-k commented on September 18, 2024

Upstream apps that do not already expose metrics (wip)

  1. admission-webhook
  2. jupyter-ui (jupyter-web-app)
  3. katib-db-manager
  4. katib-ui
  5. kfp-metadata-writer
  6. kfp-persistence
  7. kfp-profile-controller
  8. kfp-schedwf
  9. kfp-ui
  10. kfp-viewer
  11. kfp-viz
  12. kubeflow-dashboard (explanation in previous comment)
  13. kubeflow-roles. This isn't an upstream app but we 'd still need an exporter if we 'd like metrics from this charm.
  14. kubeflow-volumes (volumes-web-app)
  15. mlmd
  16. oidc-authservice
  17. tensorboards-web-app

from bundle-kubeflow.

kimwnasptd avatar kimwnasptd commented on September 18, 2024

Regarding all the K8s Controllers from kubeflow/kubeflow (notebooks, profiles, tensorboards) they will get some quite useful metrics by default because of controller-runtime golang package, that comes with Kubebuilder
https://book.kubebuilder.io/reference/metrics-reference

Those are perfect for capturing if the controllers are working as expected, and it's great it will be handled by default.

In order for this to happen though, someone upstream will need to bump the controller-runtime package from 0.11 to 0.16.3

from bundle-kubeflow.

orfeas-k avatar orfeas-k commented on September 18, 2024

Related sheet

from bundle-kubeflow.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.