Comments (11)
To start with we can start with the following metrics. Metric exporter will run in all Pods as container or as shared process within the container in case accessing required resource is challenge.
- Storage utilization(Size and inodes) csi-provisioner will have mounts of all Kadalu Storage Volumes(Hosting Volumes), so it is the best candidate to export all the utilization metrics(
kadalu_storage_count
,kadalu_storage_health
,kadalu_storage_capacity_bytes
,kadalu_storage_capacity_used_bytes
,kadalu_storage_capacity_free_bytes
,kadalu_storage_inodes_count
,kadalu_storage_inodes_used_count
,kadalu_storage_inodes_free_count
etc) - PV utilization Simple Quota Crawler is anyways doing the crawl and collecting the stat of each subvols/Pvs, This can export the metrics related to the PVs(
kadalu_pv_count
,kadalu_pv_capacity_bytes
,kadalu_pv_capacity_used_bytes
,kadalu_pv_capacity_free_bytes
,kadalu_pv_inodes_count
,kadalu_pv_inodes_used_count
,kadalu_pv_inodes_free_count
etc). - Memory, CPU, Uptime and other metrics Export these metrics for all the processes in Kadalu managed Containers.
- Heal related metrics CSI Provisioner or Operator can run
glfsheal
command and export the heal metrics.
Two options to export these metrics to Prometheus:
- Expose single
/metrics
endpoint from Operator. When Prometheus requests the metrics then internally collect from other pods usingkubectl exec
. - Expose metrics from respective containers itself. Prometheus Server(Prometheus Operator) can collect metrics from these individual exporters by using Pod/Container label.
Bonus: kubectl kadalu storage-health
CLI can be created by consuming these metrics. So that Admins can get the realtime status without using Prometheus or Grafana.
Let us know if any other metrics required other than the ones listed above?
from kadalu.
Thank you for your contributions.
We are marking this issue as stale because it has not had recent activity. It will be closed in 2 weeks if no further activity occurs.
from kadalu.
Still needed.
from kadalu.
Thank you for your contributions.
Noticed that this issue is idle since 180 days! There is a possibility that this issue is already fixed in later releases. Please upgrade and check! If I don't hear any update in this issue in next 2 weeks, will be closing the issue.
That doesn't mean one can't re-open the issue! Just comment on the issue, and click 'Reopen', if you still have the issue.
from kadalu.
still needed.
from kadalu.
like the storage-health command idea, not always convenient to bring up grafana for viewing health status
from kadalu.
Expose single /metrics endpoint from Operator. When Prometheus requests the metrics then internally collect from other pods using kubectl exec.
Little change to this approach. Each containers exposes ReST APIs and Operator will collect from all these containers and exposes a single API.
Operator(GET /metrics
) -> Collect metrics from all the containers in Kadalu namespace by calling respective ReST APIs(internal APIs).
Advantages:
- Easy to integrate with internal or externally hosted Prometheus server.
- Easily avoid collecting duplicate metrics. Operator can intelligently pick any one storage pod to get the metrics.
Additionally expose GET /metrics.json
for all non-Prometheus purposes.
This approach is very similar to Gluster Metrics Exporter project.
from kadalu.
@aravindavk can this issue be closed?
from kadalu.
ping @aravindavk ^^, can this be closed?
from kadalu.
Thank you for your contributions.
Noticed that this issue is idle since 180 days! There is a possibility that this issue is already fixed in later releases. Please upgrade and check! If I don't hear any update in this issue in next 2 weeks, will be closing the issue.
That doesn't mean one can't re-open the issue! Just comment on the issue, and click 'Reopen', if you still have the issue.
from kadalu.
from kadalu.
Related Issues (20)
- Node plugin is failing with NodeUnpublishVolume exception constanly HOT 4
- [Bug]: FUSE mount goes to Transport Endpoint not connected HOT 2
- [Bug]: nomad [nodeserver - 141:NodeUnpublishVolume] HOT 1
- [RFE]: Mounting provisioned volumes outside of cluster HOT 1
- [RFE]: Docker CSI Support HOT 2
- [Bug]: Link in about section of github gives a 404 HOT 2
- [RFE]: Expose pvc utilization metrics to prometheis HOT 4
- [RFE]: how to expand the KadaluStorage size? HOT 3
- [Bug]: healinfo is not working on 1.0.0 HOT 2
- [Bug]: Not able to connect to api-server 10.96.0.1 from kadalu operator HOT 1
- [Bug]: Volume creation stuck on Kadalu in Nomad with gluster HOT 4
- [RFE]: Arbiter Storage pool support
- [Bug]: Kadalu Storage miscalculated HOT 6
- [Bug]: kadalu-csi-nodeplugin is not able to authenticate gluster fs server HOT 4
- [Bug]: Suddenly stopped creating pvc after it started working. PVC stays in pending state! HOT 8
- [RFE]: Support Storage pool options HOT 3
- [Bug]: Slack space is closed/private HOT 4
- [RFE]: If devel version is installed, report the commit in output
- [Bug]: for busybox container in csiprovisioner Imagepull policy is not set and it's defaulting to Always
- kadalu support for IPv6 host HOT 10
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from kadalu.