Expose MR Metrics #555

humoflife · 2023-09-21T00:55:05Z

What problem are you facing?

Crossplane users would benefit from the ability to offer service level objectives (SLOs) to their users. To do this they need MR related metrics.

How could Crossplane help solve your problem?

At the Crossplane pod port 8080/metrics interface, expose the following:

number of MRs total, by claim, by composition
number of MRs in Synched False State total, by claim, by composition
number of MRs in Synched True State total, by claim, by composition
number of MRs in Synched Unkown State total, by claim, by composition
number of MRs in Ready False State total, by claim, by composition
number of MRs in Ready True State total, by claim, by composition
number of MRs in Ready Unknown State total, by claim, by composition
number of MRs in Synched True and Ready True State total, by claim, by composition
number of MRs in Synched False and Ready True State total, by claim, by composition
number of MRs in Synched Unknown and Ready True State total, by claim, by composition
number of MRs in Synched True and Ready False State total, by claim, by composition
number of MRs in Synched False and Ready False State total, by claim, by composition
number of MRs in Synched Unknown and Ready False State total, by claim, by composition
for each MR, time to readiness (Synched True -> Ready True)
for each MR, time to non-readiness (Ready True -> Synched !True)
for each MR, number of Synched state changes
for each MR, number of Ready state changes
for each MR, time from deletion request to external resource and MR deletion
for each claim, time to readiness (from claim created to ready)
for each claim, time to delete all associated resources (from claim to external resources removed)
if possible, control plane availability and uptime (since last Crossplane pod restart).

The implementation of the metric gathering could be performed in the Crossplane runtime reconciler code, line 675+.
https://github.com/crossplane/crossplane-runtime/blob/master/pkg/reconciler/managed/reconciler.go#L675

An example time to readiness calculation is located in Uptest at https://github.com/upbound/uptest/blob/6e567ebd9ed30f1b1670d2cbbb679fde9beebc6b/cmd/perf/internal/managed/managed.go#L171

negz · 2024-02-28T18:50:33Z

Something to double-check for whoever works on this: do we need to add all of these metrics as individual time series? Let's make sure we focus on adding low-level, flexible time series that can be composed to produce useful higher level metrics.

blut · 2024-03-20T13:04:02Z

For anyone working with crossplane https://github.com/crossplane-contrib/x-metrics might provide useful metrics to aggregate the requested metrics.

pierluigilenoci · 2024-07-30T08:32:09Z

@ezgidemirel, from your experience and knowledge, how much hope is there to see this or something similar implemented in Crossplane? 🙏🏻

Thank you. ❤️

momoXD007 · 2024-07-30T09:21:46Z

From an observability point of view Crossplane so far is lacking a good strategy to monitor the MRs and claims in "kubernetes native way".
It is hard to track the overall health of an environment without understanding if just one single claim or MR is not synched or suddenly hundreds of them are not synched.

It is currently also hard to understand adoption of Crossplane in a bigger environment: if you can't pinpoint which compositions are used the most.

ezgidemirel · 2024-07-30T11:01:42Z

Hi @pierluigilenoci, as we discussed in the slack thread, I'm closing this issue as completed.

We have introduced MR metrics with #683 but decided not to add MR names, claim names or composite names to the exposed metrics as labels. The reason is, we don't want to create a metric for each managed resource created on the cluster. This will increase the cardinality dramatically. You can see more details in the comment here.

Further discussions about having more detailed metrics can be carried on within this issue.

humoflife added the enhancement New feature or request label Sep 21, 2023

jbw976 mentioned this issue Sep 21, 2023

High level metrics crossplane/crossplane#4620

Open

7 tasks

chlunde mentioned this issue Dec 12, 2023

feat: add managed resource metrics crossplane-contrib/provider-aws#1964

Closed

2 tasks

jbw976 added metrics observability labels Mar 4, 2024

jbw976 assigned ezgidemirel Mar 7, 2024

ezgidemirel mentioned this issue Apr 4, 2024

Introduce High Level MR metrics #683

Merged

2 tasks

Shwethamuralikrishnaa mentioned this issue Jul 29, 2024

Request to Add Name Label to Crossplane Managed Resource Metrics crossplane/crossplane#5850

Open

ezgidemirel closed this as completed Jul 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expose MR Metrics #555

Expose MR Metrics #555

humoflife commented Sep 21, 2023 •

edited

Loading

negz commented Feb 28, 2024

blut commented Mar 20, 2024

pierluigilenoci commented Jul 30, 2024

momoXD007 commented Jul 30, 2024

ezgidemirel commented Jul 30, 2024

Expose MR Metrics #555

Expose MR Metrics #555

Comments

humoflife commented Sep 21, 2023 • edited Loading

What problem are you facing?

How could Crossplane help solve your problem?

negz commented Feb 28, 2024

blut commented Mar 20, 2024

pierluigilenoci commented Jul 30, 2024

momoXD007 commented Jul 30, 2024

ezgidemirel commented Jul 30, 2024

humoflife commented Sep 21, 2023 •

edited

Loading