You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We start having some performance issues with metrics rendering but more important with making analysis on top of already available metrics.
Solution
To make the performance better for real-time metrics we have to think about time-series databases. Having Prometheus as the main data source for metrics can be useful as we make integration with Grafana available by default.
Proof of concept
Use pushgateway to send metrics from every agent
Integration with Prometheus API to make queries
The text was updated successfully, but these errors were encountered:
Since we have a lot of dynamic agents that can be provisioned for Maestro it makes sense to allow them automatically push data to Prometheus instead of scrapping metrics by Prometheus.
Another thing it's security. Agents by default don't expose any port outside. They work just on Pull basics and get regularly updated by making requests to API.
Saying that I think the best way of using Prometheus as a time-serious database we need to use Prometheus pushgateway to allow Maestro agents to send metrics directly there. This is a good way to keep metrics aggregation and sending as agent responsibility based on runner type.
The main downside is Prometheus pushgateway is going to be a single point of failure and the main performance bottleneck. In the future, we probably can have a way to use more than one gateway to scale things up but I don't see this is as a problem for the first versions.
Here is also a quick diagram of how the things will look like:
Description
We start having some performance issues with metrics rendering but more important with making analysis on top of already available metrics.
Solution
To make the performance better for real-time metrics we have to think about time-series databases. Having Prometheus as the main data source for metrics can be useful as we make integration with Grafana available by default.
Proof of concept
pushgateway
to send metrics from every agentThe text was updated successfully, but these errors were encountered: