add class ClusterReplicationCollector #166

themoriarti · 2023-09-13T19:55:05Z

This class should collect the status of replication that is performed for each lxc/vm for all servers in the cluster.

znerol · 2023-10-01T11:20:24Z

Thanks for taking the time to file a PR.

Unfortunately scraping /nodes has turned out to be inherently inefficient (see #55 and #58). Especially for big and growing deployments, this can get nasty quite quickly.

This PR is using the exact same known-to-be-faulty mechanism to collect the desired data. To make matters worse, the problematic loop would be running twice after that PR landed and as a result the time to collect all metrics will double for many users.

See the comments in #115 for alternative ideas on how to scrape config efficiently.

Also note that we might scrape /nodes in a different manner after #164 landed.
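To make the cost concern above concrete, here is a minimal sketch of how the request count grows when more than one collector loops over `/nodes`. `FakeClient`, `collect_per_node` and `requests_for` are hypothetical names for illustration only, not code from the exporter, and the sub-paths are placeholders.

```python
class FakeClient:
    """Hypothetical stand-in for an API client that only counts requests."""

    def __init__(self, nodes):
        self.nodes = nodes
        self.requests = 0

    def get(self, path):
        self.requests += 1
        if path == "/nodes":
            return [{"node": name} for name in self.nodes]
        # Any per-node path returns an empty payload in this sketch.
        return []


def collect_per_node(client, subpath):
    """List the nodes, then issue one extra request per node."""
    results = {}
    for node in client.get("/nodes"):
        name = node["node"]
        results[name] = client.get(f"/nodes/{name}/{subpath}")
    return results


def requests_for(node_count, subpaths):
    """Count the requests made when each sub-path runs its own node loop."""
    client = FakeClient([f"pve{i}" for i in range(node_count)])
    for subpath in subpaths:
        collect_per_node(client, subpath)
    return client.requests
```

With ten nodes, one looping collector issues 11 requests and two issue 22, which is the doubling described above, assuming each collector repeats the full loop.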

themoriarti · 2023-10-02T17:21:12Z

> Thanks for taking the time to file a PR.
>
> Unfortunately scraping /nodes has turned out to be inherently inefficient (see #55 and #58). Especially for big and growing deployments, this can get nasty quite quickly.
>
> This PR is using the exact same known-to-be-faulty mechanism to collect the desired data. To make matters worse, the problematic loop would be running twice after that PR landed and as a result the time to collect all metrics will double for many users.
>
> See the comments in #115 for alternative ideas on how to scrape config efficiently.
>
> Also note that we might scrape /nodes in a different manner after #164 landed.

Yes, it really simplifies the process and speeds it up. It also makes it possible to install the exporter on each Proxmox server of the cluster and collect metrics from each server separately, but some metrics of the cluster will be duplicated. I have made changes to the process of collecting replication tasks status.

znerol · 2023-10-09T15:53:07Z

> but some metrics of the cluster will be duplicated. I have made changes to the process of collecting replication tasks status.

Not sure what you mean by duplicated metrics. If you have specific feedback on the refactoring, then please comment over there: #164

znerol · 2023-11-05T19:56:02Z

I released 3.0.0 and also merged #198. This PR now needs a little refactoring for the new file layout.

Add replication metrics as requested in issue #112.

* Replication Metrics are fetched per node
* The metrics can be enabled or disabled

Based on the original PR #166 adapted to the new file structure.

---------

Signed-off-by: Sven Gerber <[email protected]>
Co-authored-by: znerol <[email protected]>
Co-authored-by: Marian Koreniuk <[email protected]>
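As a rough illustration of the two points in that commit message (per-node fetching and an on/off switch), the following sketch turns one node's replication job list into metric samples. It is not the code merged in #243: the `Sample` type, the `pve_replication_*` metric names, the `enabled` flag and the job fields (`id`, `fail_count`, `last_sync`, `duration`) are assumptions made for the example.

```python
from typing import NamedTuple


class Sample(NamedTuple):
    """Hypothetical metric sample: name, label set, numeric value."""
    name: str
    labels: dict
    value: float


def replication_samples(node, jobs, enabled=True):
    """Build samples for one node's replication jobs.

    `jobs` is assumed to be a list of dicts as a per-node replication
    status request might return them; the field names are illustrative.
    """
    if not enabled:
        # The collector is switched off, so it emits nothing.
        return []
    samples = []
    for job in jobs:
        labels = {"node": node, "id": job["id"]}
        for field in ("fail_count", "last_sync", "duration"):
            if field in job:
                samples.append(
                    Sample(f"pve_replication_{field}", labels, float(job[field]))
                )
    return samples
```

Calling `replication_samples("pve1", jobs)` once per node keeps each request scoped to a single node, and passing `enabled=False` skips the work entirely.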

znerol · 2024-04-27T09:54:44Z

Closing in favor of #243 (also credited @themoriarti over there for the original work).

themoriarti added 4 commits September 13, 2023 22:39

add class ClusterReplicationCollector

7752c03

This class should collect the status of replication that is performed for each lxc/vm for all servers in the cluster.

get replica status

4203a44

fix got an unexpected keyword argument 'replication'

071ffd9

fix fail_count GaugeMetricFamily

531a86d

add fix scraping /nodes

4c437dd

Remove node['node'], add replication to readme

cea30af

svengerber mentioned this pull request Apr 18, 2024

Add ZFS replication metrics #243

Merged

znerol closed this Apr 27, 2024
