How to use in a cluster #14

PhillippOhlandt · 2018-05-28T13:15:31Z

Hey,

How would one use this in a cluster of nodes? I don't use it to collect operating system or VM metrics, but metrics about my application state. That means every node should give me (kinda) the same metrics.

Has anyone done this yet? What is the easiest approach?

deadtrickster · 2018-05-28T14:05:07Z

Hi, could you please be more specific? The reason I ask is that there is really no difference whether it's a cluster or not. Your custom metrics are just metrics; they have an instance label attached, etc.

PhillippOhlandt · 2018-05-28T16:10:32Z

The lib itself works with ETS, so the metrics state is not distributed to all members of a cluster. I want the /metrics endpoint of every node to return the same metrics, so that I only need to scrape one of my nodes instead of all of them. I think I need a wrapper that replicates all metric changes across the cluster.

deadtrickster · 2018-05-28T16:23:14Z

but why?

PhillippOhlandt · 2018-05-28T16:57:14Z

Because I see my state as one unit and not as a unit per node. Each node will have a different part of the whole state, and all entry points (/metrics, GraphQL API, etc.) should show the whole state and not just the part held by the current node.

deadtrickster · 2018-05-28T17:01:12Z

But what will you do in a network split? How will you know the state (and the issues) of a particular node? Plus, Prometheus does exactly this: it aggregates.
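To illustrate the aggregation point, here is a minimal Python sketch of roughly what a query like `sum without (instance) (...)` does on the Prometheus side. It is not part of this library; the metric name `jobs_in_progress` and the node addresses are made up for the example. Each node reports only its own part, and the per-node series are combined after scraping.

```python
from collections import defaultdict


def sum_without_instance(samples):
    """Sum samples that share a name and all labels except `instance`."""
    totals = defaultdict(float)
    for name, labels, value in samples:
        # Drop the per-node label so series from different nodes collapse together.
        rest = tuple(sorted((k, v) for k, v in labels.items() if k != "instance"))
        totals[(name, rest)] += value
    return dict(totals)


# Hypothetical samples scraped from three nodes, each holding part of the state.
samples = [
    ("jobs_in_progress", {"instance": "node1:4000"}, 3.0),
    ("jobs_in_progress", {"instance": "node2:4000"}, 5.0),
    ("jobs_in_progress", {"instance": "node3:4000"}, 2.0),
]

cluster_view = sum_without_instance(samples)
```

The per-node series stay available alongside the summed view, so a node that misbehaves or drops out during a split is still visible on its own.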

PhillippOhlandt · 2018-05-28T17:05:30Z

> But what will you do in a network split? How will you know the state

That is a valid question for anything distributed.

> Plus, Prometheus does exactly this: it aggregates.

Yes, it does, but I do not want to reconfigure Prometheus every time I scale up. I also don't want to limit myself by requiring each node to expose a web server.

deadtrickster · 2018-05-28T17:13:06Z

> reconfigure Prometheus every time I scale up

I think this is what service discovery is for.

I understand that other concerns (including not having an HTTP endpoint) are important to you, but in the Prometheus world even reachability is itself a metric (and a good source of alerts). If you don't want to have Plug or Phoenix, you can look at the zero-dependency exporter: https://github.com/deadtrickster/prometheus-httpd.
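As a sketch of the service discovery idea: Prometheus supports file-based discovery (`file_sd_configs`), where it watches a JSON file of targets and picks up changes without a config reload. The snippet below only illustrates that idea and is not something this library ships. The host names, the port, the `env` label and the file name are all hypothetical, and how the list of live nodes is obtained is left open.

```python
import json
import os
import tempfile


def write_targets(path, hosts, port=4000, labels=None):
    """Atomically write a Prometheus file_sd target file and return its content."""
    groups = [{
        "targets": [f"{host}:{port}" for host in sorted(hosts)],
        "labels": labels or {},
    }]
    directory = os.path.dirname(os.path.abspath(path))
    # Write to a temp file first so Prometheus never reads a half-written file.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    with os.fdopen(fd, "w") as tmp:
        json.dump(groups, tmp, indent=2)
    os.replace(tmp_path, path)
    return groups


# Hypothetical node list; in practice it would come from wherever nodes register.
out_dir = tempfile.mkdtemp()
targets_file = os.path.join(out_dir, "app_nodes.json")
groups = write_targets(
    targets_file,
    ["node2.internal", "node1.internal"],
    labels={"env": "prod"},
)
```

The scrape job would then list this file under `file_sd_configs` in its `files` setting, so scaling up means rewriting one JSON file rather than editing the Prometheus configuration.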

PhillippOhlandt · 2018-05-28T17:19:40Z

I actually use Prometheus as my main long-term time series storage, and I don't trust service discovery that much. It also often requires yet another piece of software, and from a quick Google search, Prometheus doesn't seem to support SSDP.
