Whether it's done through kube-state-metrics support for monitoring Custom Resources, or whether we actually go to the trouble of emitting Prometheus metrics ourselves, we should have some AlertManager controls around "what if things go wrong".
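For the kube-state-metrics route, a minimal sketch of the CustomResourceState config, assuming a hypothetical `Database` custom resource in group `example.com` with an RFC3339 `status.lastRefreshTime` field (none of those names are settled):

```yaml
# Passed to kube-state-metrics via --custom-resource-state-config-file.
# The Database kind, its group, and the status path are placeholders.
kind: CustomResourceStateMetrics
spec:
  resources:
    - groupVersionKind:
        group: example.com
        version: v1
        kind: Database
      metrics:
        - name: "database_last_refresh_time"
          help: "Unix timestamp of the last successful refresh, read from status"
          each:
            type: Gauge
            gauge:
              # RFC3339 timestamps are parsed to unix epoch seconds
              path: [status, lastRefreshTime]
```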
I think the web server should emit the health metrics, and it should be passively monitoring the database.
However, that conflicts with the serverless design, which says that nothing passively monitors the database on a continuous basis. (How do you expect to get wall clock usage down if we're doing continuous monitoring? No: we load test at rollout time, do canary analysis, and then scale down until an event requires scaling back up...)
Anyway, in the context of all of that, we need an alert that will tell us when Production is not staying fresh.
It may be that Production is kept fresh by the GHA workflows from #22 – the alert should not fire simply because we don't see a cronjob that has run recently.
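Something like the rule below could capture that: alert on the freshness timestamp itself, not on cronjob recency, so it doesn't matter whether a cronjob or a GHA workflow did the refreshing. A minimal sketch, assuming the hypothetical `database_last_refresh_time` gauge above (kube-state-metrics prefixes it with `kube_customresource_` by default) and an arbitrary six-hour staleness budget:

```yaml
apiVersion: monitoring.coreos.com/v1
kind: PrometheusRule
metadata:
  name: production-freshness
spec:
  groups:
    - name: freshness
      rules:
        - alert: ProductionNotFresh
          # Fires only when the data is actually stale, regardless of
          # which mechanism was supposed to be keeping it fresh.
          expr: time() - max(kube_customresource_database_last_refresh_time) > 6 * 3600
          for: 15m
          labels:
            severity: warning
          annotations:
            summary: Production data has not been refreshed in over six hours
```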
Once we have KEDA, we can dial its polling frequency back to match what we've told GitHub, and monitor kube-state-metrics without any resident process required: just something that reconciles once before the health checking times out.
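A hedged sketch of how that could look with KEDA's Prometheus scaler: the ScaledObject polls the query on its own interval (nothing resident in our workload), and scale-to-zero is allowed between events. The target name, server address, query, interval, and threshold are all placeholders:

```yaml
apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  name: web-scaler
spec:
  scaleTargetRef:
    name: web            # placeholder Deployment name
  minReplicaCount: 0     # nothing resident between events
  maxReplicaCount: 1
  pollingInterval: 300   # seconds; dial this back to match the GHA schedule
  triggers:
    - type: prometheus
      metadata:
        serverAddress: http://prometheus-operated.monitoring.svc:9090
        # Hypothetical query against the kube-state-metrics gauge sketched
        # above: seconds since the last refresh; wake the workload when stale.
        query: time() - max(kube_customresource_database_last_refresh_time)
        threshold: "21600"
```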
Not sure how to handle this yet. Punting to a later release, as I'm out of release tokens for today.
kingdonb pushed a commit to kingdonb/bootstrap-repo that referenced this issue on Jun 27, 2023.