Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Prometheus monitoring #37

Open
kingdonb opened this issue Jun 27, 2023 · 0 comments
Open

Prometheus monitoring #37

kingdonb opened this issue Jun 27, 2023 · 0 comments

Comments

@kingdonb
Copy link
Owner

Whether it's done through kube-state-metrics support for monitoring Custom Resources, or if we actually go to the trouble of emitting Prometheus metrics, we should have some AlertManager controls around "what if things go wrong"

I think the web server should emit the health metrics and it should be passively monitoring the database

However that conflicts with the serverless design, which says that nothing passively monitors the database on a continuous basis (how do you expect to get wall clock usage down if we're doing continuous monitoring? no, we load test at rollout time, do canary analysis, and then scale down until an event requires scaling back up...)

Anyway, in the context of all of that, we need an alert that will tell us when Production is not staying fresh.

It can be that Production is kept fresh by the GHA workflows from #22 – the alert should not be firing simply because we don't see a cronjob that has run recently.

We can dial back the KEDA frequency to match what we've told GitHub once we have KEDA, and monitor kube-state-metrics without any resident process required, just something to reconcile once before the health checking times out.

Not sure how to handle this yet. Punting to a later release, as I'm out of release tokens for today.

kingdonb pushed a commit to kingdonb/bootstrap-repo that referenced this issue Jun 27, 2023
It isn't necessary to run this because we have had this solved since:

* kingdonb/stats-tracker-ghcr#22

kingdonb/stats-tracker-ghcr#22
https://github.com/kingdonb/stats-tracker-ghcr/actions/workflows/execute.yaml

It is noteworthy that we don't have monitoring, I opened
kingdonb/stats-tracker-ghcr#37 to address this later

Signed-off-by: Kingdon Barrett <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant