
Conversation

@deads2k (Contributor) commented Apr 1, 2020

This helps diagnose CI test flakes that may be caused by operator problems, and it re-uses the existing, valuable visualization for the run.

@openshift-ci-robot

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: deads2k

The full list of commands accepted by this bot can be found here.

The pull request process is described here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci-robot added the approved label (Indicates a PR has been approved by an approver from all required OWNERS files.) Apr 1, 2020

When debugging a failed e2e test (or a string of them), one common question is, "what is the status of clusteroperator/foo
when this particular test was running".
While we could consider one-off solutions to this, we have a solution for storing this information inside of a local

@stevekuznetsov commented:

Could you link to the tools mentioned here? This document as written is so vague as to not be understandable :|

@deads2k (Contributor, Author) replied:

> Could you link to the tools mentioned here? This document as written is so vague as to not be understandable :|

It captured the idea for @damemi, who I think has found the tool and has a PR to add it.

@damemi commented Apr 9, 2020


@stevekuznetsov the tool being referred to is https://github.com/mfojtik/ci-monitor-operator and we are working on adding it in openshift/origin#24845

This has inspired a longer-term goal for me to add distributed tracing throughout our components.
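
For readers following along: the core idea is just to record a timestamped snapshot of every ClusterOperator's conditions at a regular interval so a failed e2e test can be lined up against operator state afterwards. Below is a minimal sketch of that approach; the kubeconfig wiring, 30-second poll interval, and log format are illustrative assumptions, not the actual ci-monitor-operator implementation.

```go
// Illustrative sketch only: periodically snapshot ClusterOperator
// conditions so a failed e2e run can be correlated with operator
// state over time. Poll interval and output format are assumptions.
package main

import (
	"context"
	"fmt"
	"time"

	configclient "github.com/openshift/client-go/config/clientset/versioned"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	// Assumption: running out-of-cluster against the local kubeconfig.
	cfg, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
	if err != nil {
		panic(err)
	}
	client := configclient.NewForConfigOrDie(cfg)

	for {
		operators, err := client.ConfigV1().ClusterOperators().List(context.TODO(), metav1.ListOptions{})
		if err != nil {
			fmt.Printf("%s list failed: %v\n", time.Now().Format(time.RFC3339), err)
		} else {
			for _, co := range operators.Items {
				for _, cond := range co.Status.Conditions {
					// One timestamped line per condition; grep-able after the run.
					fmt.Printf("%s %s %s=%s: %s\n",
						time.Now().Format(time.RFC3339), co.Name, cond.Type, cond.Status, cond.Message)
				}
			}
		}
		time.Sleep(30 * time.Second)
	}
}
```

With output like this collected for the whole run, answering "what was clusteroperator/foo doing while this test ran" becomes a grep over a time window.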


### Goals

1. Know the state of clusteroperators, events, and pods at any given time.
@lilic (Contributor) commented May 19, 2020


> Know the state of clusteroperators, events, and pods at any given time

What kind of state do you need to know? I am curious which metrics are missing that should be sent from CI clusters, since we plan to add the ability to search through CI cluster metrics in the near future. Would it be useful to connect the different traces, the metrics we send, and what you are proposing here?


## Proposal

1. Install Michal's tool in every cluster
@lilic (Contributor) commented May 19, 2020


Currently we plan to enable Prometheus remote write for CI clusters, sending some metrics and pending alerts to a store where they can be queried along a timeline. I would love your feedback on which metrics should be included in the first batch we send out, thanks!

https://docs.google.com/document/d/1_ILVUYNBC07EHaIlqel9EL1UCWLQlKlMJtTz2Xq9Tmo/edit
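
For context, OpenShift's cluster monitoring stack reads remote-write settings from the cluster-monitoring-config ConfigMap in the openshift-monitoring namespace. A minimal sketch follows; the receiver URL is a placeholder assumption:

```yaml
# Sketch of enabling Prometheus remote write via the cluster
# monitoring ConfigMap. The receiving endpoint is a placeholder.
apiVersion: v1
kind: ConfigMap
metadata:
  name: cluster-monitoring-config
  namespace: openshift-monitoring
data:
  config.yaml: |
    prometheusK8s:
      remoteWrite:
        - url: "https://metrics-receiver.example.com/api/v1/write"
```

Given this enhancement's focus, one natural candidate for the first batch would presumably be the cluster_operator_conditions series, since it encodes the same per-operator condition state discussed above.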

approvers:
creation-date: yyyy-mm-dd
last-updated: yyyy-mm-dd
status: provisional|implementable|implemented|deferred|rejected|withdrawn|replaced
A repository member commented:

You need to pick values for these headers, right? Or should we just drop them from the template?

@openshift-bot

Issues go stale after 90d of inactivity.

Mark the issue as fresh by commenting /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.
Exclude this issue from closing by commenting /lifecycle frozen.

If this issue is safe to close now please do so with /close.

/lifecycle stale

@openshift-ci-robot added the lifecycle/stale label (Denotes an issue or PR has remained open with no activity and has become stale.) Oct 15, 2020
@openshift-bot

Stale issues rot after 30d of inactivity.

Mark the issue as fresh by commenting /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.
Exclude this issue from closing by commenting /lifecycle frozen.

If this issue is safe to close now please do so with /close.

/lifecycle rotten
/remove-lifecycle stale

@openshift-ci-robot added the lifecycle/rotten label (Denotes an issue or PR that has aged beyond stale and will be auto-closed.) and removed the lifecycle/stale label Nov 14, 2020
@openshift-bot

Rotten issues close after 30d of inactivity.

Reopen the issue by commenting /reopen.
Mark the issue as fresh by commenting /remove-lifecycle rotten.
Exclude this issue from closing again by commenting /lifecycle frozen.

/close

@openshift-ci-robot

@openshift-bot: Closed this PR.


In response to this:

Rotten issues close after 30d of inactivity.

Reopen the issue by commenting /reopen.
Mark the issue as fresh by commenting /remove-lifecycle rotten.
Exclude this issue from closing again by commenting /lifecycle frozen.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
