Skip to content

Conversation

@deads2k
Copy link
Contributor

@deads2k deads2k commented Sep 28, 2021

if job runs have timed out, we can ignore them for the purposes of aggregation. This only becomes a practical problem if the jobs start running very very long.

@openshift-ci openshift-ci bot requested review from droslean and smg247 September 28, 2021 19:47
@openshift-ci openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Sep 28, 2021
func (o *JobRunAggregatorAnalyzerOptions) Run(ctx context.Context) error {
// if it hasn't been more than hour since the jobRuns started, the list isn't complete.
readyAt := o.jobRunStartEstimate.Add(1 * time.Hour)
timeToStopWaiting := o.jobRunStartEstimate.Add(3*time.Hour + 10*time.Minute)
Copy link
Contributor

@dgoodwin dgoodwin Sep 28, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Eyeballing sippy and prow: I see some 4.10 Azure ovn upgrade jobs passing after 3:20 pretty consistently, so might need a little more time here.
AWS around this range, 2:45 -> 3:05,
GCP a little faster around 2:35 often.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Eyeballing sippy and prow: I see some 4.10 Azure ovn upgrade jobs passing after 3:20 pretty consistently, so might

there is no more time. this is pressing up against the max ci-operator time

Copy link
Member

@wking wking Sep 28, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In openshift/release#22289, I'm talking about this a bit, and I think we might need to talk the test-platform folks into raising the Plank/Prow timeout above its current 4h, or giving us a way to do that for particular generated jobs (there's already a way to raise it for non-generated jobs via decoration_config.timeout).

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

And IIRC we're not planning on aggregating ovn-azure anyhow?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

And IIRC we're not planning on aggregating ovn-azure anyhow?

we will

@deads2k
Copy link
Contributor Author

deads2k commented Sep 29, 2021

/retest

@deads2k
Copy link
Contributor Author

deads2k commented Sep 29, 2021

/retest

Copy link
Contributor

@dgoodwin dgoodwin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks like we've got something to search on to see how much we're hitting this.

/lgtm

@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Sep 29, 2021
@openshift-ci
Copy link
Contributor

openshift-ci bot commented Sep 29, 2021

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: deads2k, dgoodwin

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-merge-robot openshift-merge-robot merged commit 07bbd6f into openshift:master Sep 29, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. lgtm Indicates that a PR is ready to be merged.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants