Skip to content

Conversation

@eggfoobar
Copy link
Contributor

This is to avoid errors in timing between file creation and when the system comes up, since the prune logic skips touching the directory if backup exists, this tests could fail sometimes if ran out of order.

Moving to one test to avoid issue.

/assign @pmtk

This is to avoid errors in timing between file creation and when the
system comes up, since the prune logic skips touching the directory if
backup exists, this tests could fail sometimes if ran out of order.

Moving to one test to avoid issue.

Signed-off-by: ehila <ehila@redhat.com>
@openshift-ci-robot openshift-ci-robot added the jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. label Jul 26, 2023
@openshift-ci-robot
Copy link

openshift-ci-robot commented Jul 26, 2023

@eggfoobar: This pull request references USHIFT-1445 which is a valid jira issue.

Details

In response to this:

This is to avoid errors in timing between file creation and when the system comes up, since the prune logic skips touching the directory if backup exists, this tests could fail sometimes if ran out of order.

Moving to one test to avoid issue.

/assign @pmtk

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@openshift-ci openshift-ci bot requested review from copejon and pliurh July 26, 2023 12:30
@openshift-ci openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jul 26, 2023
Signed-off-by: ehila <ehila@redhat.com>
reverted to having two tests
changed file name creation for unknown files to remove underscore

Signed-off-by: ehila <ehila@redhat.com>
@eggfoobar
Copy link
Contributor Author

@dhellmann and @pmtk I updated the test but I also ran across why things were failing, the issue was that we prune all directories with an underscore, is that desired or should we post fix something to our backups?

@eggfoobar eggfoobar changed the title USHIFT-1445: fix: collapse tests to one USHIFT-1445: fix: prune test Jul 26, 2023
@dhellmann
Copy link
Contributor

@dhellmann and @pmtk I updated the test but I also ran across why things were failing, the issue was that we prune all directories with an underscore, is that desired or should we post fix something to our backups?

Is there any reason for us to tolerate anything in that directory that we don't create?

@dhellmann
Copy link
Contributor

/lgtm

Let's get CI working and come back to the question about how much we should retain or how aggressively we should delete things.

@openshift-ci openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Jul 26, 2023
@openshift-ci
Copy link
Contributor

openshift-ci bot commented Jul 26, 2023

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: dhellmann, eggfoobar

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details Needs approval from an approver in each of these files:
  • OWNERS [dhellmann,eggfoobar]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@eggfoobar
Copy link
Contributor Author

@dhellmann and @pmtk I updated the test but I also ran across why things were failing, the issue was that we prune all directories with an underscore, is that desired or should we post fix something to our backups?

Is there any reason for us to tolerate anything in that directory that we don't create?

Currently, we place a failed log there that we push up to greenboot, which does have the name prerun_failed.log, that won't be deleted before it's presented to greenboot, since it's a result of a backup failure and subsequent failed restarts mean that that file won't be removed, but just something to keep in mind for other future instances where we store files there.

@dhellmann
Copy link
Contributor

@dhellmann and @pmtk I updated the test but I also ran across why things were failing, the issue was that we prune all directories with an underscore, is that desired or should we post fix something to our backups?

Is there any reason for us to tolerate anything in that directory that we don't create?

Currently, we place a failed log there that we push up to greenboot, which does have the name prerun_failed.log, that won't be deleted before it's presented to greenboot, since it's a result of a backup failure and subsequent failed restarts mean that that file won't be removed, but just something to keep in mind for other future instances where we store files there.

We put that in the backup directory? Could it live in /usr/lib/microshift instead?

@openshift-ci-robot
Copy link

/retest-required

Remaining retests: 0 against base HEAD c3a8c05 and 2 for PR HEAD 957dc76 in total

@pmtk
Copy link
Member

pmtk commented Jul 26, 2023

@dhellmann and @pmtk I updated the test but I also ran across why things were failing, the issue was that we prune all directories with an underscore, is that desired or should we post fix something to our backups?

Seems like I've skipped few thought steps lgtming previous PR 😆
This is actually intentional, how do you differentiate between known and unknown? We take all backups matching %s_%s and remove them if deployment id is no longer on the system.

Currently, we place a failed log there that we push up to greenboot, which does have the name prerun_failed.log, that won't be deleted before it's presented to greenboot, since it's a result of a backup failure and subsequent failed restarts mean that that file won't be removed, but just something to keep in mind for other future instances where we store files there.

Does prerun_failed.log gets deleted when pruning the backups? dataManager's GetBackupList() should only return directories.
I wouldn't rely on "backup pruning" to remove that file (even sounds like a side effect, if that works that is, but it shouldn't)

@eggfoobar
Copy link
Contributor Author

Does prerun_failed.log gets deleted when pruning the backups? dataManager's GetBackupList() should only return directories. I wouldn't rely on "backup pruning" to remove that file (even sounds like a side effect, if that works that is, but it shouldn't)

Nah, by default on startup that file gets deleted and only get's written when a failure happens, so pruning never gets a chance to touch it. On failures, greenboot will read that file during healthchecks

@openshift-ci
Copy link
Contributor

openshift-ci bot commented Jul 26, 2023

@eggfoobar: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name Commit Details Required Rerun command
ci/prow/microshift-e2e-arm 957dc76 link false /test microshift-e2e-arm

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

@openshift-merge-robot openshift-merge-robot merged commit 23a7895 into openshift:main Jul 26, 2023
@eggfoobar eggfoobar deleted the collapse-prune-test-to-one branch August 11, 2023 12:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

approved Indicates a PR has been approved by an approver from all required OWNERS files. jira/valid-reference Indicates that this PR references a valid Jira ticket of any type. lgtm Indicates that a PR is ready to be merged.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants