Skip to content

Conversation

@sallyom
Copy link
Contributor

@sallyom sallyom commented Oct 25, 2019

It will be convenient to have a shared cluster (via a bot in channel that will pin a kubeconfig/kubeadmin-pw to that channel?)

For now, I've been manually creating/pinning a kubeconfig for groub-b to share, and I know console team does the same.

/cc @smarterclayton

@openshift-ci-robot
Copy link
Contributor

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: sallyom
To complete the pull request process, please assign stevekuznetsov
You can assign the PR to them by writing /assign @stevekuznetsov in a comment when ready.

The full list of commands accepted by this bot can be found here.

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@openshift-ci-robot openshift-ci-robot added the size/M Denotes a PR that changes 30-99 lines, ignoring generated files. label Oct 25, 2019
@smarterclayton
Copy link
Contributor

@bparees owns all the long running clusters, you'll need to work with him on this.

@smarterclayton
Copy link
Contributor

This won't work as is because there is a fixed time limit for prow jobs, and if the job is evicted the cluster is leaked. It also absorbs quota.

If we want to have a single cluster up for a day that everyone shares, that was supposed to be the long running clusters.

@bparees
Copy link
Contributor

bparees commented Oct 29, 2019

@sallyom is this tied to https://jira.coreos.com/browse/DPP-1338 ?

@bparees
Copy link
Contributor

bparees commented Oct 29, 2019

@sallyom see also #3887

@sallyom
Copy link
Contributor Author

sallyom commented Oct 30, 2019

@bparees thanks, I'll follow up w/ you on this! I'll close this for now.

@sallyom sallyom closed this Oct 30, 2019
@soltysh
Copy link

soltysh commented Oct 31, 2019

@bparees it's a combination of both, I guess. We have a cluster created by Sally every other day that is shared between all group b members. The idea is to limit the time needed to wait for cluster where you need to check small things, like configs, running a simple app, verify oc, etc.

@bparees
Copy link
Contributor

bparees commented Oct 31, 2019

@soltysh i know @chancez has a cluster like that which he maintains.. you might see if he's willing to let you use it also if you're doing non-destructive things. Alternatively you can get your own set of openshift-llc creds and run your own long-lived cluster like he's doing.

@soltysh
Copy link

soltysh commented Oct 31, 2019

@bparees truth to be told, I don't mind the fact that this cluster has to be renewed every 2 days, actually that's pretty good b/c we have reasonably fresh cluster, so I don't think we care about LLC here. Although I'd like to have automation around it so that @sallyom (who's graciously doing this for group-b) can be let go of remembering about it.

@sallyom
Copy link
Contributor Author

sallyom commented Oct 31, 2019

Yes @bparees we would like a slack bot to create a new cluster every day or 2, and pin the kubeconfig to our channel/remove the old pin. right now that bot is me. This PR introduces a new job similar to cluster-bot but instead of a few hrs, the cluster remains for a day. My idea was to add this job, then configure a new slack bot to launch it, 1 per channel or something.

@bparees
Copy link
Contributor

bparees commented Oct 31, 2019

maybe @smarterclayton would be willing to let cluster-bot launch long-lived clusters and you could just use a slack reminder to trigger the bot?

@smarterclayton
Copy link
Contributor

smarterclayton commented Oct 31, 2019 via email

@chancez
Copy link
Contributor

chancez commented Oct 31, 2019 via email

@soltysh
Copy link

soltysh commented Nov 5, 2019

It doesn't have to be long lived one, but I like @smarterclayton's idea of 8h one, although maybe extended to 12h, that should cover 2 TZ which is my entire team, at least.

@smarterclayton
Copy link
Contributor

smarterclayton commented Nov 7, 2019 via email

@bparees
Copy link
Contributor

bparees commented Nov 7, 2019

I still want Ben's infra to be hooked in so we start growing that (we have
to have long lived clusters).

I was about to switch gears to baremetal clusters, but i just got the LLC creds so i'm going to try to get the AWS longlived cluster jobs going again.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

size/M Denotes a PR that changes 30-99 lines, ignoring generated files.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants