
doc(bundles): propose strategy for handling big bundles#40

Closed
njhale wants to merge 1 commit into operator-framework:master from njhale:big-bundles

Conversation

@njhale
Member

@njhale njhale commented Aug 13, 2020

Address operator-framework/operator-lifecycle-manager#1523

To ensure Operator Bundles of arbitrary size can be unpacked and used in a Kubernetes cluster by OLM, we can ...

@openshift-ci-robot openshift-ci-robot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Aug 13, 2020
@njhale njhale force-pushed the big-bundles branch 2 times, most recently from 3ae16bc to 7d938f7 Compare September 1, 2020 20:18
@njhale njhale marked this pull request as ready for review September 1, 2020 20:20
@njhale njhale changed the title WIP doc(bundles): propose strategy for handling big bundles doc(bundles): propose strategy for handling big bundles Sep 1, 2020
@openshift-ci-robot openshift-ci-robot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Sep 1, 2020

#### Partitioning and Storage

Instead of extracting bundle content directly to a `ConfigMap` -- as outlined in the [previous section](#podspec-pull-requirement) -- the unpack `Job` will first encode it as JSON and compress the result. The compressed JSON will then be split into chunks of a [fixed size](#determining-chunk-size), each wrapped with a header that tracks its position in the chunk sequence. Each chunk is then encoded as JSON and stored in its own `ConfigMap`. Each of these `ConfigMaps` is labeled with `porcelain.operators.coreos.com/bundle: <bundle-name>` to facilitate later retrieval, and annotated with `porcelain.operators.coreos.com/signature: <signature>` to validate the chunk's integrity. When the content for a particular bundle is retrieved, the `ConfigMaps` with the matching bundle-name label are collected, sorted by the position tracked in each header, reconstituted into a single artifact, and decompressed before being returned for a `/content` request. If signature verification fails for any chunk, a status condition indicating the failure will be added to the associated bundle, which will trigger a new unpack `Job` in a bid to rectify the issue.
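The chunking and reassembly described above might be sketched as follows. This is illustrative only: the header layout, the use of a SHA-256 digest for the signature, and the function names are assumptions, not the proposal's final format.

```python
import gzip
import hashlib
import json

# Hypothetical per-ConfigMap payload limit; the real value comes from
# the chunk-size option described under "Determining Chunk Size".
CHUNK_SIZE = 1024 * 1024

def make_chunks(manifests, chunk_size=CHUNK_SIZE):
    """Encode the bundle as JSON, compress it, and split the result into
    ordered chunks, each carrying a position header and an integrity
    digest (SHA-256 here; the proposal does not fix an algorithm)."""
    blob = gzip.compress(json.dumps(manifests).encode("utf-8"))
    total = (len(blob) + chunk_size - 1) // chunk_size
    return [{
        "header": {"position": i, "total": total},
        "signature": hashlib.sha256(
            blob[i * chunk_size:(i + 1) * chunk_size]).hexdigest(),
        "data": blob[i * chunk_size:(i + 1) * chunk_size],
    } for i in range(total)]

def reassemble(chunks):
    """Sort chunks by position, verify each digest, decompress, and
    decode -- the retrieval path behind a /content request."""
    ordered = sorted(chunks, key=lambda c: c["header"]["position"])
    for c in ordered:
        if hashlib.sha256(c["data"]).hexdigest() != c["signature"]:
            # In the proposal this would surface as a status condition
            # that triggers a fresh unpack Job.
            raise ValueError(
                f"chunk {c['header']['position']} failed integrity check")
    return json.loads(gzip.decompress(b"".join(c["data"] for c in ordered)))
```

Storing the digest per chunk (rather than one digest for the whole artifact) lets retrieval pinpoint which `ConfigMap` was corrupted rather than merely detecting that something was.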
Member

My guess is that the majority of individual manifests (I have no data, just guessing) will fit within a single ConfigMap. Could we reserve the chunking for manifests that don't fit?

Then unpacking a large bundle could look like:

ConfigMap1

  • service
  • serviceaccount
  • rbac

ConfigMap2

  • csv
  • CRD A

ConfigMap3

  • CRD B Part 1

ConfigMap4

  • CRD B Part 2

This seems like it will increase the debuggability of bundle unpacking for the common case

Member Author

@njhale njhale Sep 2, 2020

If the majority of resources will fit into a single ConfigMap, wouldn't we get something fairly reasonable anyway (with better space utilization)?

e.g.

ConfigMap1:

  • service
  • serviceaccount
  • rbac
  • CRD A
  • CSV part 1

ConfigMap2:

  • CSV part 2
    ...

Member

@ecordell ecordell Sep 3, 2020

That looks good to me - I'm not sure I'm seeing the distinction as written, though.

will the configmap look like

```yaml
data:
  {service}{serviceaccount}{rbac}{CRDA}{CSVpart1}
---
data:
  {CSVPart2}
```

or like:

```yaml
data:
  - {service}
  - {serviceaccount}
  - {rbac}
  - {CRDA}
  - {CSVpart1}
---
data:
  - {CSVPart2}
```
Member Author

Ah, I see now. I had envisioned the former, but the latter may be more clear. I do think this will be opaque to most end users, but I see your point.

@jmccormick2001

https://etcd.io/docs/v3.4.0/dev-guide/limit/

that shows a request limit of 1.5Mb with a --max-request-bytes flag to allow for a larger request limit...was wondering about your thoughts of just leveraging this etcd feature or if that is not possible?

@njhale
Member Author

njhale commented Sep 3, 2020

> https://etcd.io/docs/v3.4.0/dev-guide/limit/
>
> that shows a request limit of 1.5Mb with a --max-request-bytes flag to allow for a larger request limit...was wondering about your thoughts of just leveraging this etcd feature or if that is not possible?

@jmccormick2001 that's something we've considered in earlier discussions.

AFAIK, changing the request size limit for a cluster's etcd is a highly privileged (and possibly disruptive?) task -- further, we would need to bump this value whenever we unpack a bundle larger than the current limit. So, while I do think that excessively large bundles will be a rarity, I would prefer a solution that avoids perturbing etcd directly.

Comment thread enhancements/big-bundle-unpacking.md

#### Determining Chunk Size

Chunk size will be determined by an option on the `catalog-operator` command; if unset, the operator probes the cluster for the resource storage limit. One potential probing strategy is to create `ConfigMaps` of various sizes, akin to a "binary search" for the limit.
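The binary-search probe mentioned above might look like the following sketch. `try_create` is a hypothetical callable (ideally a server-side dry-run ConfigMap create, so nothing is persisted); it is not part of any existing OLM API.

```python
def probe_chunk_size(try_create, lo=0, hi=4 * 1024 * 1024):
    """Binary-search for the largest ConfigMap payload the apiserver
    will accept.

    try_create(n) is a hypothetical callable that attempts to create a
    ConfigMap carrying an n-byte payload and returns True on success.
    """
    while lo < hi:
        mid = (lo + hi + 1) // 2  # bias upward so the loop terminates
        if try_create(mid):
            lo = mid        # mid fits; the limit is at least mid
        else:
            hi = mid - 1    # mid is too big; the limit is below mid
    return lo
```

A probe like this would presumably run once at operator startup and be cached, since each iteration costs a round trip to the apiserver.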
Member

I would leave the size probing as future work - there doesn't seem to be an immediate need, and it may be worth pursuing more direct options (i.e. should this value be exposed as a property of the cluster that anyone can query from kube-apiserver?)

tiraboschi added a commit to tiraboschi/hyperconverged-cluster-operator that referenced this pull request Jun 14, 2021
Remove descriptions from unstored versions of CRDs
to keep the bundle size < 1MiB.

Until operator-framework/enhancements#40
got implemented, a bundle should not exceed 1MiB
because its internally (in the opm tool) managed
with a configmap.

Signed-off-by: Simone Tiraboschi <stirabos@redhat.com>
kubevirt-bot pushed a commit to kubevirt/hyperconverged-cluster-operator that referenced this pull request Jun 15, 2021
```yaml
message: "no manifests detected in bundle"
```

Most importantly, a `Bundle` will expose a `/content` subresource which -- upon a successful unpack -- will return a JSON-encoded representation of that bundle.

Would it make sense to have the /content subresource accept a filename parameter and return only that specific manifest? If I understand correctly, the catalog-operator consumes manifests step by step, so this could avoid some wasteful traffic between the catalog-operator and the aggregated API server serving that subresource.

@njhale
Member Author

njhale commented Aug 12, 2021

I think @zcahana's Gzip PR scratches the immediate itch (thanks!).

I'm going to close this out in favor of work in rukpak.
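For context on why the Gzip approach relieved the immediate pressure: large CRD manifests are dominated by repetitive description strings, which compress extremely well. A toy illustration with synthetic data (not taken from this PR or any real bundle):

```python
import gzip
import json

# Synthetic manifest mimicking a CRD whose bulk is repetitive field
# descriptions -- the kind of content trimmed in the commits above.
manifest = json.dumps({
    "fields": [
        {"name": f"field{i}",
         "description": "A human-readable description of this field."}
        for i in range(500)
    ]
}).encode("utf-8")

compressed = gzip.compress(manifest)
# Repetitive text like this typically shrinks by an order of magnitude,
# which is why compressing before storage keeps bundles under the
# effective ConfigMap size limit without dropping content.
```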

@njhale njhale closed this Aug 12, 2021
orenc1 pushed a commit to orenc1/hyperconverged-cluster-operator that referenced this pull request Aug 24, 2021
orenc1 pushed a commit to orenc1/hyperconverged-cluster-operator that referenced this pull request Aug 29, 2021