config: minor cleanup #608

vbatts · 2017-03-13T14:05:26Z

does that empty bracket work?

#608 (comment)

stevvooe · 2017-03-14T00:11:00Z

Is this actually a must? I don't think we need to require that implementation require tar-split or that layers be packed reproducibly. Neither are necessary for a working container runtime.

This is more of an implementation note, maybe.

I suppose (?) as written it's not supposed to be an imperative must (i.e. "to avoid changing the headers, it is necessary for layers to be packed reproducibly" rather than "implementers must pack reproducibly") but I think we should probably standardise on RFC2119 language to be clearer. #611

-Original file line number
+Diff line change
@@ Expand Up @@
     ### Layer DiffID
     A layer DiffID is the digest over the layer's uncompressed tar archive and serialized in the descriptor digest format, e.g., `sha256:a9561eb1b190625c9adb5a9513e72c4dedafc1cb2d4c5236c9a6957ec7dfd5a9`.
-    Layers must be packed and unpacked reproducibly to avoid changing the layer DiffID, for example by using tar-split to save the tar headers.
+    Layers must be packed and unpacked reproducibly to avoid changing the layer DiffID, for example by using [tar-split][] to save the tar headers.
     NOTE: Do not confuse DiffIDs with [layer digests](manifest.md#image-manifest-property-descriptions), often referenced in the manifest, which are digests over compressed or uncompressed content.
@@ Expand All @@
     For example, given base layer `A` and a changeset `B`, we refer to the result of applying `B` to `A` as `A|B`.
     Above, we define the `ChainID` for a single layer (`L₀`) as equivalent to the `DiffID` for that layer.
-    Otherwise, the `ChainID` for `L₀|...|Lₙ₋₁|Lₙ` is defined as recursion `Digest(ChainID(L₀|...|Lₙ₋₁) + " " + DiffID(Lₙ))`.
+    Otherwise, the `ChainID` for a set of applied layers (`L₀|...|Lₙ₋₁|Lₙ`) is defined as the recursion `Digest(ChainID(L₀|...|Lₙ₋₁) + " " + DiffID(Lₙ))`.
     #### Explanation
     Let's say we have layers A, B, C, ordered from bottom to top, where A is the base and C is the top.
     Defining `|` as a binary application operator, the root filesystem may be `A|B|C`.
     While it is implied that `C` is only useful when applied to `A|B`, the identifier `C` is insufficient to identify this result, as we'd have the equality `C = A|B|C`, which isn't true.
-    The main issue is when we have two definitions of `C`, `C = C` and `C = A|B|C`. If this is true (with some handwaving), `C = x|C` where `x = any application` must be true.
+    The main issue is when we have two definitions of `C`, `C = C` and `C = A|B|C`.
+    If this is true (with some handwaving), `C = x|C` where `x = any application` must be true.
     This means that if an attacker can define `x`, relying on `C` provides no guarantee that the layers were applied in any order.
     The `ChainID` addresses this problem by being defined as a compound hash.
-    __We differentiate the changeset `C`, from the order dependent application `A|B|C` by saying that the resulting rootfs is identified by ChainID(A|B|C), which can be calculated by `ImageConfig.rootfs`.__
+    __We differentiate the changeset `C`, from the order-dependent application `A|B|C` by saying that the resulting rootfs is identified by ChainID(A|B|C), which can be calculated by `ImageConfig.rootfs`.__
     Let's expand the definition of `ChainID(A|B|C)` to explore its internal structure:
@@ Expand All / @@ -72,7 +73,7 @@ ChainID(A|B) = Digest(ChainID(A) + " " + DiffID(B)) @@
     ChainID(A|B|C) = Digest(ChainID(A|B) + " " + DiffID(C))
     ```
-    We can replace the each definition and reduce to a single equality:
+    We can replace each definition and reduce to a single equality:
     ```
     ChainID(A|B|C) = Digest(Digest(DiffID(A) + " " + DiffID(B)) + " " + DiffID(C))
@@ Expand All @@
     Each image's ID is given by the SHA256 hash of its [configuration JSON](#image-json).
     It is represented as a hexadecimal encoding of 256 bits, e.g., `sha256:a9561eb1b190625c9adb5a9513e72c4dedafc1cb2d4c5236c9a6957ec7dfd5a9`.
-    Since the [configuration JSON](#image-json) that gets hashed references hashes of each layer in the image, this formulation of the ImageID makes images content-addresable.
+    Since the [configuration JSON](#image-json) that gets hashed references hashes of each layer in the image, this formulation of the ImageID makes images content-addressable.
     ## Properties
@@ Expand Down Expand Up / @@ -276,3 +277,4 @@ Here is an example image configuration JSON document: @@
     [rfc3339-s5.6]: https://tools.ietf.org/html/rfc3339#section-5.6
     [runtime-platform]: https://github.com/opencontainers/runtime-spec/blob/v1.0.0-rc3/config.md#platform
+    [tar-split]: https://github.com/vbatts/tar-split

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

config: minor cleanup #608

Uh oh!

Diff view

Diff view

There are no files selected for viewing

vbatts Mar 13, 2017

Uh oh!

vbatts Mar 13, 2017

Uh oh!

jonboulle Mar 13, 2017

Uh oh!

stevvooe Mar 14, 2017

Uh oh!

jonboulle Mar 14, 2017

Uh oh!

config: minor cleanup #608

Uh oh!

config: minor cleanup #608

Uh oh!

Uh oh!

Diff view

Diff view

There are no files selected for viewing

vbatts Mar 13, 2017

Choose a reason for hiding this comment

Uh oh!

vbatts Mar 13, 2017

Choose a reason for hiding this comment

Uh oh!

jonboulle Mar 13, 2017

Choose a reason for hiding this comment

Uh oh!

stevvooe Mar 14, 2017

Choose a reason for hiding this comment

Uh oh!

jonboulle Mar 14, 2017

Choose a reason for hiding this comment

Uh oh!