V1.14.5 release #999

jhaynes · 2017-09-30T00:27:15Z

1.14.5

Enhancement - Retry failed container image pull operations #975
Enhancement - Set read and write timeouts for websocket connections #993
Enhancement - Add support for the SumoLogic Docker log driver plugin #992
Bug - Fixed a memory leak issue when submitting the task state change #967
Bug - Fixed a race condition where a container can be created twice when agent restarts. #939
Bug - Fixed an issue where microsoft/windowsservercore:latest was not pulled on Windows under certain conditions. #990
Bug - Fixed an issue where task IAM role credentials could be logged to disk. #998

1. api: Added a new container state called `ContainerResourcesProvisioned`, which indicates that a container has completed provisioning all of its resources. Non-internal containers transition to this state without doing any additional work. However, containers that the ECS Agent adds to a task may need to perform additional actions. For example, the "pause" container is provisioned by invoking CNI plugins
2. api: Tasks do not transition into "RUNNING" unless all the containers in the task have transitioned into the "ResourcesProvisioned" state. The `TaskStatus()` method in the `ContainerStatus` type has been updated for this
3. api: Similarly, the `ContainerStatus()` method in the `TaskStatus` type, which maps task transitions to container transitions, has been updated to reflect this change
4. api, engine: All references to `api.ContainerRunning` have been replaced with the new `GetContainerSteadyStateStatus()` method, which returns `ContainerResourcesProvisioned` instead of `ContainerRunning`
5. engine/dependencygraph: The `seelog` logger has replaced the module logger
6. engine/dependencygraph: `onRunIsResolved` has been renamed to `onProvisionResourcesIsResolved` to better reflect what the method is supposed to do
7. engine: `DockerTaskEngine.transitionFunctionMap()` has been updated with the function pointer to the new `DockerTaskEngine.provisionContainerResources()` method, which performs the actions required to update the state

Also modify `task.dockerHostConfig()` to set the network mode field in container's host config json to `container:<pause-container>` for non-pause containers in the task

* Instead of assuming that ContainerRunning is the steady state for all containers, add the notion of each container being able to define its own steady state.
* Rename task.RunDependencies to task.SteadyStateDependencies
* Add unit tests for containerstatus and task status updates

This change adds the ability for each container to specify its own steady state. Doing so enables the Task status of 'RUNNING' to be reached when all of the containers in the task have reached their respective steady states. The 'pause' container can specify its steady state as 'RESOURCES_PROVISION', whereas other normal containers can specify their steady states as 'RUNNING'. This change also adds a new field in the 'api.Container' type, called 'InternalContainerType'. We support two types of internal containers: one for creating empty container volumes, another for provisioning the network namespace (via the pause container). This gets us away from using container names to determine the type of an internal container.
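The per-container steady state described above can be illustrated with a minimal sketch. The `Container` struct, its fields, and the `taskIsRunning` helper are hypothetical simplifications for illustration and are not the agent's actual types; only the state names come from the description.

```go
package main

import "fmt"

// ContainerStatus is a simplified stand-in for the agent's container states.
type ContainerStatus int

const (
	ContainerStatusNone ContainerStatus = iota
	ContainerCreated
	ContainerRunning
	ContainerResourcesProvisioned
	ContainerStopped
)

// Container is a hypothetical, reduced container model: each container
// carries the state it considers "steady".
type Container struct {
	Name        string
	KnownStatus ContainerStatus
	SteadyState ContainerStatus
}

// taskIsRunning reports whether every container in the task currently sits
// in its own steady state. An empty task is never considered running.
func taskIsRunning(containers []Container) bool {
	if len(containers) == 0 {
		return false
	}
	for _, c := range containers {
		if c.KnownStatus != c.SteadyState {
			return false
		}
	}
	return true
}

func main() {
	task := []Container{
		{Name: "pause", KnownStatus: ContainerResourcesProvisioned, SteadyState: ContainerResourcesProvisioned},
		{Name: "app", KnownStatus: ContainerCreated, SteadyState: ContainerRunning},
	}
	fmt.Println(taskIsRunning(task)) // app has not reached its steady state yet

	task[1].KnownStatus = ContainerRunning
	fmt.Println(taskIsRunning(task)) // both containers are in their steady states
}
```

In this sketch the task-level check never hard-codes `ContainerRunning`; it only compares each container against the steady state that container declares.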

The IsInternal field in api.Container is being refactored as an enum instead (as 'api.Container.Type ContainerType'), with values that indicate what type of container it is. It can be one of 'ContainerNormal' (indicates that this is not an internal container, but something that was sent as part of the payload from the communication service), 'ContainerEmptyHostVolume' (indicates that this is an internal container created for attaching ephemeral empty host volumes), or 'ContainerCNIPause' (indicates that this is an internal container created for attaching ENIs or some other custom network configuration). The 'api/json.go' file has been deleted and its contents moved to per-type files, one for each type that overrides the marshal/unmarshal methods. The version of the ECS Data file has been incremented as well.

Signed-off-by: Vinothkumar Siddharth <[email protected]>

* Return watcher, error from agent initialization
* Amend log messages to provide improved context

Signed-off-by: Vinothkumar Siddharth <[email protected]>

Signed-off-by: Vinothkumar Siddharth <[email protected]>

… terminal containers
* bugfix: The container transition for RUNNING -> STOPPED was deemed non-actionable in this method. It has been modified so that if the container's known status is RUNNING or RESOURCES_PROVISIONED, we return true for the action-needed flag
* The managedTask.handleStoppedToRunningContainerTransition method has been refactored to get rid of chained if conditions so that it's easier to read
* The DockerTaskEngine's transition function map has been refactored as a field in the DockerTaskEngine. This helps in testing as we can override this map in our tests. Ideally we would have used an interface method. But, that's a bigger refactor for another day
* Added unit tests for task manager for steady state == RESOURCES_PROVISIONED

Here is a link to minor modifications to the source: kubernetes/kubernetes#43578

Signed-off-by: Vinothkumar Siddharth <[email protected]>

Transition dependencies will deprecate and replace the SteadyStateDependencies list that exists in api.Container. Transition dependencies could also replace implicit link- and volume-dependencies in the future, but that change is not yet planned.

Code handling AdditionalLocalRoutes was inadvertently removed in 10fb083. This commit adds it back, and adjusts the TestSetupNS unit test to explicitly check for it.

…or adding and dropping Linux capabilities

The changes include the following:
- Model changes in ContainerDefinition of the form: linuxParameters: { capabilities: { add: [""], drop: [""] } }
- Functional Tests that verify if the specified capability has been added to and dropped from the task's container
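A sketch of how the `linuxParameters` shape above could map onto Go structs is shown below. The struct names, the `name` field, and the capability values (`SYS_ADMIN`, `NET_RAW`) are illustrative assumptions; only the `linuxParameters`, `capabilities`, `add`, and `drop` keys come from the commit message.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// KernelCapabilities lists capabilities to add to or drop from a container.
type KernelCapabilities struct {
	Add  []string `json:"add"`
	Drop []string `json:"drop"`
}

// LinuxParameters groups Linux-specific container options.
type LinuxParameters struct {
	Capabilities KernelCapabilities `json:"capabilities"`
}

// ContainerDefinition is a hypothetical, reduced container definition.
type ContainerDefinition struct {
	Name            string          `json:"name"`
	LinuxParameters LinuxParameters `json:"linuxParameters"`
}

// parseDefinition decodes a JSON container definition.
func parseDefinition(raw string) (ContainerDefinition, error) {
	var def ContainerDefinition
	err := json.Unmarshal([]byte(raw), &def)
	return def, err
}

func main() {
	raw := `{"name":"app","linuxParameters":{"capabilities":{"add":["SYS_ADMIN"],"drop":["NET_RAW"]}}}`
	def, err := parseDefinition(raw)
	if err != nil {
		panic(err)
	}
	caps := def.LinuxParameters.Capabilities
	fmt.Println("add:", caps.Add, "drop:", caps.Drop)
}
```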

ecs_client/model, functional_tests: updated model, functional tests for adding and dropping Linux capabilities

The wc command prefixes spaces to the output, which corrupts the GIT_PORCELAIN variable, thus failing the build. This change removes spaces from the output.

This commit aims to improve websocket connection management with the following changes:
1. Set read and write deadlines for the websocket ReadMessage and WriteMessage operations. This ensures that these methods do not hang, and instead fail with an io timeout if there are issues with the connection
2. Reduce the scope of the lock in the Connect() method. The lock was being held for the length of the Connect() method, which meant that it wouldn't be relinquished if there was any delay in establishing the connection. The scope of the lock has now been reduced to just accessing the cs.conn variable
3. Start the ACS heartbeat timer after the connection has been established. The timer was being started before the call to Connect, which meant that the connection could be prematurely terminated for being idle if there was a delay in establishing the connection

These changes should improve the disconnection behavior of the websocket connection, which should help with scenarios where the Agent never reconnects to ACS because it's forever waiting in the Disconnect() method to acquire the lock (aws#985)

Increase the websocket read and write timeouts as per review comments

The messages that come over the websockets can potentially contain sensitive information that shouldn't be put in logs. Separately, if reading the message results in an error, the content of the message is irrelevant and unreliable.

aaithal and others added 30 commits May 2, 2017 13:05

api: Add the pause container to the task

1c26a4a

Also modify `task.dockerHostConfig()` to set the network mode field in container's host config json to `container:<pause-container>` for non-pause containers in the task

Makefile changes for packaging Pause Container

0b0801e

Signed-off-by: Vinothkumar Siddharth <[email protected]>

Vendor netlink

c44c013

Signed-off-by: Vinothkumar Siddharth <[email protected]>

Vendor errors

b357245

Signed-off-by: Vinothkumar Siddharth <[email protected]>

Vendor Udev

4803fae

Signed-off-by: Vinothkumar Siddharth <[email protected]>

Adding NetlinkWrapper

a2a116c

Signed-off-by: Vinothkumar Siddharth <[email protected]>

Adding UDevWrapper

f9445d6

Signed-off-by: Vinothkumar Siddharth <[email protected]>

Adding Initial ENI Watcher

4c9fef8

Signed-off-by: Vinothkumar Siddharth <[email protected]>

ENI-Watcher: Address review comments

13d8fc3

* Return watcher, error from agent initialization
* Amend log messages to provide improved context

Signed-off-by: Vinothkumar Siddharth <[email protected]>

Make build platform agnostic

823925b

Signed-off-by: Vinothkumar Siddharth <[email protected]>

Address CR Comments

bf4a426

Signed-off-by: Vinothkumar Siddharth <[email protected]>

Fix appveyor build

0031f07

Signed-off-by: Vinothkumar Siddharth <[email protected]>

Address CR Comments

31e6ad6

Signed-off-by: Vinothkumar Siddharth <[email protected]>

Adding debug log for udev events in eni eventHandler

7a75405

Signed-off-by: Vinothkumar Siddharth <[email protected]>

Merge remote-tracking branch 'origin/dev' into mergeDevToEni05112017

57de4f4

Makefile: added a 'test-silent' target for running unit tests without…

2dc34c4

… the verbose flag

removed spurious temp file from previous commit

a0d374a

Update pause.c following upstream changes

a0d6369

Here is a link to minor modifications to the source: kubernetes/kubernetes#43578

Signed-off-by: Vinothkumar Siddharth <[email protected]>

Merge branch 'pause-container' into enis

ecaf20d

Merge branch 'pause-release' into enis

685bfa5

Add capability for task networking

f18e94a

Add ecs_cni to invoke the cni plugin

0fb8df9

Added environment variable to config cni plugin path

257fb22

Added vendor for containernetworking/cni

0f74ce4

Fix the unit test for task-network capabilities

d9e0764

samuelkarp and others added 26 commits September 26, 2017 11:34

dependencygraph: Verify steady state can resolve

1ffe4d1

api: Use transition dependencies for empty vols

df2a441

api: Use transition dependencies for pause

80b8315

engine: Convert tests to transition dependencies

9026239

engine: Add back old test, code is still there

b9bb089

engine: Fix ContainerName field

dbc7bbc

statemanager: Record dependency changes for v6

3916dc4

api: move pause dependency to resource provisioned

1b7ef88

Merge remote-tracking branch 'origin/pr/989' into dev

afd0740

ecscni: Re-implement removed AdditionalLocalRoutes

2589268

Code handling AdditionalLocalRoutes was inadvertently removed in 10fb083. This commit adds it back, and adjusts the TestSetupNS unit test to explicitly check for it.

Merge remote-tracking branch 'origin/pr/991' into dev

d09c3a2

Add support for SumoLogic logging driver

6861b84

functional_tests: Combining cap-add and cap-drop tests into a single …

6e7ce9b

…test

ecs_client/model: Updated API and Documentation changes

d568430

ecs_client/model, functional_tests: Update to latest ECS API and Docu…

152a49c

…mentation, change agent version required

Merge pull request aws#956 from sharanyad/capadddrop

640d704

ecs_client/model, functional_tests: updated model, functional tests for adding and dropping Linux capabilities

Makefile: fix cni-plugins to build on OSX

ff6f81b

The wc command prefixes spaces to the output, which corrupts the GIT_PORCELAIN variable, thus failing the build. This change removes spaces from the output.

CHANGELOG entry for 8703f78 (wsclient: add read write deadlines)

ec4d577

acs,tcs handlers: increase websocket rw timeout

b3db90b

Increase the websocket read and write timeouts as per review comments

Merge remote-tracking branch 'origin/pr/992' into sumologic

908bf34

readme: Add line about SumoLogic plugin

62f3de1

Don't log websocket messages if there are errors.

ae897cf

The messages that come over the websockets can potentially contain sensitive information that shouldn't be put in logs. Separately, if reading the message results in an error, the content of the message is irrelevant and unreliable.

update agent version to 1.14.5

0dcd02c

aaithal approved these changes Sep 30, 2017

aaithal mentioned this pull request Oct 2, 2017

Do not explicitly set host IP to 0.0.0.0 for Linux, defer to Docker #972

Merged

8 tasks

richardpen approved these changes Oct 3, 2017

petderek merged commit 0dcd02c into aws:master Oct 3, 2017
