Harden elasticsearch chart for Kube 1.5 #1062

icereval · 2017-05-11T23:29:09Z

Further extends upon PR #890

Added configmap to explicitly provide cluster configurations and scripts
Replace depreciating ES_HEAP_SIZE with ES_JAVA_OPTS to position for ES v5 support
Removed alpha storage class operators
Removed catastrophic liveness probe checking entire clusters health
Readiness probe now inspects local node health
Added termination grace period (defaults to 60m) to allow pre-stop-script.sh time to gracefully migrate shards
Added init container to configure vm.max_map_count
Updated elasticsearch.yaml:
- Added PROCESSOR configuration to prevent large cluster garbage collection issues leading to node eviction
- Added configurable gateway defaults to help avoid a split brain, requiring two masters online and in consensus before recovery can continue
Updated pre-stop-script.sh:
- Check v1beta1 statefulset endpoint
- Evalute .spec.replicas for statefulset desired size
- Clear _cluster/settings ip exclusion prior to shutdown to avoid a possible (random) ip match scenario on expansion of the clsuter
Data nodes now use default storage class if one is not specified
Apply Helm best practices to prep for stable

k8s-ci-robot · 2017-05-11T23:29:16Z

Hi @icereval. Thanks for your PR.

I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with @k8s-bot ok to test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

unguiculus · 2017-05-12T06:45:37Z

stable/elasticsearch/values.yaml

-DataStorageClass: "anything"
-DataStorageClassVersion: "alpha"
+# DataStorageClass: "ssd"
+DataTerminationGracePeriodSeconds: 900


I would prefer to not move the chart to stable as long as config values start with an uppercase letter. In this case, I would also switch to nesting first. I think this make sense here, e. g.:

client: replicas: 2 port: 2379 heapSize: "128m" master: replicas: 2 heapSize: "128m" data: replicas: 3 storage: "30Gi"

See https://github.com/kubernetes/helm/blob/master/docs/chart_best_practices/values.md

Generally, I think a move to stable should be a separate PR so it is explicit.

Regarding resources, it's becoming increasingly popular to specify resources like so (without specifying defaults): https://github.com/kubernetes/charts/blob/master/stable/nginx-ingress/templates/controller-deployment.yaml#L93

I completely agree and all are very good points. Would you mind informing the base PR #890, if we can get this merged, even if still in incubator status one of us can follow up with a new PR and the final changes needed to make this stable.

viglesiasce · 2017-06-18T09:42:28Z

@simonswine @unguiculus any update on this review? Were the latest changes enough to merge?

unguiculus · 2017-06-18T12:30:02Z

As suggested in #890, a move to stable should only happen after applying best practices, e. g. regarding values. See https://github.com/kubernetes/helm/blob/master/docs/chart_best_practices/values.md.

* Add environment variable KUBERNETES_MASTER, resolves issue documented here: fabric8io/fabric8#6229 (comment) * Rename PetSet to StatefulSet, rename template file * Add initialDelay and increase timesouts to all liveness and readiness checks. This was the only way I could get it to deploy reliably in my environment. * Update to a newer image version

* Added configmap to explicitly provide cluster configurations and scripts * Replace depreciating `ES_HEAP_SIZE` with `ES_JAVA_OPTS` to position for ES v5 support * Removed alpha storage class operators * Removed catastrophic liveness probe checking entire clusters health * Readiness probe now inspects local node health * Added termination grace period (defaults to 15m) to allow pre-stop-script.sh time to gracefully migrate shards * Added init container to configure `vm.max_map_count` * Updated elasticsearch.yaml: * Added `PROCESSOR` configuration to prevent large cluster garbage collection issues leading to node eviction * Added configurable gateway defaults to help avoid a split brain, requiring two masters online and in consensus before recovery can continue * Updated pre-stop-script.sh: * Check `v1beta1` `statefulset` endpoint * Evalute `.spec.replicas` for statefulset desired size * Clear `_cluster/settings` ip exclusion prior to shutdown to avoid a possible (random) ip match scenario on expansion of the clsuter * Data nodes now use default storage class if once is not specified

icereval · 2017-06-19T13:49:01Z

@unguiculus & @prydonius, rebase complete and a first pass at helm best practices applied to the incubator chart.

unguiculus · 2017-06-20T15:39:30Z

Excellent. This goes in the right direction. Now, please have a look at app labels, which should be {{ template "name" . }}. Note that you will then have to add the release label to the selector in services. nginx-ingress an excellent example:

https://github.com/kubernetes/charts/blob/master/stable/nginx-ingress/templates/controller-deployment.yaml#L6
https://github.com/kubernetes/charts/blob/master/stable/nginx-ingress/templates/controller-service.yaml#L56-L58

unguiculus · 2017-07-02T18:08:36Z

@icereval Are you working on an update?

icereval · 2017-07-03T02:11:17Z

@unguiculus, yep, I'll have my updates ready soon

icereval · 2017-07-03T03:37:27Z

@unguiculus, best practice changes pushed up

unguiculus · 2017-07-04T19:15:15Z

Would you mind adding a NOTES.txt? Otherwise it looks nice but I have yet to review more thoroughly. I installed it on GKE and everything came up nicely.

unguiculus · 2017-07-04T19:15:33Z

@k8s-bot ok to test

icereval · 2017-07-04T20:44:54Z

Added NOTES.txt and client.serviceType based on review of the concourse stable chart.

* Update elasticsearch chart to work with Kube 1.5 * Add environment variable KUBERNETES_MASTER, resolves issue documented here: fabric8io/fabric8#6229 (comment) * Rename PetSet to StatefulSet, rename template file * Add initialDelay and increase timesouts to all liveness and readiness checks. This was the only way I could get it to deploy reliably in my environment. * Update to a newer image version * Harden aspects of the elasticsearch chart * Added configmap to explicitly provide cluster configurations and scripts * Replace depreciating `ES_HEAP_SIZE` with `ES_JAVA_OPTS` to position for ES v5 support * Removed alpha storage class operators * Removed catastrophic liveness probe checking entire clusters health * Readiness probe now inspects local node health * Added termination grace period (defaults to 15m) to allow pre-stop-script.sh time to gracefully migrate shards * Added init container to configure `vm.max_map_count` * Updated elasticsearch.yaml: * Added `PROCESSOR` configuration to prevent large cluster garbage collection issues leading to node eviction * Added configurable gateway defaults to help avoid a split brain, requiring two masters online and in consensus before recovery can continue * Updated pre-stop-script.sh: * Check `v1beta1` `statefulset` endpoint * Evalute `.spec.replicas` for statefulset desired size * Clear `_cluster/settings` ip exclusion prior to shutdown to avoid a possible (random) ip match scenario on expansion of the clsuter * Data nodes now use default storage class if once is not specified * Apply best practices * Add Notes for client service types, and warnings

k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label May 11, 2017

k8s-ci-robot added the needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. label May 11, 2017

unguiculus suggested changes May 12, 2017

View reviewed changes

icereval force-pushed the feature/elasticsearch branch 2 times, most recently from fd939ad to 918f888 Compare May 13, 2017 01:56

unguiculus mentioned this pull request May 14, 2017

Update elasticsearch chart to work with Kube 1.5 #890

Closed

unguiculus self-assigned this May 24, 2017

unguiculus requested a review from simonswine May 24, 2017 18:52

unguiculus added the awaiting review label May 30, 2017

icereval force-pushed the feature/elasticsearch branch from 918f888 to 38dc3f0 Compare June 1, 2017 21:52

mkrakowitzer and others added 2 commits June 19, 2017 06:40

icereval force-pushed the feature/elasticsearch branch from 38dc3f0 to b62ccae Compare June 19, 2017 10:59

icereval force-pushed the feature/elasticsearch branch from c1e43a6 to c425ab8 Compare June 19, 2017 13:50

Apply best practices

e1ed701

icereval force-pushed the feature/elasticsearch branch from c425ab8 to e1ed701 Compare July 3, 2017 03:35

k8s-ci-robot removed the needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. label Jul 4, 2017

Add Notes for client service types, and warnings

5b6139f

icereval force-pushed the feature/elasticsearch branch from ea779d1 to 5b6139f Compare July 4, 2017 20:49

unguiculus added code reviewed and removed awaiting review labels Jul 5, 2017

unguiculus approved these changes Jul 5, 2017

View reviewed changes

unguiculus added UX reviewed lgtm Indicates that a PR is ready to be merged. labels Jul 5, 2017

unguiculus merged commit 09892a3 into helm:master Jul 5, 2017

unguiculus mentioned this pull request Jul 5, 2017

Use consistent whitespace in template placeholders #1437

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Harden elasticsearch chart for Kube 1.5 #1062

Harden elasticsearch chart for Kube 1.5 #1062

icereval commented May 11, 2017 •

edited

Loading

k8s-ci-robot commented May 11, 2017

unguiculus May 12, 2017

icereval May 14, 2017 •

edited

Loading

viglesiasce commented Jun 18, 2017

unguiculus commented Jun 18, 2017

icereval commented Jun 19, 2017

unguiculus commented Jun 20, 2017

unguiculus commented Jul 2, 2017

icereval commented Jul 3, 2017

icereval commented Jul 3, 2017 •

edited

Loading

unguiculus commented Jul 4, 2017

unguiculus commented Jul 4, 2017

icereval commented Jul 4, 2017

Harden elasticsearch chart for Kube 1.5 #1062

Harden elasticsearch chart for Kube 1.5 #1062

Conversation

icereval commented May 11, 2017 • edited Loading

k8s-ci-robot commented May 11, 2017

unguiculus May 12, 2017

Choose a reason for hiding this comment

icereval May 14, 2017 • edited Loading

Choose a reason for hiding this comment

viglesiasce commented Jun 18, 2017

unguiculus commented Jun 18, 2017

icereval commented Jun 19, 2017

unguiculus commented Jun 20, 2017

unguiculus commented Jul 2, 2017

icereval commented Jul 3, 2017

icereval commented Jul 3, 2017 • edited Loading

unguiculus commented Jul 4, 2017

unguiculus commented Jul 4, 2017

icereval commented Jul 4, 2017

icereval commented May 11, 2017 •

edited

Loading

icereval May 14, 2017 •

edited

Loading

icereval commented Jul 3, 2017 •

edited

Loading