GCE-support by chengchengmu · Pull Request #658 · openshift/openshift-ansible

chengchengmu · 2015-10-06T09:44:38Z

Hello,

The file that was merged by mistake, playbooks/openstack/openshift-cluster/launch.yml, has been reverted to its previous, correct version.

The other file that affects OpenStack is nova.py. That modification is a fix.
An OpenStack deployment can now be launched from a directory other than openshift-ansible.

This code successfully deploys clusters on OpenStack and seems to work well with or without infra nodes.

Thank you

openshift-bot · 2015-10-06T09:44:41Z

Can one of the admins verify this patch?

chengchengmu · 2015-10-06T15:06:06Z

The only difference between my code and the previous PR #641 is that playbooks/openstack/openshift-cluster/launch.yml is now correct.

twiest · 2015-10-07T13:23:18Z

@wshearn can you please make sure this PR works for you. You said the last one (PR #641) broke you guys.

twiest · 2015-10-07T13:26:21Z

ok to test

twiest · 2015-10-07T13:28:12Z

@sdodson are you ok with this version?

sdodson · 2015-10-07T13:35:33Z

inventory/openstack/hosts/nova.py

Even though this change makes sense, this seems unrelated to the purpose of this PR, why's this here?

Actually, this doesn't make sense to me. The way it's written it would use nova.ini in the same location as nova.py (ie in the git checkout) which I don't suspect anyone would modify in place. I think it'd be more likely that they'd expect it to look for it in cwd, do you agree?

I agree that this change is not related to GCE at all. However, the change makes sense:

nova.ini is located in the same directory as nova.py.
getcwd returns the directory from which the nova.py script is executed, which results in an error if the script is run from a different directory. This change fixes that issue.

On AWS, GCE, and libvirt we take the .ini file from the same location as the .py file.
So I think using getcwd for OpenStack was just a mistake.

Anyway, we can take this change out and make another PR.
It's up to you to judge whether that's worth it for a minor change.

I'm fine with leaving it in.
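
For readers following the nova.ini discussion, here is a minimal sketch of the two lookup strategies being compared. It is not the actual nova.py code; the function names and the example path are hypothetical and only illustrate why a cwd-based lookup depends on where the command is run from.

```python
import os


def ini_next_to_script(script_path, ini_name="nova.ini"):
    """Look for the .ini file in the directory that contains the script."""
    script_dir = os.path.dirname(os.path.realpath(script_path))
    return os.path.join(script_dir, ini_name)


def ini_in_cwd(ini_name="nova.ini"):
    """Look for the .ini file in the directory the command was run from."""
    return os.path.join(os.getcwd(), ini_name)


if __name__ == "__main__":
    # Hypothetical checkout location, used only for illustration.
    script = "/opt/openshift-ansible/inventory/openstack/hosts/nova.py"
    # The two results only agree when the command is run from the script's directory.
    print("next to script:", ini_next_to_script(script))
    print("from cwd:      ", ini_in_cwd())
```

In a real inventory script, `__file__` would normally be passed as `script_path`.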

sdodson · 2015-10-07T14:18:49Z

@twiest LGTM

twiest · 2015-10-07T15:14:19Z

@sdodson thanks

Just waiting on a +1 from @wshearn. Once we get that, I think we'll be ready to merge.

chengchengmu · 2015-10-07T15:30:26Z

@sdodson @twiest Thank you

Just waiting for green light of @wshearn

wshearn · 2015-10-08T18:05:46Z

👎 Still breaks node labels

localhost                  : ok=79   changed=9    unreachable=0    failed=0   
whearntest-master-0a2ff    : ok=166  changed=57   unreachable=0    failed=0   
whearntest-node-compute-69228 : ok=61   changed=25   unreachable=0    failed=0   
whearntest-node-compute-a2a1f : ok=61   changed=25   unreachable=0    failed=0   
whearntest-node-compute-f6333 : ok=61   changed=25   unreachable=0    failed=0   
whearntest-node-infra-8c6d1 : ok=61   changed=25   unreachable=0    failed=0   
whearntest-node-infra-97b9f : ok=61   changed=25   unreachable=0    failed=0   

[02:01 PM] [gce-support %]
[0]Pluto:~/Work/ansible/test_labels $ ssh root@52.23.238.XXX
Warning: Permanently added '52.23.238.XXX' (ECDSA) to the list of known hosts.
Last login: Thu Oct  8 14:01:52 2015 from XXXZZZWWWYYY
[root@172 ~]# oc get nodes
NAME          LABELS                               STATUS                     AGE
172.20.3.23   kubernetes.io/hostname=172.20.3.23   Ready                      3m
172.20.3.24   kubernetes.io/hostname=172.20.3.24   Ready                      3m
172.20.3.58   kubernetes.io/hostname=172.20.3.58   Ready                      3m
172.20.3.59   kubernetes.io/hostname=172.20.3.59   Ready                      3m
172.20.3.60   kubernetes.io/hostname=172.20.3.60   Ready                      3m
172.20.3.76   kubernetes.io/hostname=172.20.3.76   Ready,SchedulingDisabled   2m

chengchengmu · 2015-10-08T18:11:57Z

@wshearn what do we expect for the node labels?

When the variable openshift_node_labels is not defined, should it default to none instead of an empty value?

sdodson · 2015-10-08T18:14:35Z

roles/openshift_node/tasks/main.yml

As @wshearn noted this breaks node labels, should be

labels: "{{ lookup('oo_option', 'openshift_node_labels') | default( openshift_node_labels | default(none) ) }}"

Never mind, that doesn't work either.

labels: "{{ lookup('oo_option', 'openshift_node_labels') | default( openshift_node_labels | default(none) , true) }}"
With @twiest's advice, what do you think about this?

@menren Looks good to me, but will need to be tested by @wshearn to verify it works for him.

Please update this PR with the change and when you're done, ask @wshearn to test it.

Thanks!

wshearn · 2015-10-08T18:19:08Z

This is a recently built cluster:

[root@172 ~]# oc get nodes
NAME           LABELS                                                              STATUS                     AGE
172.31.25.70   kubernetes.io/hostname=172.31.25.70,region=us-east-1,type=master    Ready,SchedulingDisabled   2d
172.31.30.88   kubernetes.io/hostname=172.31.30.88,region=us-east-1,type=compute   Ready                      2d
172.31.30.89   kubernetes.io/hostname=172.31.30.89,region=us-east-1,type=compute   Ready                      2d
172.31.30.90   kubernetes.io/hostname=172.31.30.90,region=us-east-1,type=compute   Ready                      2d
172.31.30.91   kubernetes.io/hostname=172.31.30.91,region=us-east-1,type=compute   Ready                      2d
172.31.30.92   kubernetes.io/hostname=172.31.30.92,region=us-east-1,type=compute   Ready                      2d
172.31.30.93   kubernetes.io/hostname=172.31.30.93,region=us-east-1,type=compute   Ready                      2d
172.31.31.6    kubernetes.io/hostname=172.31.31.6,region=us-east-1,type=infra      Ready                      2d
172.31.31.7    kubernetes.io/hostname=172.31.31.7,region=us-east-1,type=infra      Ready                      2d

Both of these clusters were built in AWS and not GCE.

chengchengmu · 2015-10-08T18:49:26Z

oo_option returns an empty array, and even when it is empty the value still counts as defined, right?
If so, it won't default to the variable openshift_node_labels.

If this is the case, giving priority to the variable openshift_node_labels should resolve this issue.

I will check it tomorrow at the office.

@wshearn @sdodson
Thank you

twiest · 2015-10-08T19:03:47Z

@menren To replace an empty array (or anything that evaluates to "false" in Python), pass a second parameter to default like this:

"{{ [] | default(my_default, true) }}"

In this case, my_default will replace the empty list.

For more information, see our best practices guide:
https://github.com/openshift/openshift-ansible/blob/master/docs/best_practices_guide.adoc#filters
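
As a rough illustration of why the second parameter matters, the sketch below imitates the documented behaviour of Jinja2's `default` filter in plain Python. It is an emulation written for this thread, not Jinja2's own code; the `Undefined` stand-in and the sample values are assumptions made for the example.

```python
class Undefined(object):
    """Stand-in for a variable that was never set (illustrative only)."""


UNDEFINED = Undefined()


def default(value, default_value="", boolean=False):
    """Imitate the `default` filter: replace undefined values, and falsy
    values too when `boolean` is true."""
    if isinstance(value, Undefined):
        return default_value
    if boolean and not value:
        return default_value
    return value


if __name__ == "__main__":
    # An empty list is defined, so without the second parameter it is kept.
    print(default([], ["fallback"]))        # []
    # With the second parameter set to True, the empty list is replaced.
    print(default([], ["fallback"], True))  # ['fallback']
```

This matches the symptom discussed above: a lookup that returns an empty list is still "defined", so a plain `default(...)` never falls through to the fallback value.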

chengchengmu · 2015-10-08T19:10:07Z

@twiest thank you for sharing your knowledge!

chengchengmu · 2015-10-09T07:52:08Z

@wshearn
Labels now takes the value of the oo_option variable (passed with the -o option of bin/cluster). If that is empty, it defaults to the variable openshift_node_labels defined by the playbooks, if defined; otherwise it defaults to none.
This should fix the labels. Could you check, please?

However, this choice is open to discussion:
Currently the oo_option variable overrides the playbook's variable.
We could also concatenate both variables.

What is your opinion?
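
To make the two options concrete, here is a small sketch of an "override" policy next to a "merge" policy. It assumes labels are represented as dictionaries, which this thread does not confirm, and the function names are hypothetical; it is not the role's actual logic.

```python
def resolve_labels(cli_labels, playbook_labels):
    """Override policy: a non-empty CLI value wins, then the playbook
    value, otherwise None."""
    return cli_labels or playbook_labels or None


def merge_labels(cli_labels, playbook_labels):
    """Merge policy: combine both sources, with CLI entries winning
    when the same key appears in both."""
    merged = dict(playbook_labels or {})
    merged.update(cli_labels or {})
    return merged or None


if __name__ == "__main__":
    playbook = {"region": "us-east-1", "type": "infra"}
    cli = {"type": "compute"}
    print(resolve_labels(cli, playbook))
    print(merge_labels(cli, playbook))
```

With the override policy, passing any label on the command line drops all playbook labels; with the merge policy, only conflicting keys are replaced.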

twiest · 2015-10-09T13:04:20Z

playbooks/common/openshift-cluster/set_infra_launch_facts_tasks.yml

Per the best practices guide, we should always pass in "true" as the second param of default, unless we're dealing with boolean values.

So, this should be:

infra_names: "{{ infra_names_output.results | default([], true) }}"

Best practices guide:
https://github.com/openshift/openshift-ansible/blob/master/docs/best_practices_guide.adoc#filters

twiest · 2015-10-09T13:06:22Z

playbooks/gce/openshift-cluster/terminate.yml

pass in true for both default calls (same as above)

twiest · 2015-10-09T13:08:47Z

ok to test

chengchengmu · 2015-10-09T14:29:33Z

@twiest All the default filters mentioned now have true as their second parameter.
Thanks

twiest · 2015-10-09T15:09:58Z

ok to test

twiest · 2015-10-09T15:11:16Z

This LGTM. Waiting on @wshearn to see if the problem he was seeing has been fixed.

twiest · 2015-10-12T15:06:42Z

playbooks/gce/openshift-cluster/launch.yml

@menren hey, sorry for missing this earlier, but I just noticed this.

So we use infra nodes, why is this commented out?

We want it like the AWS playbook which has this section in.

@twiest

We use set_node_launch_facts_tasks.yml for the infra nodes (https://github.com/openshift/openshift-ansible/blob/master/playbooks/gce/openshift-cluster/launch.yml#L31). That clearly doesn't work: the playbook stops with an error while trying to launch the infra node instances.

I tried to repair it with set_infra_launch_facts_tasks.yml. The instances launched correctly, but I ran into many issues at that time (in August).

Currently the GCE deployment isn't working at all, with or without infra nodes. I suppose nobody is working with it. If somebody is, they will have to tell me how it can work in its current state.

With this code commented out, the rest works.
We have to wait for another patch to make the infra nodes work.

This PR fixes the standard deployment without infra nodes and will allow people to work on the GCE deployment and repair the infra nodes.

Ah, I see. I just tried and I was able to launch by making the infra section look the same on gce as it does in aws.

So, it now looks like this in my branch:

    - include: ../../common/openshift-cluster/set_node_launch_facts_tasks.yml
      vars:
        type: "infra"
        count: "{{ num_infra }}"
    - include: tasks/launch_instances.yml
      vars:
        instances: "{{ node_names }}"
        cluster: "{{ cluster_id }}"
        type: "{{ k8s_type }}"
        g_sub_host_type: "{{ sub_host_type }}"

    - add_host:
        name: "{{ master_names.0 }}"
        groups: service_master
      when: master_names is defined and master_names.0 is defined

Locally, I also removed this file as it's no longer being used:

playbooks/common/openshift-cluster/set_infra_launch_facts_tasks.yml

I'm ok if we get this as a separate PR as I have some other things I had to patch to be able to launch a cluster in GCE. I'll @ mention you when I create my PR to make sure it works for you too.

@twiest It would be good if you can repair the infra nodes for GCE, because I work without them and don't know exactly how they work.

wshearn · 2015-10-12T17:32:46Z

👍

PLAY RECAP ******************************************************************** 
localhost                  : ok=79   changed=9    unreachable=0    failed=0   
whearntest-master-7b695    : ok=166  changed=58   unreachable=0    failed=0   
whearntest-node-compute-2ec70 : ok=61   changed=25   unreachable=0    failed=0   
whearntest-node-compute-a24b5 : ok=61   changed=25   unreachable=0    failed=0   
whearntest-node-compute-b899a : ok=61   changed=25   unreachable=0    failed=0   
whearntest-node-infra-c02a3 : ok=61   changed=25   unreachable=0    failed=0   
whearntest-node-infra-d2a77 : ok=61   changed=25   unreachable=0    failed=0   

[01:28 PM] [gce-support %]
[0]Pluto:~/Work/ansible/test_labels $ ssh root@XX.YY.WW.ZZ "oc get nodes"
Warning: Permanently added 'XX.YY.WW.ZZ' (ECDSA) to the list of known hosts.
NAME           LABELS                                                              STATUS                     AGE
172.20.3.126   kubernetes.io/hostname=172.20.3.126,region=us-east-1,type=infra     Ready                      4m
172.20.3.127   kubernetes.io/hostname=172.20.3.127,region=us-east-1,type=infra     Ready                      4m
172.20.3.230   kubernetes.io/hostname=172.20.3.230,region=us-east-1,type=compute   Ready                      4m
172.20.3.231   kubernetes.io/hostname=172.20.3.231,region=us-east-1,type=compute   Ready                      4m
172.20.3.232   kubernetes.io/hostname=172.20.3.232,region=us-east-1,type=compute   Ready                      4m
172.20.3.54    kubernetes.io/hostname=172.20.3.54,region=us-east-1,type=master     Ready,SchedulingDisabled   4m

chengchengmu · 2015-10-12T17:39:19Z

@wshearn Thank you for the test and the thumbs-up.

twiest · 2015-10-12T18:37:40Z

Merging because major issues have been addressed. We'll fix the infra node issue in a separate PR.

twiest · 2015-10-12T18:38:28Z

@menren great job on this PR. Thank you for your tireless efforts to make our GCE support better.

arsogukpinar and others added 2 commits October 5, 2015 18:20

Fix for name conflict

f0b52c7

Template name is conflicting with the template name from 'eap6-basic-sti.json' .

playbooks/openstack/openshift-cluster/launch.yml back to its correct …

6e80868

…version

brenton added 2 commits October 6, 2015 08:37

Merge pull request #653 from arsogukpinar/master

c22b5f2

Fix for name conflict

Merge pull request #657 from liggitt/scheduler

071eee9

Add kind/apiVersion to scheduler.json template

chengchengmu mentioned this pull request Oct 6, 2015

GCE support #641

Merged

Chengcheng Mu added 2 commits October 6, 2015 16:59

Revert "Revert "GCE support""

a3ba027

This reverts commit 3073d1f.

Merge branch 'gce-support' of https://github.com/menren/openshift-ans…

6b511f1

…ible into gce-support

sdodson reviewed Oct 7, 2015

sdodson reviewed Oct 8, 2015

fix : (node) labels defaults correctly to the variable openshift_node…

46f10c8

…_labels when oo_option returns an empty list

twiest reviewed Oct 9, 2015

Adding second param. true to many default filters

a8171a6

twiest reviewed Oct 12, 2015

twiest added a commit that referenced this pull request Oct 12, 2015

Merge pull request #658 from menren/gce-support

3547698

GCE-support

twiest merged commit 3547698 into openshift:master Oct 12, 2015

twiest mentioned this pull request Oct 12, 2015

Fixed GCE playbooks so that they're more like the AWS playbooks. #688

Merged
