Skip to content

Commit

Permalink
SIG Cloud Provider KEP: Reporting Conformance Test Results to Testgrid
Browse files Browse the repository at this point in the history
  • Loading branch information
andrewsykim committed Jun 7, 2018
1 parent e5e4b58 commit d6b676b
Show file tree
Hide file tree
Showing 2 changed files with 230 additions and 0 deletions.
224 changes: 224 additions & 0 deletions keps/sig-cloud-provider/003-testgrid-e2e.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,224 @@
---
kep-number: 003
title: Reporting Conformance Test Results to Testgrid
authors:
- "@andrewsykim"
owning-sig: sig-cloud-provider
participating-sigs:
- sig-testing
- sig-release
- sig-aws
- sig-azure
- sig-gcp
- sig-ibmcloud
- sig-openstack
- sig-vmware
reviewers:
- TBD
approvers:
- TBD
editor: TBD
creation-date: 2018-06-06
last-updated: 2018-06-06
status: provisional

---

# Reporting Conformance Test Results to Testgrid

## Table of Contents

* [Summary](#summary)
* [Motivation](#motivation)
* [Goals](#goals)
* [Non-Goals](#non-goals)
* [Proposal](#proposal)
* [Implementation Details/Notes/Constraints](#implementation-detailsnotesconstraints)
* [Risks and Mitigations](#risks-and-mitigations)
* [Graduation Criteria](#graduation-criteria)
* [Implementation History](#implementation-history)

## Summary

This is a KEP outlining the motivation behind why cloud providers should frequently upload E2E conformance test results to [Testgrid](https://github.com/kubernetes/test-infra/tree/master/testgrid) and how a cloud provider can go about doing this.

## Motivation

The primary motivation behind collecting conformance test results from various cloud providers on a regular basis is to inform sig-release of any critical bugs. It's important to collect results from various cloud providers to increase coverage and inform other cloud providers of bugs they may be impacted by.

### Goals

* All SIGs / Subprojects owners representing a cloud provider is aware of the importance of frequently uploading conformance test results to testgrid
* There is a clear and detailed process in place for any cloud provider to upload conformance test results to Testgrid.

### Non-Goals

* Test coverage - increasing test coverage is outside the scope of this KEP
* CI/CD - what CI/CD tool is used to run E2E tests and upload results is outside the scope of this KEP. It is up to the cloud provider to decide where/how tests are run.
* Cluster Provisioning - how a cluster is provisioned is outside the scope of this KEP.

## Proposal

We would like to propose that every Kubernetes cloud provider reports conformance test results for every patch version of Kubernetes at the minimum. Running conformance tests against master and on pre-release versions are highly encouraged but will not be a requirement.

### Implementation Details/Notes/Constraints

Before continuing, it is highly recommended you read the following documentation provided by sig-testing:
* [Testgrid Configuration](https://github.com/kubernetes/test-infra/tree/master/testgrid#testgrid)
* [Display Conformance Tests with Testgrid](https://github.com/kubernetes/test-infra/blob/master/testgrid/conformance/README.md)

#### How to Run E2E Conformance Tests

[Sonobuoy](https://github.com/heptio/sonobuoy) is the recommended tool to run conformance tests on your cluster. At this point we will assume that you have a running cluster and that `kubectl` is configured to point to that cluster with admin access.

#### Installing Sonobuoy

The following is mostly a [copy of the sonobuoy documentation](https://github.com/heptio/sonobuoy#download-and-run).

First install sonobuoy. The following command adds it to the `$GOBIN` environment variable which is expected to be part of your `$PATH` environment variable.
```
$ go get -u github.com/heptio/sonobuoy
```

#### Running Conformance Tests with Sonobuoy

You can then start e2e tests by simply running:
```
$ sonobuoy run
Running plugins: e2e, systemd-logs
INFO[0001] created object name=heptio-sonobuoy namespace= resource=namespaces
INFO[0001] created object name=sonobuoy-serviceaccount namespace=heptio-sonobuoy resource=serviceaccounts
INFO[0001] created object name=sonobuoy-serviceaccount-heptio-sonobuoy namespace= resource=clusterrolebindings
INFO[0001] created object name=sonobuoy-serviceaccount namespace= resource=clusterroles
INFO[0001] created object name=sonobuoy-config-cm namespace=heptio-sonobuoy resource=configmaps
INFO[0002] created object name=sonobuoy-plugins-cm namespace=heptio-sonobuoy resource=configmaps
INFO[0002] created object name=sonobuoy namespace=heptio-sonobuoy resource=pods
INFO[0002] created object name=sonobuoy-master namespace=heptio-sonobuoy resource=services
```

You can then check the status of your e2e tests by running
```
$ sonobuoy status
PLUGIN STATUS COUNT
e2e running 1
systemd_logs running 3
Sonobuoy is still running. Runs can take up to 60 minutes.
```

E2E tests can take up to an hour. Once your tests are done, you can download a snapshot of your results like so:
```
$ sonobuoy retrieve .
```

Once you have the following snapshot, extract it's contents like so
```
$ mkdir ./results; tar xzf *.tar.gz -C ./results
```

At this point you should have the log file and JUnit results from your tests:
```
results/plugins/e2e/results/e2e.log
results/plugins/e2e/results/junit_01.xml
```

#### How to Upload Conformance Test Results to Testgrid

##### Requesting a GCS Bucket

Testgrid requires that you store results in a publicly readable GCS bucket. If for whatever reason you cannot set up a GCS bucket, please contact @BenTheElder or more generally the [gke-kubernetes-engprod](mailto:[email protected]) team to arrange for a Google [GKE](https://cloud.google.com/kubernetes-engine/) EngProd provided / maintained bucket for hosting your results.

##### Authenticating to your Testgrid Bucket

Assuming that you have a publicly readable bucket provided by the GKE team, you should have been provided a service account JSON file which you can use with [gcloud](https://cloud.google.com/sdk/downloads) to authenticate with your GCS bucket.

```
$ gcloud auth activate-service-account --key-file /path/to/k8s-conformance-serivce-accout.json
Activated service account credentials for: [demo-bucket-upload@k8s-federated-conformance.iam.gserviceaccount.com]
```

##### Uploading results to Testgrid

At this point you should be able to upload your testgrid results to your GCS bucket. You can do so by running a python script availabile [here](https://github.com/kubernetes/test-infra/tree/master/testgrid/conformance). For this example, we upload results for v1.10 into it's own GCS prefix.
```
git clone https://github.com/kubernetes/test-infra
cd testgrid/conformance
./upload_e2e.py --junit /path/to/results/plugins/e2e/results/e2e.log \
--log /path/to/results/plugins/e2e/results/e2e.log \
--bucket=gs://k8s-conformance-demo/cloud-provider-demo/e2e-conformance-release-v1.10
Uploading entry to: gs://k8s-conformance-demo/cloud-provider-demo/e2e-conformance-release-v1.10/1528333637
Run: ['gsutil', '-q', '-h', 'Content-Type:text/plain', 'cp', '-', 'gs://k8s-conformance-demo/cloud-provider-demo/e2e-conformance-release-v1.10/1528333637/started.json'] stdin={"timestamp": 1528333637}
Run: ['gsutil', '-q', '-h', 'Content-Type:text/plain', 'cp', '-', 'gs://k8s-conformance-demo/cloud-provider-demo/e2e-conformance-release-v1.10/1528333637/finished.json'] stdin={"timestamp": 1528337316, "result": "SUCCESS"}
Run: ['gsutil', '-q', '-h', 'Content-Type:text/plain', 'cp', '/Users/AndrewSyKim/go/src/k8s.io/kubernetes/results/plugins/e2e/results/e2e.log', 'gs://k8s-conformance-demo/cloud-provider-demo/e2e-conformance-release-v1.10/1528333637/build-log.txt']
Run: ['gsutil', '-q', '-h', 'Content-Type:text/plain', 'cp', '/Users/AndrewSyKim/go/src/k8s.io/kubernetes/results/plugins/e2e/results/e2e.log', 'gs://k8s-conformance-demo/cloud-provider-demo/e2e-conformance-release-v1.10/1528333637/artifacts/e2e.log']
Done.
```

##### Testgrid Configuration

Next thing you want to do is configure testgrid to read results from your GCS bucket. There are two [configuration](https://github.com/kubernetes/test-infra/tree/master/testgrid#configuration) steps required. One for your [test group](https://github.com/kubernetes/test-infra/tree/master/testgrid#test-groups) and one for your [dashboard](https://github.com/kubernetes/test-infra/tree/master/testgrid#dashboards).

To add a test group update [config.yaml](https://github.com/kubernetes/test-infra/blob/master/testgrid/config.yaml) with something like the following:
```
test_groups:
...
...
- name: cloud-provider-demo-e2e-conformance-release-v1.10
gcs_prefix: k8s-conformance-demo/cloud-provider-demo/e2e-conformance-release-v1.10
```

To add a link to your results in the testgrid dashboard, update [config.yaml](https://github.com/kubernetes/test-infra/blob/master/testgrid/config.yaml) with something like the following:
```
dashboards:
...
...
- name: conformance-demo-cloud-provider
dashboard_tab:
- name: "Demo Cloud Provider, v1.10"
description: "Runs conformance tests for cloud provier demo on release v1.10"
test_group_name: cloud-provider-demo-e2e-conformance-release-v1.10
```

Once you've made the following changes, open a PR against the test-infra repo. Once your PR merges you should be able to view your results on https://k8s-testgrid.appspot.com/ which should be ready to be consumed by the necessary stakeholders (sig-release, sig-testing, etc).


#### Examples

Here are some more concrete examples of how other cloud providers are running conformance tests and uploading results to testgrid:
* Open Stack
* [OpenLan zuul job for running/uploading testgrid results](https://github.com/theopenlab/openlab-zuul-jobs/tree/master/playbooks/cloud-provider-openstack-acceptance-test-e2e-conformance)
* [OpenStack testgrid config](https://github.com/kubernetes/test-infra/pull/7670)
* [OpenStack conformance tests dashboard](https://github.com/kubernetes/test-infra/pull/8154)


### Risks and Mitigations

#### Operational Overhead

Operating CI/CD system to run conformance tests on a regular basis may incur extra work from every cloud provider. Though we anticipate the benefits of running conformance tests to outweight the operational overhead, in some cases it may not.

Mitigation: TODO

#### Misconfigured Tests

There are various scenarioes where cloud providers may mistakenly upload incorrect conformance tests results. One example being uploading results for the wrong Kubernetes version.

Mitigation: TODO

#### Flaky Tests

Tests can fail for various reasons in any cloud environment and may raise false negatives for the release team.

Mitigation: TODO


## Graduation Criteria

All providers are uploading conformance test results as part of every Kubernetes major/minor/patch release.

[umbrella issues]: TODO

## Implementation History

- the `Summary` and `Motivation` sections being merged signaling SIG acceptance

6 changes: 6 additions & 0 deletions keps/sig-cloud-provider/OWNERS
Original file line number Diff line number Diff line change
@@ -0,0 +1,6 @@
reviewers:
- sig-cloud-provider-leads
approvers:
- sig-cloud-provider-leads
labels:
- sig/cloud-provider

0 comments on commit d6b676b

Please sign in to comment.