Continuous benchmark tests as part of the CI #1174

esigo · 2022-01-16T09:22:42Z

Fixes #1170 (issue)

Changes

Runs the benchmark tests on every push to the main branch and pushes the results to the gh-pages branch. The results are grouped into api, sdk and exporters.

An example of working graphs can be seen here.

For significant contributions please make sure you have completed the following items:

CHANGELOG.md updated for non-trivial changes
Unit tests have been added
Changes in public API reviewed

codecov · 2022-01-16T09:29:11Z

Codecov Report

Merging #1174 (149d010) into main (fed56cc) will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##             main    #1174   +/-   ##
=======================================
  Coverage   93.29%   93.29%           
=======================================
  Files         174      174           
  Lines        6404     6404           
=======================================
  Hits         5974     5974           
  Misses        430      430

lalitb · 2022-01-19T08:16:50Z

.github/workflows/benchmark.yml

+          github-token: ${{ secrets.GITHUB_TOKEN }}
+          auto-push: true
+          # Show alert with commit comment on detecting possible performance regression
+          alert-threshold: '200%'


Is this the threshold for comparison with the previous result from the main branch? Trying to understand the flow here.

Yes, there will be an alert similar to this when there is a slowdown bigger than 200%.
A smaller threshold could be an issue, as the machines used for CI are not always the same.

OK, will fail-on-alert block the PR merge if the threshold is exceeded? If yes, is it possible to let the CI job fail but allow the merge on a case-by-case basis?

No, the merge will go through; only the job will fail, with an alert posted as a comment on the commit.
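
The alert rule discussed in this thread can be sketched roughly as follows. This is a minimal illustration, not the benchmark action's actual implementation: it assumes a "smaller is better" timing metric and compares the ratio of the current result to the previous one against the configured percentage. The helper names `parse_threshold` and `exceeds_alert_threshold` are hypothetical.

```cpp
#include <stdexcept>
#include <string>

// Hypothetical helper: parses a threshold such as "200%" into a ratio (2.0).
// Throws std::invalid_argument when the text does not end with '%' or when
// no number precedes it (std::stod throws in that case).
inline double parse_threshold(const std::string &text)
{
  if (text.empty() || text.back() != '%')
  {
    throw std::invalid_argument("threshold must end with '%'");
  }
  return std::stod(text.substr(0, text.size() - 1)) / 100.0;
}

// Hypothetical helper: returns true when the current timing is slower than
// the previous one by more than the threshold. Assumes a "smaller is better"
// metric (for example, nanoseconds per iteration).
inline bool exceeds_alert_threshold(double previous, double current, const std::string &threshold)
{
  if (previous <= 0.0)
  {
    return false;  // nothing meaningful to compare against
  }
  return (current / previous) > parse_threshold(threshold);
}
```

Under these assumptions, a benchmark that went from 100 ns to 250 ns per iteration would trigger an alert at '200%', while one that went from 100 ns to 150 ns would not. This also illustrates why a looser threshold tolerates run-to-run variation between CI machines.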

lalitb · 2022-01-19T08:26:12Z

ci/do_ci.sh

+  # collect benchmark results into one array
+  components=(api sdk exporters)
+  pushd $BENCHMARK_DIR
+  components=(api sdk exporters)


nit - this is already defined.

thanks, cleaned.

lalitb · 2022-01-19T08:33:28Z

exporters/otlp/test/otlp_grpc_exporter_benchmark.cc

@@ -4,6 +4,23 @@
 #include "opentelemetry/exporters/otlp/otlp_grpc_exporter.h"
 #include "opentelemetry/exporters/otlp/otlp_recordable.h"

+#include <benchmark/benchmark.h>
+#include "opentelemetry/exporters/otlp/otlp_grpc_exporter.h"


nit - this is included above.

thanks, cleaned.

lalitb · 2022-01-20T01:54:58Z

exporters/otlp/test/otlp_grpc_exporter_benchmark.cc

+  trace::Provider::SetTracerProvider(provider);
+}
+
+void BM_otlp_grpc_with_collector(benchmark::State &state)


Do we need to run the benchmark against the actual collector, or would it be sufficient to get results by faking the service stub, since we are more interested in the stats resulting from the otel-cpp code?

I think it's a good-to-have number for the users.
Can we have both with and without the actual collector?

I would ideally like to keep the stats with an actual collector. We have lately seen CI failures caused by transient network timeouts, and I am concerned that testing against a real collector instance could add to them. I also wonder whether adding Docker instances to the VM consumes more resources and slows down the CI jobs; we are already spawning another Docker instance for jq parsing. We can keep them if you don't see any such slowness in CI over multiple iterations.

The test with the collector was pretty stable in my test CI, although this can't be guaranteed to always be the case. The job will be executed only when we merge a commit to the main branch, so we will see any issues only after the merge to main.
This can't be part of the checks for PRs, as it would be noisy.
Shall we keep it as it is? If we get failures, I can raise a PR to use a mock.

Should be fine for me. Let's wait for suggestions from @ThomsonTan too.

Agree with @lalitb. I prefer to use a mock so that the results are not affected by environment workload and network traffic, but we could do this in a later PR.
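
The mock-based approach suggested in this thread could look roughly like the sketch below. It is only an illustration of the pattern: `ExportStub`, `FakeExportStub` and `ExportInBatches` are hypothetical names, not the generated gRPC stub interface or any otel-cpp API. The idea is that the code being measured talks to an interface, and the benchmark supplies a fake that returns immediately, so network traffic and collector load do not affect the numbers.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical stand-in for a service stub interface.
class ExportStub
{
public:
  virtual ~ExportStub() = default;
  // Returns true when the export call succeeded.
  virtual bool Export(const std::vector<std::string> &spans) = 0;
};

// Fake stub: accepts every request immediately, with no network or collector.
class FakeExportStub : public ExportStub
{
public:
  bool Export(const std::vector<std::string> &spans) override
  {
    exported_ += spans.size();
    return true;
  }

  std::size_t exported() const { return exported_; }

private:
  std::size_t exported_ = 0;
};

// Code under measurement: splits the spans into batches and hands each batch
// to whichever stub it is given. Returns the number of successful calls.
inline std::size_t ExportInBatches(ExportStub &stub, std::size_t total_spans, std::size_t batch_size)
{
  if (batch_size == 0)
  {
    return 0;  // avoid looping forever on an empty batch size
  }
  std::size_t calls = 0;
  for (std::size_t sent = 0; sent < total_spans; sent += batch_size)
  {
    const std::size_t remaining = total_spans - sent;
    const std::size_t n = remaining < batch_size ? remaining : batch_size;
    std::vector<std::string> batch(n, "span");
    if (stub.Export(batch))
    {
      ++calls;
    }
  }
  return calls;
}
```

In a benchmark, the loop body would call something like `ExportInBatches` with a `FakeExportStub`, while a separate benchmark could pass a stub backed by a real collector to obtain the end-to-end number that was described as good to have for users.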

lalitb · 2022-01-20T21:22:45Z

ci/do_ci.sh

+  do
+    out=$component-benchmark_result.json
+    find ./$component -type f -name "*_result.json" -exec cat {} \; > $component_tmp_bench.json
+    cat $component_tmp_bench.json | docker run -i --rm itchyny/gojq:0.12.6 -s \


nit - one minor comment: if it's not a major rework, can we use jq instead of gojq? It's more lightweight and has no external dependencies.

I tried jq, but it behaved differently on my local machine (Ubuntu 20.04) and on the GHA VM (Ubuntu latest). Collecting the benchmark results of all the tests into one array didn't work with the jq on the GHA VM, so I switched to gojq, which has a Docker image.

esigo and others added 2 commits January 16, 2022 09:07

google bench json

2038ee2

benchmark workflow

50c88aa

esigo requested a review from a team January 16, 2022 09:22

esigo added 2 commits January 16, 2022 10:30

no cc user on failure

9c8a484

fix CI

980a364

lalitb reviewed Jan 19, 2022

comments

152b306

lalitb reviewed Jan 20, 2022

Merge branch 'main' into continuous-benchmark

149d010

lalitb reviewed Jan 20, 2022

lalitb approved these changes Jan 20, 2022

ThomsonTan approved these changes Jan 21, 2022

ThomsonTan merged commit 2a821fd into open-telemetry:main Jan 21, 2022

esigo deleted the continuous-benchmark branch January 21, 2022 18:39

This was referenced Jan 25, 2022

Add @esigo as approver lalitb/opentelemetry-cpp#80

Closed

Add @esigo as approver #1183

Merged
