This repository has been archived by the owner on Sep 30, 2023. It is now read-only.

Benchmark runner (WIP) #143

Merged
merged 34 commits into master from test/benchmark-runner
May 29, 2019

Conversation

mistakia
Contributor

mistakia commented Jul 26, 2018

Broke ground on a barebones benchmark runner based on the benchmarks that currently exist.

I was thinking about having benchmarks that capture average ops/second over a set time span (10s?) and use various log sizes (1, 100, 1000, 10000, 50000) for reads and certain static methods.

Let me know if you have a better approach for fromEntryHash, fromJSON, etc., where ops/second is not as meaningful for larger log sizes.

Then we can cache the results (via committing a file to the repo? or maybe on circle somehow) so that circle CI can run and compare PRs to master.

Let me know what you think about the overall approach — then I can round out this PR to separate the benchmarks.js file, add all the benchmark tests for the public & static methods, and set up fancy logging.
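To make that concrete, here is a rough sketch of what a single benchmark entry could look like. The prepare and cycle names are hypothetical, not the runner's final API, though the while(stats, startTime) predicate matches the shape discussed later in this thread:

// Sketch only: measure append ops/second over a ~10 second span
// against a log pre-populated with 1000 entries.
module.exports = {
  name: 'append-baseline-1000',
  prepare: async (ipfs) => { /* hypothetical: build a log with 1000 entries */ },
  cycle: async (log) => { /* hypothetical: perform a single append */ },
  while: (stats, startTime) => process.hrtime(startTime)[0] < 10 // elapsed seconds < 10
}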

Todo

  • logging
  • parse args for test name
  • cache baseline results (circleci)
  • compare/delta baseline results to cache (circleci)
  • clean report
  • organize benchmarks folder
  • add npm scripts

Benchmarks

  • values
  • heads
  • tails
  • tailHashes
  • get
  • has
  • traverse
  • append
  • join
  • toString
  • toMultihash
  • fromMultihash
  • fromEntryHash
  • fromEntry
  • findHeads
  • findTails (duplicate of tails?)
  • findTailHashes (duplicate of tailHashes?)

Ref: #140

@haadcode
Member

haadcode commented Aug 1, 2018

@mistakia fantastic! 👍❤️ I really like the approach you've taken to create a benchmark runner. It'll be much easier to add more benchmarks in the future as well as have coherent api/code for all the benchmarks.

It would be great if we could specify how long each benchmark is run, both as "how many cycles to run" and as "how many seconds". The latter is hugely important for letting a benchmark run over a long period to observe behaviour (algorithmic complexity, memory usage, stress testing), while the former is great for getting a "stable" benchmark baseline that can be compared, for example in CI, against previous versions. What do you think?

@mistakia
Contributor Author

mistakia commented Aug 2, 2018

Agreed.

As the PR currently stands that would be accomplished by two different tests like so:

{
  name: 'append-baseline',
  ...
  while: (stats, startTime) => {
    return stats.count < 1000 // 1000 iterations
  },
  ...
}
{
  name: 'append-stress',
  ...
  while: (stats, startTime) => {
    return process.hrtime(startTime)[0] < 900 // hrtime()[0] is elapsed whole seconds; 900s = 15 mins
  },
  ...
}

I can create two groups of tests: baseline and stress. Then create separate commands to run each suite: npm run benchmark:baseline and npm run benchmark:stress.

Let me know if that's acceptable (and if you have better names for the groups) and I can update the PR accordingly.
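For illustration, those scripts might look something like this in package.json; the runner path and flags are taken from the command shown later in this thread, while the grep patterns are assumptions rather than the PR's final scripts:

"scripts": {
  "benchmark:baseline": "node --expose-gc benchmarks/runner/index.js -r --grep baseline",
  "benchmark:stress": "node --expose-gc benchmarks/runner/index.js -r --grep stress"
}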

mistakia force-pushed the test/benchmark-runner branch 2 times, most recently from 24b7539 to da978be on August 15, 2018 16:45
@aphelionz
Contributor

aphelionz commented Nov 3, 2018

Hi @mistakia, how are you? I'm the new guy on the Haja networks team, and I'm reaching out to see if I can help with this PR in any way.

@mistakia
Contributor Author

@aphelionz nice to meet ya! Just getting back into this and catching up on all the changes in ipfs/orbit-db/ipfs-log. Once I have this rebased to master, I'll have a better idea of what help/questions I may need/have to get this to the finish line.

@aphelionz
Contributor

@mistakia Great! I'm here to help. Let's get this over the finish line :)

mistakia force-pushed the test/benchmark-runner branch from 8520e96 to 558e29d on January 1, 2019 02:01
@mistakia
Contributor Author

mistakia commented Jan 1, 2019

@aphelionz I think I'm caught up — rebased, updated (linting + new constructor) and added the remaining benchmarks. It's nearly ready for review after addressing the following:

(1) I would like to add a feature where CircleCI can compare PR benchmarks to master by committing a file to the repo that the benchmark runner can use to compare. Let me know if there's a better approach and/or if this is a desired feature.

(2) Also, the "stress" benchmarks all currently run for 300 seconds. Let me know if it's desirable to allow for a limit to be passed in via the command line. (added)

(3) The memory usage seems to be off/uninformative as many of the benchmarks produce negative memory usage. (fixed)
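As an aside on (3): one common way to get less noisy numbers (not necessarily the fix applied in this PR) is to force a garbage collection before each heap sample, which is why the runner is invoked with --expose-gc later in the thread:

// Sketch only: sample heap usage around a benchmark run, forcing GC first
// (requires node --expose-gc) so collections mid-run don't produce negative deltas.
global.gc()
const before = process.memoryUsage().heapUsed
// ... run the benchmark cycles ...
global.gc()
const after = process.memoryUsage().heapUsed
console.log('heap delta (bytes):', after - before)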

mistakia force-pushed the test/benchmark-runner branch from 0e5ef13 to 3d98d8d on March 19, 2019 20:25
@mistakia
Contributor Author

Seeing some issues related to ipfs/js-ipfs-repo/issues/188.

mistakia force-pushed the test/benchmark-runner branch from 3d98d8d to 8cca728 on May 17, 2019 18:35
mistakia force-pushed the test/benchmark-runner branch from 2a0d334 to f8699f1 on May 17, 2019 19:38
@mistakia
Contributor Author

Rebased & fixed the memory measurement issue. I think this is ready for review. Having to run with node v11 to avoid alanshaw/pull-stream-to-async-iterator#1.

Let me know how to proceed.

@mistakia
Contributor Author

Stress benchmarks can now run for any time limit by passing in a limit (in seconds, or Infinity):

node --expose-gc benchmarks/runner/index.js -r --grep append-stress --stress-limit Infinity
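A minimal sketch of how such a limit might be parsed, assuming yargs (added as a dev dependency in this PR) and not necessarily matching the runner's actual option handling:

// Sketch only: accept --stress-limit as a number of seconds or the string 'Infinity'.
const argv = require('yargs')
  .option('stress-limit', {
    describe: 'how long to run stress benchmarks, in seconds (or Infinity)',
    default: 300 // the stress benchmarks currently default to 300 seconds
  })
  .coerce('stress-limit', (value) => (value === 'Infinity' ? Infinity : Number(value)))
  .argv

console.log(argv['stress-limit']) // e.g. Infinity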

@aphelionz
Contributor

This looks great @mistakia !

I'd love a quick primer on how to use this, either in a README or another MD file, and then I'd love to see it merged. What I really want to see in the end is this incorporated into CI so developers can see the impact their work has on performance with each PR.

What say you, @haadcode and @shamb0t ?

@aphelionz
Contributor

Cool! Playing with it now and having a lot of fun. THANK YOU so much for doing this.

Here's stuff I'd love to see:

  • The ability to pass in a baseline-limit as well as a stress-limit
  • The ability to pass in a "count" in some way. I see you have them at 1, 100, 1000, and 10000. Maybe something like max-count that accepts that set of values and only runs the benchmarks that are <= max-count (a rough filtering sketch follows after this comment).
  • Finally, can we optimize by sharing the same log instantiations (1, 100, 1000, and 10000) between baseline and stress, instead of creating them separately for each benchmark?

All that being said, this is fine to merge as is and we can move the above list to an issue to tackle asynchronously from this. I actually really want to use this ASAP to sink my teeth into the append-performance branch, and then start using this as a paradigm for the various *-stores, orbit-db-storage-adapter, and orbit-db itself.
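For the max-count idea above, the filtering itself could be as simple as the following; the names are hypothetical, and the PR later exposes this as logLimit:

// Sketch only: run just the benchmarks whose log size is within the limit.
const benchmarks = [
  { name: 'append-baseline-1', count: 1 },
  { name: 'append-baseline-1000', count: 1000 },
  { name: 'append-baseline-10000', count: 10000 }
]
const maxCount = 1000 // would come from a CLI flag in practice
const selected = benchmarks.filter((b) => b.count <= maxCount)
console.log(selected.map((b) => b.name)) // [ 'append-baseline-1', 'append-baseline-1000' ]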

@aphelionz
Contributor

Sorry, not "Finally" :)

  • It should clean up its ipfs-log-benchmarks folder

@mistakia
Contributor Author

I should have some time in the next few days to tackle those issues. Most of it should be quick and straightforward. Only thing that requires some thinking is incorporating it into CI.

If that’s not quick enough, feel free to merge because I can’t really think of a great reason to wait.

As for the CI integration, my thinking was to add a flag to the command that writes the results to a JSON file that can be committed to the repo. If such a file exists, the benchmark reporter will use it to generate a comparison. Then we can just add the benchmark runner/reporter to CI. Let me know if you have a better approach.
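A rough sketch of that idea, with a hypothetical file path and result shape rather than the PR's actual reporter: write results to a JSON file and, if a previously committed file exists, print a per-benchmark delta:

// Sketch only: persist results and compare against a committed baseline file.
const fs = require('fs')
const BASELINE_FILE = 'benchmarks/baseline.json' // hypothetical path

function report (results) { // results: { [benchmarkName]: opsPerSecond }
  if (fs.existsSync(BASELINE_FILE)) {
    const previous = JSON.parse(fs.readFileSync(BASELINE_FILE, 'utf8'))
    for (const [name, ops] of Object.entries(results)) {
      if (name in previous) {
        console.log(name + ': ' + ops + ' ops/s (' + (ops - previous[name]).toFixed(2) + ' vs baseline)')
      }
    }
  }
  fs.writeFileSync(BASELINE_FILE, JSON.stringify(results, null, 2))
}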

mistakia force-pushed the test/benchmark-runner branch from 558dd68 to d02c85f on May 28, 2019 19:15
@mistakia
Contributor Author

mistakia commented May 28, 2019

I've updated the PR with the following changes:

  • added a baselineLimit
  • added a logLimit (synonymous with max-count)
  • added cleanup of the files/folder on completion
  • added a help output documenting all the options + a couple of example commands:

[Screenshot: the runner's help output, listing all options and a couple of example commands]

I'm working on integrating with CircleCI but am struggling a bit to plot a path forward. My previous suggestion doesn't quite make sense: the saved benchmark file committed to the repo would have been produced on a different machine, so it isn't a good comparison when used in CircleCI. I'm not very familiar with CircleCI yet; it appears I can save test metadata as "artifacts", but I'm not sure how I would access it later inside a job. The build/test job of a PR branch will have to access the metadata generated from the last master build/test job.

@aphelionz
Contributor

@mistakia Looks great. I'm thinking we can merge this and I can work with CI since I'm already in there doing a bunch of other stuff.

Final words, @shamb0t or @haadcode ?

@mistakia
Contributor Author

Sounds good. Let me know how I can help.

Also, I should note that this PR adds yargs as a dev dep.

@shamb0t
Contributor

shamb0t commented May 29, 2019

This is fantastic, thank you @mistakia! ❤️

@haadcode
Member

Awesome work @mistakia and @aphelionz 👏 This is a really great addition to the dev tooling, LGTM 👍

aphelionz merged commit 621e38c into orbitdb-archive:master on May 29, 2019
@aphelionz
Contributor

Congrats @mistakia :)

mistakia deleted the test/benchmark-runner branch on May 29, 2019 18:48