Improve `bundler/setup` performance again by not deduplicating intermediate results by deivid-rodriguez · Pull Request #5533 · ruby/rubygems

deivid-rodriguez · 2022-05-13T09:33:26Z

What was the end-user or developer problem that led to this PR?

`bundler/setup` is still slow.

What is your fix for the problem, implemented in this PR?

On a different patch, it was noticed by @ngan that we are calling LazySpecification#hash many times, and simply memoizing that led to a very considerable performance improvement in his app.

I noticed though that we shouldn't be calling LazySpecification#hash that many times, and I located the culprit at SpecSet#for where we were deduplicating the partial aggregated result on every iteration. It is enough to do it just once at the end.

This leads to a 12% speedup on the Rails repository Gemfile, versus the 8% I was previously getting from memoizing LazySpecification#hash. Also, after this patch, memoizing LazySpecification#hash no longer has any effect on performance.
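To illustrate why deduplicating only once matters, here is a minimal sketch. It is not the actual `SpecSet#for` code: `CountingSpec`, `collect_dedup_each_time`, and `collect_dedup_once` are hypothetical names made up for this example, and the `#hash` counter exists only to make the cost visible.

```ruby
# Hypothetical stand-in for a specification object (not Bundler's
# LazySpecification). It counts how often #hash is called.
class CountingSpec
  attr_reader :name, :version

  @hash_calls = 0
  class << self
    attr_accessor :hash_calls
  end

  def initialize(name, version)
    @name = name
    @version = version
  end

  def hash
    self.class.hash_calls += 1
    [name, version].hash
  end

  def eql?(other)
    other.is_a?(CountingSpec) && name == other.name && version == other.version
  end
  alias_method :==, :eql?
end

# Deduplicates the partial aggregated result on every iteration.
def collect_dedup_each_time(batches)
  result = []
  batches.each do |batch|
    result.concat(batch)
    result.uniq! # re-hashes everything collected so far
  end
  result
end

# Aggregates everything first and deduplicates once at the end.
def collect_dedup_once(batches)
  result = []
  batches.each {|batch| result.concat(batch) }
  result.uniq
end

# Each batch adds one new spec and one spec that may already be present.
batches = Array.new(50) do |i|
  [CountingSpec.new("gem#{i}", "1.0"), CountingSpec.new("gem#{i / 2}", "1.0")]
end

CountingSpec.hash_calls = 0
each_time = collect_dedup_each_time(batches)
calls_each_time = CountingSpec.hash_calls

CountingSpec.hash_calls = 0
once = collect_dedup_once(batches)
calls_once = CountingSpec.hash_calls

puts "same result: #{each_time == once}"
puts "#hash calls when deduplicating every iteration: #{calls_each_time}"
puts "#hash calls when deduplicating once: #{calls_once}"
```

Both approaches return the same list, but the per-iteration variant hashes the whole accumulated array again on each pass, so the number of `#hash` calls grows much faster than the number of specs.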

Make sure the following tasks are checked

Describe the problem / feature
Write tests for features and bug fixes
Write code to solve the problem
Make sure you follow the current code style and write meaningful commit messages without tags

On a different patch, it was noticed by Ngan Pham that we are calling `LazySpecification#hash` many times, and simply memoizing that led to a very considerable performance improvement in his app.

I noticed though that we shouldn't be calling `LazySpecification#hash` that many times, and I located the culprit at `SpecSet#for` where we were deduplicating the partial aggregated result on every iteration. It is enough to do it just once at the end.

This leads to a 12% speedup on the Rails repository Gemfile, versus the 8% I was previously getting from memoizing `LazySpecification#hash`. Also, after this patch, memoizing `LazySpecification#hash` no longer has any effect on performance.

Co-authored-by: Ngan Pham <ngan@users.noreply.github.com>

bundler/lib/bundler/spec_set.rb

On `rails/rails` repository Gemfile, running the following script

```
# script.rb
require "bundler/setup"
```

#### Before

```
➜ rails git:(main) ✗ BUNDLER_VERSION=2.4.0.dev ruby-memory-profiler --pretty --no-detailed --allocated-strings=0 --retained-strings=0 script.rb
Total allocated: 24.37 MB (207937 objects)
Total retained: 2.98 MB (34152 objects)
```

#### After

```
➜ rails git:(main) ✗ BUNDLER_VERSION=2.4.0.dev ruby-memory-profiler --pretty --no-detailed --allocated-strings=0 --retained-strings=0 script.rb
Total allocated: 22.27 MB (206856 objects)
Total retained: 2.98 MB (34152 objects)
```

Co-authored-by: Josh Nichols <josh.nichols@gusto.com>

technicalpickles · 2022-05-13T20:49:00Z

Compared this to 2.3.9 and 2.3.13 with our Gemfile:

# output is bundler version, and the output of `Benchmark.measure`
2.3.9
  2.789033   0.243557   3.032590 (  3.830557)

2.3.13
  1.328539   0.277676   1.606215 (  2.631212)

2.4.0.dev
  0.923131   0.097992   1.021123 (  1.024666)

So, looks good!
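For anyone reading those numbers: a minimal sketch of how such a measurement can be produced with the standard library's `Benchmark.measure`. The `workload` method is a hypothetical stand-in; in the comparison above the measured block would presumably be `require "bundler/setup"` run inside an app with a Gemfile.

```ruby
require "benchmark"

# Hypothetical workload standing in for `require "bundler/setup"`.
def workload
  (1..50_000).map {|i| i.to_s }.uniq.size
end

tms = Benchmark.measure { workload }

# Benchmark::Tms#to_s prints four columns:
#   user CPU time, system CPU time, total CPU time, (real elapsed time)
puts tms

# The same values are available individually.
printf("user: %.6f system: %.6f total: %.6f real: %.6f\n",
       tms.utime, tms.stime, tms.total, tms.real)
```

The value in parentheses is wall-clock time, so in the results above it is the last column that shows what a user actually waits for.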

Looking over flamegraphs for `require 'bundler/setup'`, I'm seeing `Bundler::DepProxy#name` show up quite often. The method itself is really simple, delegating to the dependency's proxy. I suspect it is getting called often enough that memoizing the value will improve performance by saving a method call, in exchange for keeping the value in memory. When testing with this patch plus ruby#5533 I saw time go from 0.92s to 0.75s.
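A minimal sketch of the memoization idea described above. The classes here (`CountingDependency`, `DelegatingProxy`, `MemoizingProxy`) are hypothetical and are not Bundler's real `DepProxy`; the sketch also assumes the delegated name never changes after the proxy is created.

```ruby
# Hypothetical dependency that counts how often #name is called.
class CountingDependency
  attr_reader :name_calls

  def initialize(name)
    @name = name
    @name_calls = 0
  end

  def name
    @name_calls += 1
    @name
  end
end

# Delegates to the wrapped dependency on every call.
class DelegatingProxy
  def initialize(dep)
    @dep = dep
  end

  def name
    @dep.name
  end
end

# Delegates once and caches the result in an instance variable.
class MemoizingProxy
  def initialize(dep)
    @dep = dep
  end

  def name
    @name ||= @dep.name
  end
end

plain_dep = CountingDependency.new("rack")
memo_dep = CountingDependency.new("rack")
plain = DelegatingProxy.new(plain_dep)
memo = MemoizingProxy.new(memo_dep)

1_000.times do
  plain.name
  memo.name
end

puts plain_dep.name_calls # prints 1000
puts memo_dep.name_calls  # prints 1
```

The trade-off is one extra instance variable per proxy in exchange for skipping the delegated call on every later lookup.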

…andled deps

I was looking at (yet another) flamegraph in speedscope, used the 'left hand heavy' view, and was shocked to realize that 0.5s of the 1.7s is spent in DepProxy#name. This method _only_ delegates the name to an underlying spec, so it's not complex at all. It seems to be because of how often this line ends up calling it:

`next if handled.any?{|d| d.name == dep.name && (match_current_platform || d.__platform == dep.__platform) } || dep.name == "bundler"`

The `handled` array is built up as dependencies are handled, so this gets slower as more dependencies are installed.

This change alters how `handled` is tracked. Instead of just an array, I've tried using a Hash, with the key being a dep's name, and the value being a list of deps with that name. This means it's constant time to find the dependencies with the same name.

I saw a drop from 1.7s to 1.0s against master, and from 0.95s to 0.24s when used with ruby#5533
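A rough sketch of the array-versus-hash idea from the commit message above. `Dep` and the helper methods are hypothetical names for illustration, `platform` stands in for `__platform`, and the special case for `"bundler"` is left out; this is not the code that was merged.

```ruby
# Hypothetical dependency with a name and a platform (not Bundler's DepProxy).
Dep = Struct.new(:name, :platform)

# Array version: every check scans all previously handled deps.
def handled_linear?(handled, dep, match_current_platform)
  handled.any? do |d|
    d.name == dep.name && (match_current_platform || d.platform == dep.platform)
  end
end

# Hash version: look up deps with the same name first, then compare platforms.
def handled_indexed?(handled, dep, match_current_platform)
  same_name = handled[dep.name]
  return false unless same_name

  match_current_platform || same_name.any? {|d| d.platform == dep.platform }
end

def unique_deps_linear(deps, match_current_platform)
  handled = []
  deps.each do |dep|
    next if handled_linear?(handled, dep, match_current_platform)
    handled << dep
  end
  handled
end

def unique_deps_indexed(deps, match_current_platform)
  handled = {} # name => list of deps with that name
  result = []
  deps.each do |dep|
    next if handled_indexed?(handled, dep, match_current_platform)
    (handled[dep.name] ||= []) << dep
    result << dep
  end
  result
end

deps = [
  Dep.new("rack", "ruby"),
  Dep.new("nokogiri", "ruby"),
  Dep.new("nokogiri", "x86_64-linux"),
  Dep.new("rack", "ruby"),
]

puts unique_deps_linear(deps, false) == unique_deps_indexed(deps, false) # prints true
puts unique_deps_indexed(deps, false).size # prints 3
puts unique_deps_indexed(deps, true).size  # prints 2
```

With the hash, the name comparison becomes a single lookup, and only the (usually very short) list of deps sharing that name is scanned for a platform match.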

Improve `bundler/setup` performance again (cherry picked from commit b0958db)

…ing hash lookup of handled deps

I was looking at (yet another) flamegraph in speedscope, used the 'left hand heavy' view, and was shocked to realize that 0.5s of the 1.7s is spent in DepProxy#name. This method _only_ delegates the name to an underlying spec, so it's not complex at all. It seems to be because of how often this line ends up calling it:

`next if handled.any?{|d| d.name == dep.name && (match_current_platform || d.__platform == dep.__platform) } || dep.name == "bundler"`

The `handled` array is built up as dependencies are handled, so this gets slower as more dependencies are installed.

This change alters how `handled` is tracked. Instead of just an array, I've tried using a Hash, with the key being a dep's name, and the value being a list of deps with that name. This means it's constant time to find the dependencies with the same name.

I saw a drop from 1.7s to 1.0s against master, and from 0.95s to 0.24s when used with ruby/rubygems#5533

ruby/rubygems@844dac30d4

technicalpickles reviewed May 13, 2022

bundler/lib/bundler/spec_set.rb (outdated, resolved)

bundler/lib/bundler/spec_set.rb (resolved)

deivid-rodriguez mentioned this pull request May 13, 2022

Memoize Bundler::LazySpecification#hash to improve performance #5534

Closed

4 tasks

deivid-rodriguez added the bundler: performance label May 13, 2022

ngan mentioned this pull request May 13, 2022

Memoize some #hash calls #5532

Closed

4 tasks

technicalpickles mentioned this pull request May 13, 2022

Bundler::DepProxy#name performance improvement. #5536

Closed

4 tasks

technicalpickles mentioned this pull request May 13, 2022

Improve performance of Bundler::SpecSet#for by using hash lookup of handled deps #5537

Merged

4 tasks

deivid-rodriguez merged commit b0958db into master May 16, 2022

deivid-rodriguez deleted the alt-speedup branch May 16, 2022 08:24

deivid-rodriguez changed the title ~~Improve bundler/setup performance again~~ Improve bundler/setup performance again by not deduplicating intermediate results May 18, 2022

deivid-rodriguez added a commit that referenced this pull request May 18, 2022

Merge pull request #5533 from rubygems/alt-speedup

50663e8

Improve `bundler/setup` performance again (cherry picked from commit b0958db)

deivid-rodriguez merged 2 commits into master from alt-speedup
