Calculate GCD for longs more efficiently #140

mlangc · 2024-02-25T13:04:20Z

Replaces the GCD implementation for long values with code that is several times faster. See
https://medium.com/@m.langer798/stein-vs-stein-on-the-jvm-c911809bfce1 for details.

Replaces the GCD implementation for long values with code that is several times faster. See https://medium.com/@m.langer798/stein-vs-stein-on-the-jvm-c911809bfce1 for details.

aherbert · 2024-02-25T22:50:41Z

Interesting analysis on your blog. It would be helpful if you apply the same changes to public static int gcd(int p, int q) too.

mlangc · 2024-02-26T17:30:40Z

Interesting analysis on your blog. It would be helpful if you apply the same changes to public static int gcd(int p, int q) too.

Makes sense - I just pushed another commit that does exactly that.

codecov-commenter · 2024-02-26T19:48:07Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 99.24%. Comparing base (27ab685) to head (6f9ebd4).
Report is 1 commits behind head on master.

Additional details and impacted files

@@             Coverage Diff              @@
##             master     #140      +/-   ##
============================================
+ Coverage     99.23%   99.24%   +0.01%     
+ Complexity     1828     1802      -26     
============================================
  Files            70       70              
  Lines          4808     4779      -29     
  Branches        896      881      -15     
============================================
- Hits           4771     4743      -28     
  Misses           10       10              
+ Partials         27       26       -1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

mlangc · 2024-02-26T20:22:35Z

I did some further benchmarks, and it seems that the implementation for int is actually better left as is. I'll look more closely tomorrow or after tomorrow & share the results with you.

mlangc · 2024-03-01T10:11:49Z

After adding some benchmarks, I found out that the existing GCD implementation for ints is also very performant for longs, if only one small change is made to it. Thus I adapted the implementation for ints, and replaced the version for longs with the same code, but for 64 bits.

Here are the benchmark results in 1000 GCDs per second on my laptop (see https://github.com/apache/commons-numbers/pull/140/files#diff-61d1811860900830accad2a21d17e8bcd905486d5c91ed6e43bef31e11e27147):

Benchmark                                Mode  Cnt      Score      Error  Units
GcdPerformance.gcdBigInteger            thrpt   10   1504.116 ±  166.441  ops/s
GcdPerformance.gcdInt                   thrpt   10  21050.470 ± 1057.856  ops/s
GcdPerformance.gcdLong                  thrpt   10  10352.825 ±  272.360  ops/s
GcdPerformance.oldGcdInt                thrpt   10  21003.386 ±  849.209  ops/s
GcdPerformance.oldGcdIntAdaptedForLong  thrpt   10   4741.043 ±  562.422  ops/s
GcdPerformance.oldGcdLong               thrpt   10   2500.723 ±   79.173  ops/s

As you can verify, the performance for ints is not really affected by the change, however for longs, there is a big difference, as you can see here (see https://colab.research.google.com/drive/11uz20qhFhUgv_-2YewzR--SDRD4_swYr#scrollTo=1M4k9mlEbvab&line=32&uniqifier=1)

aherbert · 2024-03-01T11:42:53Z

It seems that commit history changed the int version in NUMBERS-132. This introduced the use of numberOfTrailingZeros for fast divide by powers of 2. It did not update the long version.

I'll review the code with some feedback. I expect the benchmark code to fail the build due to uncommented public methods. If these are made private it should be OK.

aherbert

Thanks for the code and benchmark. I am fine with the code changes as they copy the int implementation from NUMBERS-132 to long, and improve the performance.

I have given some ideas to change the benchmark to allow more flexibility in testing. I also think it will fail the build without some form of comments on public classes and methods. You can test this by running mvn from the JMH module directory. Comments are always helpful so a future visitor will not have to look at this GH PR or your informative blog post on the topic.

aherbert · 2024-03-01T11:44:04Z