Fix broken KPM scalar optimization, use 5-arg mul! instead #72

pablosanjose · 2020-06-17T11:57:33Z

The clever scalar version of the KPM iteration inherited from Elsa was broken. Since Julia 1.3 we have 5-argument mul! that implements a similar scalar algorithm. It is also multithreaded to boot. This PR uses such facility, and fixes KPM in the process (I used the computation in https://discourse.julialang.org/t/kernel-polynomial-method/34240/3 as reference). A side effect is that the code is again much clearer and closer to the original algorithm.

codecov-commenter · 2020-06-17T12:03:12Z

Codecov Report

Merging #72 into master will increase coverage by 0.52%.
The diff coverage is 0.00%.

@@            Coverage Diff             @@
##           master      #72      +/-   ##
==========================================
+ Coverage   56.20%   56.72%   +0.52%     
==========================================
  Files          15       15              
  Lines        2288     2267      -21     
==========================================
  Hits         1286     1286              
+ Misses       1002      981      -21

Impacted Files	Coverage Δ
src/KPM.jl	`0.00% <0.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 6ef6de5...bc004f3. Read the comment docs.

threading cleanup transpose mul fix for A != I use thread buffers only when needed fix commented BLAS version activate BLAS codepath

pablosanjose · 2020-06-17T16:30:19Z

It turns out the mul! approach relies on BLAS threads (of course...). After experimenting with the help of @fernandopenaranda, we cannot seem to beat BLAS with Threads.@threads (let alone MKL, which appears to be twice as fast), so the mul! method in this PR seems like the optimal choice, currently. The last commit includes the old (but corrected) scalar code, commented out, in case it is of use in the future.

more stringent test comment [skip ci]

pablosanjose added 2 commits June 17, 2020 13:45

5 - mul! version of the broken KPM optimization

0c3d2e0

added test

5cec058

pablosanjose added 2 commits June 17, 2020 18:26

back to scalar iteration

ec28a1f

threading cleanup transpose mul fix for A != I use thread buffers only when needed fix commented BLAS version activate BLAS codepath

Merge branch 'fixKPM2' into fixKPM

e7e1edc

pablosanjose force-pushed the fixKPM branch from ecbfd3d to edf51b4 Compare June 17, 2020 17:08

missing adjoint

c3768b0

more stringent test comment [skip ci]

pablosanjose force-pushed the fixKPM branch from bc004f3 to c3768b0 Compare June 17, 2020 17:11

pablosanjose merged commit 3e29975 into master Jun 17, 2020

pablosanjose deleted the fixKPM branch June 18, 2020 10:26

pablosanjose mentioned this pull request Jul 7, 2020

Threaded sparse matrix - vector multiplication #74

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix broken KPM scalar optimization, use 5-arg mul! instead #72

Fix broken KPM scalar optimization, use 5-arg mul! instead #72

pablosanjose commented Jun 17, 2020

codecov-commenter commented Jun 17, 2020 •

edited

Loading

pablosanjose commented Jun 17, 2020

Fix broken KPM scalar optimization, use 5-arg mul! instead #72

Fix broken KPM scalar optimization, use 5-arg mul! instead #72

Conversation

pablosanjose commented Jun 17, 2020

codecov-commenter commented Jun 17, 2020 • edited Loading

Codecov Report

pablosanjose commented Jun 17, 2020

codecov-commenter commented Jun 17, 2020 •

edited

Loading