diag of SparseMatrixCSC should always return SparseVector #23261

fredrikekre · 2017-08-14T22:05:33Z

diag(A::SparseMatrixCSC, k::Int=0) now dispatches here and then to getindex for sparse matrices. I will do some benchmarking to see whether it is worth updating SpDiagIterator or keeping it as is.

fix JuliaLang/LinearAlgebra.jl#411

tkelman · 2017-08-14T22:14:09Z

if it doesn't help performance to adapt SpDiagIterator, we should maybe deprecate that if it isn't used anywhere else

fredrikekre · 2017-08-14T22:24:44Z

It is used in trace too, but that's it. Will see what the benchmarks say tomorrow.

Sacha0

superficially lgtm :).

andreasnoack · 2017-08-15T19:32:04Z

base/sparse/linalg.jl

@@ -879,7 +879,7 @@ for f in (:\, :Ac_ldiv_B, :At_ldiv_B)
            if m == n
                if istril(A)
                    if istriu(A)
-                        return ($f)(Diagonal(A), B)
+                        return ($f)(Diagonal(Vector(diag(A))), B)


Is this to avoid changing the behavior? I guess it should still work with a sparse vector, right?

Yea it works, but will result in a SparseMatrixCSC, so the result is not inferable.

fredrikekre · 2017-08-15T21:20:03Z

It seems that using SpDiagIterator is faster, so we should rewrite it to work with an arbitrary diagonal. I suppose this could be merged as is and we can make performance improvements later, but we might as well leave it as WIP until I get around to fixing it.

fredrikekre · 2017-08-16T15:02:15Z

I updated SpDiagIterator such that we can iterate over any diagonal.

tkelman · 2017-08-16T16:01:05Z

base/sparse/sparsematrix.jl

+    ind = Vector{Ti}(); sizehint!(ind, min(l, nnz(A)))
+    val = Vector{Tv}(); sizehint!(val, min(l, nnz(A)))
+    for (i, v) in enumerate(SpDiagIterator(A, d))
+        if !iszero(v)


does SpDiagIterator skip structural zeros or produce them?

Well, it is like getindex so it returns either the stored value at that position or zero(T). I am tempted to just iterate over stored values but that would be breaking I guess. Could also just remove SpDiagIterator and move it into the diag function.

> that would be breaking I guess

in what way?

Something like

    for i in SpDiagIterator(spzeros(3,3))
        # ...
    end

Currently gives a loop of length 3, with i being 0 all three times. If the iterator only returned stored values, this loop would be done on the first iteration.
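The difference between the two traversal styles can be sketched on a current Julia, assuming the `SparseArrays` standard library. `SpDiagIterator` no longer exists, so plain indexing stands in for the getindex-like behavior; `getindex_style` and `stored_only` are names invented for this illustration.

```julia
using SparseArrays

A = spzeros(3, 3)  # 3×3 matrix with no stored entries

# getindex-style traversal: one value per diagonal position,
# zero(T) wherever nothing is stored
getindex_style = [A[i, i] for i in 1:3]

# stored-values-only traversal: keep only diagonal entries that are
# actually present in the sparse structure
rows, cols, vals = findnz(A)
stored_only = [v for (r, c, v) in zip(rows, cols, vals) if r == c]

println(length(getindex_style))  # number of iterations, getindex style
println(length(stored_only))     # number of iterations, stored values only
```

The first traversal visits every diagonal position, while the second visits nothing for an all-zero matrix, which is the behavior change being weighed here.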

Ah yes good point. I just grepped all registered packages and it's never used outside of Base, so I think as long as it still does what it's supposed to in the places that Base uses it, we can change its behavior if it would be an improvement.

Yea I did some searching too, but found it nowhere else. In that case I think we can just remove it. I did some benchmarking for trace and it seems that

    function trace2(A::SparseMatrixCSC{Tv,Ti}) where {Tv,Ti}
        n = checksquare(A)
        d = zero(Tv)
        for i in 1:n
            d += A[i,i]
        end
        return d
    end

is faster than using the iterator. So I will remove it, and then include the iteration code inside the diag body instead.

fredrikekre · 2017-08-17T00:32:26Z

Take 3: Removed SpDiagIterator and imported the iteration code into the diag body. Now we collect all the stored values, so stored zeros will be preserved, which is kinda neat IMO

julia> A
3×3 SparseMatrixCSC{Float64,Int64} with 1 stored entry:
  [1, 1]  =  0.0

julia> diag(A)
3-element SparseVector{Float64,Int64} with 1 stored entry:
  [1]  =  0.0
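A matrix like the one above can be built as follows. This is a small sketch that assumes a current Julia with the `SparseArrays` and `LinearAlgebra` standard libraries rather than the 2017 `Base` layout used in this PR; the printed type parameters depend on the platform's `Int`.

```julia
using SparseArrays, LinearAlgebra

# sparse(I, J, V, m, n) retains numerical zeros passed in V as stored entries
A = sparse([1], [1], [0.0], 3, 3)

d = diag(A)       # main diagonal
d1 = diag(A, 1)   # first superdiagonal, which has no stored entries

println(typeof(d))
println(nnz(A))
println(nnz(d))
println(length(d1))
```

Calling `dropzeros!` on the matrix before taking the diagonal would remove the stored zero if that is not wanted.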

tkelman · 2017-08-17T00:46:32Z

base/sparse/sparsematrix.jl

-    (r1 > r2) && (return (zero(Tv), j+1))
-    r1 = searchsortedfirst(A.rowval, j, r1, r2, Forward)
-    (((r1 > r2) || (A.rowval[r1] != j)) ? zero(Tv) : A.nzval[r1], j+1)
+function diag(A::SparseMatrixCSC{Tv,Ti}, d::Int=0) where {Tv,Ti}


looks like most of the other methods allow Integer for the second argument

Sacha0 · 2017-08-17T19:12:00Z

base/sparse/sparsematrix.jl

+        throw(ArgumentError("requested diagonal, $d, out of bounds in matrix of size ($m, $n)"))
+    end
+    l = d < 0 ? min(m+d,n) : min(n-d,m)
+    r, c = d <= 0 ? (-d, 0) : (0, d) # start row/col -1


Perhaps either type r and c or use zero on the right side to avoid potential type instability?

Converted d directly to Int so should be ok now :)
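The instability in question shows up when `d` is an `Integer` other than `Int`. The sketch below uses two hypothetical helpers, `offsets_mixed` and `offsets_int`, written only for this illustration; they are not functions from the PR.

```julia
# Mirrors the line under review: the tuple mixes typeof(d) with Int literals,
# and which slot holds which type depends on the sign of d.
offsets_mixed(d::Integer) = d <= 0 ? (-d, 0) : (0, d)

# Converting to Int first makes both branches return Tuple{Int,Int}.
function offsets_int(d::Integer)
    k = Int(d)
    return k <= 0 ? (-k, 0) : (0, k)
end

println(typeof(offsets_mixed(Int8(2))))
println(typeof(offsets_mixed(Int8(-2))))
println(typeof(offsets_int(Int8(2))))
println(typeof(offsets_int(Int8(-2))))
```

With an `Int8` argument the mixed version returns differently typed tuples for positive and negative `d`, so `r` and `c` cannot be inferred as a single concrete type.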

Sacha0 · 2017-08-17T19:12:38Z

base/sparse/sparsematrix.jl

+    ind = Vector{Ti}()
+    val = Vector{Tv}()
+    for i in 1:l
+        r += 1; c += 1


Likewise here, either type r and c or use oneunit on the right side to avoid potential type instability?

Also fixed by converting to Int directly.

Sacha0 · 2017-08-17T19:14:42Z

base/sparse/sparsematrix.jl

-    for d in SpDiagIterator(A)
-        s += d
+    for i in 1:n
+        s += A[i,i]


Does this version perform similarly to the original? Sparse getindex is fairly complex / expensive.

Shouldn't it do pretty much exactly what the sparse iterator was doing? You still gotta search, right?

I expect so, at least roughly. But my conjectures often fail, so explicit verification has become my friend :).

Some benchmarking suggested this was faster. The iterator was even allocating some stuff. I will get back with numbers :)

    using BenchmarkTools

    function trace2(A::SparseMatrixCSC{Tv}) where Tv
        n = Base.LinAlg.checksquare(A)
        s = zero(Tv)
        for i in 1:n
            s += A[i,i]
        end
        return s
    end

    for s in (1000, 5000), p in (0.1, 0.01, 0.005)
        S1 = sprand(s, s, p)
        S2 = S1 + speye(s, s) # typical case with values on all diagonal positions
        println("trace")
        @btime trace($S1)
        @btime trace($S2)
        println("trace2")
        @btime trace2($S1)
        @btime trace2($S2)
    end

with output:

    trace
      24.820 μs (6 allocations: 176 bytes)
      26.273 μs (6 allocations: 176 bytes)
    trace2
      25.804 μs (0 allocations: 0 bytes)
      25.010 μs (0 allocations: 0 bytes)
    trace
      11.110 μs (6 allocations: 176 bytes)
      11.680 μs (6 allocations: 176 bytes)
    trace2
      9.736 μs (0 allocations: 0 bytes)
      10.217 μs (0 allocations: 0 bytes)
    trace
      9.370 μs (6 allocations: 176 bytes)
      9.862 μs (6 allocations: 176 bytes)
    trace2
      7.443 μs (0 allocations: 0 bytes)
      7.938 μs (0 allocations: 0 bytes)
    trace
      204.548 μs (6 allocations: 176 bytes)
      218.412 μs (6 allocations: 176 bytes)
    trace2
      197.253 μs (0 allocations: 0 bytes)
      210.470 μs (0 allocations: 0 bytes)
    trace
      117.837 μs (6 allocations: 176 bytes)
      120.147 μs (6 allocations: 176 bytes)
    trace2
      112.441 μs (0 allocations: 0 bytes)
      113.419 μs (0 allocations: 0 bytes)
    trace
      104.461 μs (6 allocations: 176 bytes)
      106.751 μs (6 allocations: 176 bytes)
    trace2
      97.655 μs (0 allocations: 0 bytes)
      100.329 μs (0 allocations: 0 bytes)

Sacha0 · 2017-08-17T19:17:53Z

base/sparse/sparsematrix.jl

+        r1 = Int(A.colptr[c])
+        r2 = Int(A.colptr[c+1]-1)
+        r1 > r2 && continue
+        r1 = searchsortedfirst(A.rowval, r, r1, r2, Forward)


IIRC searchsortedfirst always returns an Int?
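For readers following the hunks, the per-column search can be assembled into one function roughly as below. This is a sketch pieced together from the diff fragments quoted in this thread, not the merged code: `diag_sketch` is a hypothetical name, the bounds condition is an assumption (only the error message appears above), and the imports assume a current Julia with the `SparseArrays` standard library.

```julia
using SparseArrays, LinearAlgebra
using Base.Order: Forward

# Hypothetical reconstruction for illustration only.
function diag_sketch(A::SparseMatrixCSC{Tv,Ti}, d::Integer=0) where {Tv,Ti}
    m, n = size(A)
    k = Int(d)  # convert once so r and c below are always Int
    # Assumed bounds check; the thread shows only the error message.
    if !(-m <= k <= n)
        throw(ArgumentError("requested diagonal, $k, out of bounds in matrix of size ($m, $n)"))
    end
    l = k < 0 ? min(m + k, n) : min(n - k, m)  # length of the k-th diagonal
    r, c = k <= 0 ? (-k, 0) : (0, k)           # start row/col minus one
    ind = Vector{Ti}()
    val = Vector{Tv}()
    for i in 1:l
        r += 1; c += 1
        r1 = Int(A.colptr[c])          # first storage index of column c
        r2 = Int(A.colptr[c + 1] - 1)  # last storage index of column c
        r1 > r2 && continue            # column c has no stored entries
        # binary search for row r among the sorted row indices of column c
        r1 = searchsortedfirst(A.rowval, r, r1, r2, Forward)
        (r1 > r2 || A.rowval[r1] != r) && continue  # (r, c) is not stored
        push!(ind, i)
        push!(val, A.nzval[r1])        # stored zeros are kept as stored entries
    end
    return SparseVector{Tv,Ti}(l, ind, val)
end

S = sparse([1, 2, 3, 1], [1, 2, 3, 3], [1.0, 0.0, 3.0, 4.0], 3, 4)
println(diag_sketch(S))
println(diag_sketch(S, 2))
```

Each diagonal position costs one binary search within a single column, which is essentially the work sparse `getindex` does for `A[i,i]`.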

Sacha0 · 2017-08-17T19:19:12Z

test/sparse/sparse.jl

+        S3 = sprand(T,  5, 10, 0.5)
+        for S in (S1, S2, S3)
+            A = Matrix(S)
+            @test diag(S)::SparseVector{T,Int}  == diag(A)


Extra space before ==?

Sacha0 · 2017-08-17T19:19:21Z

test/sparse/sparse.jl

+            A = Matrix(S)
+            @test diag(S)::SparseVector{T,Int}  == diag(A)
+            for k in -size(S,1):size(S,2)
+                @test diag(S, k)::SparseVector{T,Int}  == diag(A, k)


Extra space before ==?

Sacha0

Thanks @fredrikekre! :)

The Travis macOS failure appears unrelated.

ararslan added the "linear algebra" and "sparse" labels Aug 14, 2017

ararslan requested a review from andreasnoack August 14, 2017 22:32

Sacha0 reviewed Aug 15, 2017

andreasnoack approved these changes Aug 15, 2017

fredrikekre force-pushed the fe/sparse-diag branch from d4abafb to 5fc8200 Compare August 16, 2017 14:58

fredrikekre requested a review from Sacha0 August 16, 2017 15:00

fredrikekre requested a review from andreasnoack August 16, 2017 15:09

tkelman reviewed Aug 16, 2017

fredrikekre added 3 commits August 17, 2017 02:23

diag of SparseMatrixCSC should always return SparseVector

0c7863b

rewrite SpDiagIterator to accept any diagonal

7af9ab2

remove SpDiagIterator

00905db

fredrikekre force-pushed the fe/sparse-diag branch from 5fc8200 to 00905db Compare August 17, 2017 00:23

tkelman reviewed Aug 17, 2017

Int -> Integer, simplifications

a77a206

fredrikekre changed the title ~~[WIP] diag of SparseMatrixCSC should always return SparseVector~~ diag of SparseMatrixCSC should always return SparseVector Aug 17, 2017

test that stored zeros are still stored zeros in the diag

17eef91

fredrikekre force-pushed the fe/sparse-diag branch from e622548 to 17eef91 Compare August 17, 2017 09:30

Sacha0 reviewed Aug 17, 2017

remove type instability for non Int d

5530e4d

Sacha0 approved these changes Aug 18, 2017

fredrikekre merged commit 5fd053f into master Aug 18, 2017

fredrikekre deleted the fe/sparse-diag branch August 18, 2017 21:56

Sacha0 mentioned this pull request Nov 26, 2024

diag specializations for structured matrices yield dense vectors JuliaLang/LinearAlgebra.jl#478

Closed

cormullion mentioned this pull request Mar 18, 2018

NEWS.md is getting a bit untidy #26508

Closed
