improve performance for Matrix -> SparseMatrix #26334

KristofferC · 2018-03-06T10:38:20Z

Benchmark (sparse2 is PR):

A1 = rand(3000, 3000)
A2 = Matrix(sprand(3000, 3000, 0.1))
A3 = Matrix(sprand(3000, 3000, 0.01))

@btime sparse(A1)
  114.613 ms (11 allocations: 206.04 MiB)
@btime sparse(A2)
  27.586 ms (11 allocations: 20.61 MiB)
@btime sparse(A3)
  14.071 ms (11 allocations: 2.09 MiB)

@btime sparse2(A1)
  61.808 ms (7 allocations: 137.35 MiB)
@btime sparse2(A2)
  22.060 ms (7 allocations: 13.73 MiB)
@btime sparse2(A3)
  11.622 ms (7 allocations: 1.39 MiB

KristofferC · 2018-03-06T10:53:50Z

stdlib/SparseArrays/src/sparsematrix.jl

@@ -376,6 +376,27 @@ function SparseMatrixCSC{Tv,Ti}(M::AbstractMatrix) where {Tv,Ti}
    eltypeTvV = convert(Vector{Tv}, V)
    return sparse_IJ_sorted!(eltypeTiI, eltypeTiJ, eltypeTvV, size(M)...)
 end
+function SparseMatrixCSC{Tv,Ti}(M::Matrix) where {Tv,Ti}
+    nz = count(!equalto(0), M)


This count is needed to avoid using push! which unfortunately is slow due to its overhead (#24909).

Don't you want !equalto(zero(Tv)) here? Thinking about matrices with SVector entries.

chethega · 2018-03-06T11:34:35Z

stdlib/SparseArrays/src/sparsematrix.jl

@@ -376,6 +376,27 @@ function SparseMatrixCSC{Tv,Ti}(M::AbstractMatrix) where {Tv,Ti}
    eltypeTvV = convert(Vector{Tv}, V)
    return sparse_IJ_sorted!(eltypeTiI, eltypeTiJ, eltypeTvV, size(M)...)
 end
+function SparseMatrixCSC{Tv,Ti}(M::Matrix) where {Tv,Ti}
+    nz = count(!equalto(0), M)


tv might not be concrete, and the matrix may contain missing or (more realistically) be have SVector entries.

Maybe nz = count(x->true===iszero(x), M)? That way we only lose type information of zeros.

Not sure about performance implications (sorry for double-posting, failed at github-ui).

No, this is a pure performance improvement over the current function which uses findnz which does nnzA = count(t -> t != 0, A) and if Aij != 0. Changing the sparse matrix library to work more complicated elements is out of scope for this PR.

Note that equalto uses isequal, so this will change the behavior for -0.0.

CodeLenz · 2018-03-06T13:56:03Z

stdlib/SparseArrays/src/sparsematrix.jl

+    end
+    return SparseMatrixCSC(size(M, 1), size(M, 2), colptr, rowval, nzval)
+end
+
 # converting from SparseMatrixCSC to other matrix types
 function Matrix(S::SparseMatrixCSC{Tv}) where Tv


Hi.

You are using

size(M, 2)

3 times and

size(M,1)

two times. Would not be faster to define two variables with those values and than use them? Or the interpreter can optimize it anyway ?

That's a valid comment. However, in this case, even if the compiler doesn't optimize it away, the time to access these fields should be completely insignificant to the rest of the code. The question here is if it makes the code more legible to store them in m and n, for example. I'm not sure.

Indeed! Thanks for your answer. Sometimes I wonder if the loops can be (somehow) optimized in those cases.

mbauman · 2018-03-06T15:02:17Z

Is there any reason why we wouldn't use this implementation for all AbstractMatrixes? I suppose it'd probably be slower for sparse matrices… but most every other matrix implementation should be faster like this, no?

KristofferC · 2018-03-06T15:20:59Z

There are so many types of matrices I am not sure exactly what you need to fulfill the API. For example, this assumes column major and that indices starts at 1 etc. I was just playing it safe with Matrix.

andreasnoack · 2018-03-06T20:24:27Z

Extending to StridedMatrix should be safe though.

improve performance for Matrix -> SparseMatrix

0e793ea

KristofferC requested a review from andreasnoack March 6, 2018 10:38

KristofferC commented Mar 6, 2018

View reviewed changes

chethega reviewed Mar 6, 2018

View reviewed changes

do not use equalto

c517e26

CodeLenz reviewed Mar 6, 2018

View reviewed changes

mbauman added sparse Sparse arrays performance Must go faster labels Mar 6, 2018

andreasnoack approved these changes Mar 6, 2018

View reviewed changes

Update sparsematrix.jl

99be4c2

andreasnoack merged commit 6e0c299 into master Mar 7, 2018

andreasnoack deleted the kc/array_to_sparse branch March 7, 2018 07:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improve performance for Matrix -> SparseMatrix #26334

improve performance for Matrix -> SparseMatrix #26334

KristofferC commented Mar 6, 2018 •

edited

Loading

KristofferC Mar 6, 2018 •

edited

Loading

chethega Mar 6, 2018

fredrikekre Mar 6, 2018

chethega Mar 6, 2018

KristofferC Mar 6, 2018

nalimilan Mar 6, 2018

CodeLenz Mar 6, 2018 •

edited

Loading

KristofferC Mar 6, 2018

CodeLenz Mar 6, 2018

mbauman commented Mar 6, 2018

KristofferC commented Mar 6, 2018

andreasnoack commented Mar 6, 2018

improve performance for Matrix -> SparseMatrix #26334

improve performance for Matrix -> SparseMatrix #26334

Conversation

KristofferC commented Mar 6, 2018 • edited Loading

KristofferC Mar 6, 2018 • edited Loading

Choose a reason for hiding this comment

chethega Mar 6, 2018

Choose a reason for hiding this comment

fredrikekre Mar 6, 2018

Choose a reason for hiding this comment

chethega Mar 6, 2018

Choose a reason for hiding this comment

KristofferC Mar 6, 2018

Choose a reason for hiding this comment

nalimilan Mar 6, 2018

Choose a reason for hiding this comment

CodeLenz Mar 6, 2018 • edited Loading

Choose a reason for hiding this comment

KristofferC Mar 6, 2018

Choose a reason for hiding this comment

CodeLenz Mar 6, 2018

Choose a reason for hiding this comment

mbauman commented Mar 6, 2018

KristofferC commented Mar 6, 2018

andreasnoack commented Mar 6, 2018

KristofferC commented Mar 6, 2018 •

edited

Loading

KristofferC Mar 6, 2018 •

edited

Loading

CodeLenz Mar 6, 2018 •

edited

Loading