broadcast!(f, C) for sparse vector/matrix C #19934

Sacha0 · 2017-01-08T21:29:08Z

This pull request provides broadcast!(f, C) for sparse vector/matrix C, with semantics close to those for generic two-argument broadcast!. Specifically, this implementation checks at runtime whether f() yields zero. If f() does yield zero, this implementation empties C and returns without additional f() calls. If f() does not yield zero, this implementation densifies and fills C via independent f() calls. (@stevengj, thoughts on these semantics?) Best!

tkelman · 2017-01-08T21:53:27Z

test/sparse/higherorderfns.jl

+    @test broadcast!(() -> 0, V) == sparse(broadcast!(() -> 0, fV))
+    @test broadcast!(() -> 0, C) == sparse(broadcast!(() -> 0, fC))
+    @test let z = 0, fz = 0; broadcast!(() -> z += 1, V) == broadcast!(() -> fz += 1, fV); end
+    @test let z = 0, fz = 0; broadcast!(() -> z += 1, C) == broadcast!(() -> fz += 1, fC); end


are you wanting to test the end values of z, fz?

Only the contents of V/fV and C/fC (which implicitly test the end values of z and fz)?

Sacha0 · 2017-01-09T01:30:36Z

(AV x86_64 failure unrelated.)

Sacha0 · 2017-01-15T07:20:58Z

Any thoughts on these semantics? If not, and absent objections or requests for time, I plan to merge this Monday morning PST. Best!

martinholters · 2017-01-19T19:24:02Z

Oh, I don't know why I missed this one, but actually, I do find the semantics a bit dubious: If f is assumed pure, independent calls seem wasteful, otherwise, just checking whether it yields zero once is insufficient. Wouldn't e.g. f() = Rand(Bool) behave funnily?

Sacha0 · 2017-01-20T00:07:35Z

I do find the semantics a bit dubious: If f is assumed pure, independent calls seem wasteful, otherwise, just checking whether it yields zero once is insufficient. Wouldn't e.g. f() = Rand(Bool) behave funnily?

Agreed on all points in principle :). Thoughts on a (presently realizable) better compromise between the generic AbstractArray code's behavior for broadcast!(f, C) and sparse broadcast[!]'s behavior elsewhere? Thanks!

martinholters · 2017-01-20T07:44:29Z

I see the following options:

Clearly document that broadcasting a non-pure function is UB and change broadcast!(f, C) for AbstractArrays to only evaluate f once to keep people from relying on any other behavior. This would leave room for future optimizations, e.g. freedom for auto-parallelization or when broadcasting from a vector v into a matrix m, just evaluate f once for every element of v and then make copies to fill the matrix m. However, this will likely surprise people trying to fill an array with independent instances by broadcasting a constructor.
Always evaluate f exactly once for each element of the destination array. Consistent, least surprising, extremely wasteful for sparse arrays and pure, zero-preserving f.
Guess the programmers intent to do the right thing at the cost of inconsistency. If well done, will often give good performance while doing what's desired, but may probably lead to very subtle bugs in generic code.
Allow call-site annotation to decide between 1 and 2, e.g. by a macro that rewrites broadcast[!] and . expressions to broadcast_chosen_option[!]. Would nicely cover all cases, but requires the most effort to realize and makes the otherwise highly usable dot-notation more obscure.

None of these are completely satisfying, my gut feeling is to prefer 4.

martinholters · 2017-01-20T07:57:42Z

The inlining of constants makes the present state really hairy:

foo(p) = rand() < p ? rand() : 0.0

x = spzeros(100)
x .= foo.(0.1)

y = spzeros(100)
p = 0.1
y .= foo.(p)

After this, y is a sparse vector with about 10% stored non-zero values as one would expect. OTOH, x has with 90% probability no stored values, and with 10% probability 100 stored values, of which about 90% are zero.

tkelman · 2017-01-20T11:33:13Z

The inlining of constants makes the present state really hairy:

The fact that it's not that hard to write code where that optimization ends up changing semantics like this makes me think it's not an unequivocally good thing to be doing in all cases. Why are we doing it in lowering instead of letting usual LLVM constant propagation optimizations do their thing more generally and safely?

Sacha0 · 2017-01-20T20:30:04Z

I see the following options:

Background reading re. broadcast!(f, A)'s semantics in general: #12277.

The inlining of constants makes the present state really hairy:

Wonderful example :).

The fact that it's not that hard to write code where that optimization ends up changing semantics like this makes me think it's not an unequivocally good thing to be doing in all cases. Why are we doing it in lowering instead of letting usual LLVM constant propagation optimizations do their thing more generally and safely?

cc @stevengj. Best!

StefanKarpinski · 2017-01-21T03:39:27Z

For now I would lean towards option 1: making this edge case explicitly undefined. That gives us some leeway to figure out the right thing in the next release (which might still be letting it be ub).

stevengj · 2017-01-21T12:31:21Z

Option 5 would be to document that broadcast assumes purity for sparse arrays only?

In algorithms with dense arrays, it's pretty useful to be able to do e.g. x .+= rand.() in stochastic algorithms, so that you don't have to allocate an auxiliary array of random numbers.

In algorithms with sparse arrays, using stochastic functions like this doesn't really make sense because they destroy sparsity, and using non-pure functions in general will defeat the ability to detect whether broadcast preserves sparsity (which is essential for performance).

martinholters · 2017-01-23T10:58:35Z

Option 5 would be option 3 with documentation. Getting that documentation right/exactly specifying the behavior is tricky, though. Consider

x .= f.(a, b)

For what combinations of sparse/dense a, b, and x would f be guaranteed to be evaluated exactly length(x) times? Is only the type of x of importance here? And for f.(a, b) also the return type? Is Diagonal sparse? Is UpperTriangular?

But even with this sorted out in a reasonable way, I still think that if the semantics differ by input type even though the same semantics could be applied for all types, this is very likely a severe gotcha when writing generic code.

StefanKarpinski · 2017-01-25T22:33:59Z

Uncertainty about this is why I'm in favor of leaving this undefined until 1.0 – hopefully we'll figure out the right choice in the next several months and then we can make that the defined behavior.

martinholters · 2017-01-26T08:34:01Z

Of course, we could combine options 3 (or 5) and 4: Do something reasonable by default, but allow opting in to assume-pure or assume-non-pure behavior.

x .= f.(a, b) # semantics type dependent
@assume_pure x .= f.(a,b) # may do all kinds of optimizations to reduce number of times f is called
@assume_non_pure x .= f.(a,b) # exactly length(x) calls to f in linear indexing order within a single thread

These annotations would be macros (with better names, hopefully) that do the same transformations that lowering does, but replacing broadcast! with broadcast_pure! and broadcast_non_pure!. (Similar for the non-! case, of course).

Implement and test broadcast!(f, C) for sparse vector/matrix C.

4b23f42

Sacha0 force-pushed the twoargspbcbang branch from 8c447d4 to 4b23f42 Compare January 8, 2017 21:30

Sacha0 added sparse Sparse arrays broadcast Applying a function over a collection labels Jan 8, 2017

Sacha0 added this to the 0.6.0 milestone Jan 8, 2017

tkelman reviewed Jan 8, 2017

View reviewed changes

Sacha0 merged commit e6d0943 into JuliaLang:master Jan 19, 2017

Sacha0 deleted the twoargspbcbang branch January 19, 2017 17:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

broadcast!(f, C) for sparse vector/matrix C #19934

broadcast!(f, C) for sparse vector/matrix C #19934

Sacha0 commented Jan 8, 2017

tkelman Jan 8, 2017

Sacha0 Jan 8, 2017

Sacha0 commented Jan 9, 2017 •

edited

Loading

Sacha0 commented Jan 15, 2017 •

edited

Loading

martinholters commented Jan 19, 2017

Sacha0 commented Jan 20, 2017

martinholters commented Jan 20, 2017

martinholters commented Jan 20, 2017

tkelman commented Jan 20, 2017

Sacha0 commented Jan 20, 2017

StefanKarpinski commented Jan 21, 2017

stevengj commented Jan 21, 2017 •

edited

Loading

martinholters commented Jan 23, 2017

StefanKarpinski commented Jan 25, 2017

martinholters commented Jan 26, 2017

broadcast!(f, C) for sparse vector/matrix C #19934

broadcast!(f, C) for sparse vector/matrix C #19934

Conversation

Sacha0 commented Jan 8, 2017

tkelman Jan 8, 2017

Choose a reason for hiding this comment

Sacha0 Jan 8, 2017

Choose a reason for hiding this comment

Sacha0 commented Jan 9, 2017 • edited Loading

Sacha0 commented Jan 15, 2017 • edited Loading

martinholters commented Jan 19, 2017

Sacha0 commented Jan 20, 2017

martinholters commented Jan 20, 2017

martinholters commented Jan 20, 2017

tkelman commented Jan 20, 2017

Sacha0 commented Jan 20, 2017

StefanKarpinski commented Jan 21, 2017

stevengj commented Jan 21, 2017 • edited Loading

martinholters commented Jan 23, 2017

StefanKarpinski commented Jan 25, 2017

martinholters commented Jan 26, 2017

Sacha0 commented Jan 9, 2017 •

edited

Loading

Sacha0 commented Jan 15, 2017 •

edited

Loading

stevengj commented Jan 21, 2017 •

edited

Loading