issue-pr: extend sparse broadcast[!] to one- and two-dimensional Arrays #20007

Sacha0 · 2017-01-13T00:35:07Z

This pull request extends sparse broadcast[!] to one- and two-dimensional Arrays (via promotion to sparse, as with structured matrices), addressing #11474:

julia> sp = sprand(3,3,0.2)
3×3 sparse matrix with 2 Float64 stored entries:
  [2, 1]  =  0.475801
  [2, 3]  =  0.851515

julia> broadcast(*, [1,1,1], sp)
3×3 sparse matrix with 2 Float64 stored entries:
  [2, 1]  =  0.475801
  [2, 3]  =  0.851515

Though it works, this pull request probably should not be merged (or only as a stopgap). It serves mainly to illustrate an issue with the broadcast containertype promotion mechanism. The comments in this pull request describe that issue at length (partly copied below) followed by an illustration. I plan to sketch an implementation of the proposal in those comments in another PR. Best!

# Issue: `Array` has multiple meanings in the broadcast containertype promotion mechanism:
# (1) `Array`, in the sense of "this object is an Array (or this is a collection of Arrays)";
# (2) `AbstractArray`, in the sense of "this object is an AbstractArray (or this is a collection
#   of AbstractArrays) but where that AbstractArray is (/a subset of that collection of
#   AbstractArrays are) not of a specific AbstractArray subtype for which special
#   handling is defined;
# (3) `AbstractArray`, in the sense of "this is a collection of objects that need be funneled
#   to the generic AbstractArray broadcast code".
# Presently we conflate these three meanings in `Array`. Conflating meanings (2) and (3)
# might be fine, but conflating meaning (1) with the other two prevents separating objects
# that are Arrays (from e.g. collections of Arrays and Tuples) for special handling, as is
# necessary e.g. to handle Arrays in sparse broadcast.
#
# We should probably disambiguate these meanings in Broadcast. One approach to doing so
# would be to replace `Array` with `AbstractArray` in the existing containertype promotion
# mechanism, and then separately define containertype promotion methods for Array as we do
# for Tuple.
#
# The tricky question is what to do about the promote_containertype(ct, ::Type{Array}) = Array
# (after the change suggested above, promote_containertype(ct, ::Type{AbstractArray})) = AbstractArray)
# methods. These methods are ambiguity magnets, and there is a tendency to write similar
# such methods whenever extending broadcast to a new type, which ultimately results
# in having to write a combinatorial explosion of ambiguity-killing methods.
#
# Perhaps the following makes a reasonable model: don't define methods like
# promote_containertype(ct, ::Type{Array}) = Array, and discourage definition of such
# methods for new types. Instead (1) define promote_containertype(cta, ctb) = AbstractArray as
# primary fallback, such that any type pair without clearly defined behavior gets funneled
# to the generic AbstractArray broadcast code, and (2) encourage writing only explicit, tight
# promote_containertype definitions.
#
# The above should improve the extensibility and maintainability of Broadcast.

…unate Broadcast complexity combinatorial explosion edition).

martinholters · 2017-01-13T10:15:17Z

Could a mechanism like the for promote_type/promote_rule be used to keep the combinatorial explosion in check?

nalimilan · 2017-01-13T14:01:03Z

base/sparse/higherorderfns.jl

+promote_containertype(::Type{Matrix}, ::Type{Any}) =  Matrix
+promote_containertype(::Type{Any}, ::Type{Matrix}) = Matrix
+
+promote_containertype(::Type{Vector}, ::Type{Matrix}) = Matrix


Couldn't you replace definitions like this one with a general definition along the lines of this? Would it help fixing the problem?

promote_containertype{S<:Array, T<:Array}(::Type{S}, ::Type{T}) = Array{promote_type(eltype(S), eltype(T)), max(ndim(S), ndim(T))}`

(The element type isn't needed here of course, but I don't know how to create the Array{T, N} type without speciying T.)

Sacha0 · 2017-01-13T19:24:53Z

Could a mechanism like the for promote_type/promote_rule be used to keep the combinatorial explosion in check?

Could you expand? promote_containertype is analogous to promote_type/promote_rule, except without the fallback typejoin logic and the mechanism that makes a single promote_rule(::S, ::T) definition sufficient (rather than an argument-exchange-symmetric pair). (Something like the latter mechanism would be nice and implementation should be straightforward, but would at best curb the explosion of necessary definitions by a factor of two?)

Couldn't you replace definitions like this one with a general definition along the lines of this? Would it help fixing the problem?

Though differentiating Arrays by dimension in Broadcast's containertype promotion mechanism makes life easier for consumers of Broadcast that must separate Arrays by dimension (e.g. sparse broadcast[!]), doing so forces all consumers of Broadcast to handle that additional complexity. Conjecturing that most consumers of Broadcast do not need to separate Arrays by dimension (?), and avoiding forcing additional complexity on the many outweighing the convenience of the few, I would advocate heading the other direction: not differentiating Arrays by dimension in Broadcast's containertype promotion mechanism, and instead requiring those few consumers of Broadcast that must separate Arrays by dimension to implement that logic internally (as in e.g. #20009). (If implementation works out, the changes I suggest above might make life easier for both groups, including the logic in #20009.)

Thanks and best!

nalimilan · 2017-01-13T21:12:13Z

Makes sense. What's the problem then? :-)

Sacha0 · 2017-01-14T00:18:11Z

What's the problem then? :-)

The comments in the second/terminating code block in the original post describe the (two) issues, though not eloquently. A little expansion below:

The first issue: Broadcast's containertype promotion mechanism uses Array to indicate any of a number of distinct concepts. For example, Array can simply mean "this object is an Array", but it can also mean "this is a collection including three different subtypes of AbstractArray, a pair of Tuples, and (for good measure) a Nullable". When extending Broadcast, distinguishing the former from the latter is sometimes necessary; this is the case with sparse broadcast, for example. To do so, you either need extend Broadcast's containertype promotion mechanism with additional types in a manner that impacts other consumers of Broadcast (as in this PR), or you need largely reimplement that mechanism (as in #20009).

The second issue: When teaching Broadcast about a new container type, Broadcast's existing containertype promotion fallbacks encourage doing so in a manner that forces other Broadcast-extenders to write methods taking that new container type into account. These lines illustrate this problem, showing the beginnings of a combinatorial explosion of disambiguating methods.

Best!

tkelman · 2017-01-24T03:49:55Z

shall we close this one?

Sacha0 · 2017-01-25T02:43:28Z

shall we close this one?

Sounds good!

Extend sparse broadcast[!] to one- and two-dimensional Arrays (unfort…

31fc9f9

…unate Broadcast complexity combinatorial explosion edition).

Sacha0 added sparse Sparse arrays broadcast Applying a function over a collection labels Jan 13, 2017

Sacha0 changed the title ~~issue-pr: extend sparse broadcast[!] to one- and two-dimensional arrays~~ issue-pr: extend sparse broadcast[!] to one- and two-dimensional Arrays Jan 13, 2017

Sacha0 mentioned this pull request Jan 13, 2017

extend sparse broadcast to one- and two-dimensional Arrays, better version #20009

Merged

nalimilan self-assigned this Jan 13, 2017

nalimilan reviewed Jan 13, 2017

View reviewed changes

pabloferz mentioned this pull request Jan 18, 2017

Extend sparse broadcast to VecOrMats #20102

Closed

Sacha0 closed this Jan 25, 2017

Sacha0 mentioned this pull request Feb 22, 2017

Wishlist: API and documentation for extending broadcast #20740

Closed

Sacha0 mentioned this pull request Apr 24, 2017

Base.Broadcast.promote_containertype improvements #21534

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

issue-pr: extend sparse broadcast[!] to one- and two-dimensional Arrays #20007

issue-pr: extend sparse broadcast[!] to one- and two-dimensional Arrays #20007

Sacha0 commented Jan 13, 2017 •

edited

Loading

martinholters commented Jan 13, 2017

nalimilan Jan 13, 2017

Sacha0 commented Jan 13, 2017

nalimilan commented Jan 13, 2017

Sacha0 commented Jan 14, 2017

tkelman commented Jan 24, 2017

Sacha0 commented Jan 25, 2017

issue-pr: extend sparse broadcast[!] to one- and two-dimensional Arrays #20007

issue-pr: extend sparse broadcast[!] to one- and two-dimensional Arrays #20007

Conversation

Sacha0 commented Jan 13, 2017 • edited Loading

martinholters commented Jan 13, 2017

nalimilan Jan 13, 2017

Choose a reason for hiding this comment

Sacha0 commented Jan 13, 2017

nalimilan commented Jan 13, 2017

Sacha0 commented Jan 14, 2017

tkelman commented Jan 24, 2017

Sacha0 commented Jan 25, 2017

Sacha0 commented Jan 13, 2017 •

edited

Loading