Define new dot method for abstract vectors #22392

dalum · 2017-06-16T07:33:54Z

#22374 undefines dot between matrices. The concerns raised in #22220 was that dot(::Matrix, ::Matrix) is presently the Euclidean product. This behaviour will be fixed/broken by #22374 and the Euclidean product between matrices will have to be computed by calling vecdot.

There are two main arguments for adding methods between matrices and numbers to dot as in this PR:

numpy defines dot in a similar way (except for the conjugation) between arrays and numbers and arrays and arrays
this allows exploitation of the nested calls to dot between the entries in vectors, where the entries are arrays or numbers to write sums of products as dot products of vectors

(I don't know how to make #22220 point to this new branch, so I'm making a new PR. Sorry for the noise.)

andreasnoack · 2017-06-16T14:05:31Z

@eveydee Sorry for obstructing your PRs. It is great that you contribute. However, I think we should go for the complete separation of dot and vecdot and simply define

dot(x::AbstractVector, y::AbstractVector) = sum(xx'yy for (xx,yy) in zip(x,y)

instead of having dot call vecdot. I believe that would allow everybody to do what they'd like to do with either dot or vecdot.

dalum · 2017-06-16T16:23:44Z

@andreasnoack I'm really glad to receive feedback and hope that I contribute more than cause trouble.

I think yours is a much cleaner solution, but it looks like it comes with a performance cost which is quite significant for normal short vectors:

julia> dott(x::AbstractVector, y::AbstractVector) = sum(xx'yy for (xx,yy) in zip(x,y))
julia> a = 1:4
julia> @benchmark dot(a, a)
BenchmarkTools.Trial: 
  memory estimate:  0 bytes
  allocs estimate:  0
  --------------
  minimum time:     22.109 ns (0.00% GC)
  median time:      22.961 ns (0.00% GC)
  mean time:        23.265 ns (0.00% GC)
  maximum time:     91.116 ns (0.00% GC)
  --------------
  samples:          10000
  evals/sample:     997

julia> @benchmark dott(a, a)
BenchmarkTools.Trial: 
  memory estimate:  128 bytes
  allocs estimate:  3
  --------------
  minimum time:     64.720 ns (0.00% GC)
  median time:      68.734 ns (0.00% GC)
  mean time:        84.562 ns (15.93% GC)
  maximum time:     3.641 μs (97.08% GC)
  --------------
  samples:          10000
  evals/sample:     980

I would imagine this is the reason dot was defined to be equal to vecdot in the first place. 😕

stevengj · 2017-06-16T16:42:43Z

Presumably @andreasnoack's suggestion was mainly for the semantics. We'd probably still want specialized methods for specific types (e.g. arrays of numbers, cases that would call BLAS, arrays of vectors, arrays of matrices, etcetera), and the fallback method should probably be implemented by explicit loops rather than summing a generator.

andreasnoack · 2017-06-16T18:01:45Z

@andreasnoack I'm really glad to receive feedback and hope that I contribute more than cause trouble.

Good to hear that.

As @stevengj suggests, I'm mainly after the semantics here.

It would be great if we had two argument mapreduce. I think we could do something like

dot(x::AbstractVector{<:Number}, y::AbstractVector{<:Number}) = vecdot(x, y)
function dot(x::AbstractVector, y::AbstractVector)
  if length(x) != length(y)
    throw(ArgumentError("something"))
  end
  return isa(first(x)'first(y)) ? vecdot(x,y) : sum(xx'yy for (xx,yy) in zip(x,y))
end

and avoid the overhead in the most important cases. The second version would also be fairly fast in the Number case but it has a branch and can't handle zero length arguments. We could try to get the zero argument version working in more cases but it wouldn't work for Vector{Vector} anyway so I'm not sure it is worth it.

This reverts commit d46cdc7.

dalum · 2017-06-19T07:55:12Z

I have tried to optimise the dot method to the best of my knowledge, by studying the implementations of foldl, reduce, etc. and benchmarking different approaches. I'm not entirely sure if calling @inbounds is safe, but it does make it slightly faster on my machine. Curiously, the BLAS methods are a bit slower for short vectors of numbers, but have a cross-over around vector lengths of 16.

StefanKarpinski · 2017-06-19T15:05:38Z

Curiously, the BLAS methods are a bit slower for short vectors of numbers, but have a cross-over around vector lengths of 16.

This is typical: BLAS calls have a fair amount of overhead, but are highly optimized, so the larger the input, the better they fare; the flip side is that they do less well for smaller inputs.

* Define dot product between Number and AbstractArray * Define dot between abstract arrays * Added docs for dot between arrays * Revert "Define dot product between Number and AbstractArray" This reverts commit d46cdc7. * Define new dot method between AbstractVectors

dalum changed the title ~~Define dot between matrices and numbers~~ Define dot between arrays and numbers Jun 16, 2017

tkelman added the needs tests Unit tests are required for this change label Jun 16, 2017

ararslan added the linear algebra Linear algebra label Jun 16, 2017

Evey Dee added 5 commits June 19, 2017 14:20

Define dot product between Number and AbstractArray

edeb334

Define dot between abstract arrays

483541a

Added docs for dot between arrays

be1bc76

Revert "Define dot product between Number and AbstractArray"

daadff0

This reverts commit d46cdc7.

Define new dot method between AbstractVectors

420da4e

dalum force-pushed the evey/andot branch from 0ead524 to 420da4e Compare June 19, 2017 07:39

dalum changed the title ~~Define dot between arrays and numbers~~ Define new dot method for abstract vectors Jun 19, 2017

andreasnoack removed the needs tests Unit tests are required for this change label Jul 18, 2017

andreasnoack merged commit 8a18928 into JuliaLang:master Jul 18, 2017

dalum deleted the evey/andot branch September 12, 2017 06:24

Jutho mentioned this pull request Dec 15, 2017

recursive vecdot and vecnorm #25093

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Define new dot method for abstract vectors #22392

Define new dot method for abstract vectors #22392

dalum commented Jun 16, 2017 •

edited

Loading

andreasnoack commented Jun 16, 2017 •

edited

Loading

dalum commented Jun 16, 2017 •

edited

Loading

stevengj commented Jun 16, 2017

andreasnoack commented Jun 16, 2017

dalum commented Jun 19, 2017 •

edited

Loading

StefanKarpinski commented Jun 19, 2017 •

edited

Loading

Define new dot method for abstract vectors #22392

Define new dot method for abstract vectors #22392

Conversation

dalum commented Jun 16, 2017 • edited Loading

andreasnoack commented Jun 16, 2017 • edited Loading

dalum commented Jun 16, 2017 • edited Loading

stevengj commented Jun 16, 2017

andreasnoack commented Jun 16, 2017

dalum commented Jun 19, 2017 • edited Loading

StefanKarpinski commented Jun 19, 2017 • edited Loading

dalum commented Jun 16, 2017 •

edited

Loading

andreasnoack commented Jun 16, 2017 •

edited

Loading

dalum commented Jun 16, 2017 •

edited

Loading

dalum commented Jun 19, 2017 •

edited

Loading

StefanKarpinski commented Jun 19, 2017 •

edited

Loading