Improvements to batched_mul, including PermutedDimsArray #187
Conversation
This has changed a lot. The current form uses https://github.com/JuliaMatrices/ArrayLayouts.jl to keep track of dimensions & strides, via traits. See also #191 for an alternative approach. Either of these can be made to work with JuliaGPU/CuArrays.jl#664, which will similarly allow permutations.
This could possibly now be done with https://github.com/SciML/ArrayInterface.jl instead of https://github.com/JuliaMatrices/ArrayLayouts.jl. But I'm hesitant to add dependencies (which would probably need to be added to CUDA.jl too), and I think #191 is a simpler, lower-tech solution.
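A rough illustration (plain Base Julia, no ArrayLayouts) of why stride bookkeeping can stand in for an eager copy: a `(2,1,3)`-`PermutedDimsArray` of a strided array is itself strided, with the first two strides swapped, so a strided BLAS kernel can read it in place instead of going through `permutedims` first. The array sizes here are made up for the example.

```julia
A = randn(3, 4, 5)                    # column-major parent array
P = PermutedDimsArray(A, (2, 1, 3))   # lazy view, no data movement

strides(A)   # (1, 3, 12): column-major strides of the parent
strides(P)   # (3, 1, 12): same memory, dims 1 and 2 swapped
```

Because only the stride tuple changes, dispatching on a strided-layout trait lets `batched_gemm!` consume `P` directly.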
I think that we can allow `batched_gemm!` to be called on many (but not all) `PermutedDimsArray`s, and this is generally much faster than first calling `permutedims`. This PR is an attempt to implement this. It also extends `batched_mul!` to take `α, β` scales like `mul!`.

EARLIER: This PR also aimed to treat `PermutedDimsArray{<:Number,3,(2,1,3)}` as equivalent to `BatchedTranspose` (see Zygote.jl#552, "batched_transpose causes a 'Need an adjoint for constructor NNlib.BatchedTranspose' error") and to allow `batched_adjoint ∘ batched_transpose` to be trivial on real-valued arrays. It also changed `copy(::BatchedAdjoint)` etc., from #100 (implementation for batch-wise matrix multiplication), to return an `Array` like `copy(::Adjoint)`.
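A minimal sketch of the behaviour described above, assuming an NNlib version where this PR's `PermutedDimsArray` support has landed (`batched_mul` and `batched_transpose` are NNlib's API; the sizes are made up):

```julia
using NNlib: batched_mul, batched_transpose

A = randn(3, 4, 5)   # 5 batches of 3×4 matrices
B = randn(3, 6, 5)   # 5 batches of 3×6 matrices

# A lazy (2,1,3) permutation transposes each matrix within its batch
# slice, so it should multiply like batched_transpose(A) — without the
# permutedims copy that would otherwise be needed:
P = PermutedDimsArray(A, (2, 1, 3))

C1 = batched_mul(P, B)                      # 4×6×5
C2 = batched_mul(batched_transpose(A), B)   # same values
C1 ≈ C2
```

The point of the PR is that the first call can hit `batched_gemm!` directly on the strided view, rather than materialising the permutation.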