`im*x` is inefficient #12851

eschnett · 2015-08-28T18:07:06Z

The mathematical expression i x is naturally translated to im*x. Unfortunately, this leads to less efficient code than 1im*x. (This is with LLVM 3.6.1).

julia> f(x)=im*x; @code_native f(1.0)
    .section    __TEXT,__text,regular,pure_instructions
Filename: none
Source line: 0
    pushq   %rbp
    movq    %rsp, %rbp
    vmovaps %xmm0, %xmm1
Source line: 1
    vmovq   %xmm1, %rax
    vxorps  %xmm0, %xmm0, %xmm0
    testq   %rax, %rax
    jns L36
    movabsq $13406539760, %rax      ## imm = 0x31F178FF0
    vmovsd  (%rax), %xmm0
L36:    popq    %rbp
    retq

julia> f(x)=1im*x; @code_native f(1.0)
    .section    __TEXT,__text,regular,pure_instructions
Filename: none
Source line: 0
    pushq   %rbp
    movq    %rsp, %rbp
    vmovaps %xmm0, %xmm1
    vxorps  %xmm0, %xmm0, %xmm0
Source line: 1
    vmulsd  %xmm0, %xmm1, %xmm0
    popq    %rbp
    retq

That is, while the expression 1im*x leads to ideal code (I guess the multiplication by zero has to remain because of nans?), the expression im*x contains a branch, moving values between xmm and general registers, and loads the constant 0.0 from memory (!). Also, the nan semantics are different, and I'd argue they are wrong -- I expect Nan+Nan*im as result:

julia> (im*NaN, 1im*NaN)
(0.0 + NaN*im,NaN + NaN*im)

My guess is that the solution is either to ensure that Complex{Bool} is converted to Complex{Int} before being converted to Complex{Float64}, or to explicitly define complex arithmetic operators that handle Complex{Bool}.

The text was updated successfully, but these errors were encountered:

jiahao · 2015-08-28T18:11:37Z

Relevant: #10000 - the redesign of im as Complex{Bool} unfortunately also no longer respects sign of zero, which can be important when dealing with branch cuts.

eschnett · 2015-08-28T18:12:30Z

I just see that the special case is already there:

*(x::Bool, z::Complex) = ifelse(x, z, zero(z))

I don't think it should use ifelse, at least not for Float32 and Float64.

yuyichao · 2015-08-28T18:18:41Z

Just one more datapoint. im * x is not inlined for some cases (Float32 or Complex64) which disables @simd while 1im is fine in such cases.

eschnett · 2015-08-31T15:50:29Z

These additional definitions

*(x::Real, z::Complex{Bool}) = Complex(x*real(z), x*imag(z))
*(z::Complex{Bool}, x::Real) = Complex(real(z)*x, imag(z)*x)

seem to solve this issue for me. These definitions catch im, which is of the type Complex{Bool}; they mirror the existing special cases for Bool * Complex.

jakebolewski · 2015-09-04T21:03:15Z

Closed by #12887

kshyatt added performance Must go faster complex Complex numbers labels Aug 28, 2015

eschnett mentioned this issue Aug 31, 2015

Optimize several Complex{Bool} operations #12887

Merged

jakebolewski closed this as completed Sep 4, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`im*x` is inefficient #12851

`im*x` is inefficient #12851

eschnett commented Aug 28, 2015

jiahao commented Aug 28, 2015

eschnett commented Aug 28, 2015

yuyichao commented Aug 28, 2015

eschnett commented Aug 31, 2015

jakebolewski commented Sep 4, 2015

im*x is inefficient #12851

im*x is inefficient #12851

Comments

eschnett commented Aug 28, 2015

jiahao commented Aug 28, 2015

eschnett commented Aug 28, 2015

yuyichao commented Aug 28, 2015

eschnett commented Aug 31, 2015

jakebolewski commented Sep 4, 2015

`im*x` is inefficient #12851

`im*x` is inefficient #12851