regression in performance of counting loops #5469

mlubin · 2014-01-22T03:02:06Z

I was playing around with an example for the class I'm preparing and noticed a 30% performance regression in the following code from 0.2 to master:

function normAndMin(x)
    n = 0.0
    m = Inf
    for i in 1:length(x)
        n += x[i]*x[i]
        if x[i] < m
            m = x[i]
        end
    end
    return sqrt(n),m
end

x = rand(100_000_000)
normAndMin(x) # throw away
n,m = @time normAndMin(x)

Julia 0.2:
elapsed time: 0.101895311 seconds (6912 bytes allocated)

Julia master:
elapsed time: 0.135729463 seconds (6812 bytes allocated)

The text was updated successfully, but these errors were encountered:

JeffBezanson · 2014-01-22T03:54:08Z

This is certainly due to 0860767.

mlubin · 2014-01-22T04:34:53Z

Any hope for improvement?

JeffBezanson · 2014-01-22T04:38:03Z

Yes, this can be fiddled with, and we might overhaul integer ranges partly for this purpose.

…t,len this Range1 could be the UnitRange of #5585, with Range1 deprecated also intended to address #5469 (performance)

JeffBezanson · 2014-03-13T03:41:02Z

I have been digging deeply into this today. The way we lower iteration may be interfering with LLVM's ability to recognize loop idioms. The following code is slow:

    state = 1
    while state != l+1
        i = state
        state += 1
        n += x[i]*x[i]
        if x[i] < m
            m = x[i]
        end
    end

And simply moving state += 1 to the end is faster:

    state = 1
    while state != l+1
        i = state
        n += x[i]*x[i]
        if x[i] < m
            m = x[i]
        end
        state += 1
    end

simonster · 2014-03-13T04:42:52Z

It looks like, with state += 1 at the beginning, the if is a branch, whereas with state += 1 at the end, it is optimized into a select. And indeed with:

    state = 1
    while state != l+1
        i = state
        state += 1
        n += x[i]*x[i]
        m = ifelse(x[i] < m, x[i], m)
    end

I seem to get the same performance as with state += 1 at the end.

mlubin · 2014-03-13T05:21:24Z

Cool!

JeffBezanson · 2014-03-13T05:25:37Z

Please verify. Worryingly, this varies a bit by machine, but it's either the same or faster on both machines I've tried so far.

mlubin · 2014-03-13T05:28:56Z

Confirmed back to 0.2 timings on my machine.

timholy · 2014-03-13T10:04:58Z

Very nice.

…t,len this Range1 could be the UnitRange of #5585, with Range1 deprecated also intended to address #5469 (performance)

ghost assigned JeffBezanson Jan 22, 2014

StefanKarpinski mentioned this issue Jan 27, 2014

the rangepocalypse #5585

Closed

JeffBezanson mentioned this issue Mar 12, 2014

performance regressions since 0.2 #6112

Closed

JeffBezanson added a commit that referenced this issue Mar 12, 2014

alternate implementation of Range1 storing start,stop instead of star…

c4f8295

…t,len this Range1 could be the UnitRange of #5585, with Range1 deprecated also intended to address #5469 (performance)

JeffBezanson closed this as completed in 14d0a7d Mar 13, 2014

JeffBezanson mentioned this issue Mar 13, 2014

Task advances on done() not next() ? #6125

Closed

JeffBezanson added a commit that referenced this issue Mar 31, 2014

alternate implementation of Range1 storing start,stop instead of star…

5d12393

…t,len this Range1 could be the UnitRange of #5585, with Range1 deprecated also intended to address #5469 (performance)

JeffBezanson added a commit that referenced this issue Apr 1, 2014

alternate implementation of Range1 storing start,stop instead of star…

ef3c641

…t,len this Range1 could be the UnitRange of #5585, with Range1 deprecated also intended to address #5469 (performance)

timholy mentioned this issue Apr 6, 2014

WIP: Add Cartesian product iteration. Fixes #1917 #6437

Closed

nolta mentioned this issue Nov 30, 2014

WIP: split next into nextval & nextstate #9182

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

regression in performance of counting loops #5469

regression in performance of counting loops #5469

mlubin commented Jan 22, 2014

JeffBezanson commented Jan 22, 2014

mlubin commented Jan 22, 2014

JeffBezanson commented Jan 22, 2014

JeffBezanson commented Mar 13, 2014

simonster commented Mar 13, 2014

mlubin commented Mar 13, 2014

JeffBezanson commented Mar 13, 2014

mlubin commented Mar 13, 2014

timholy commented Mar 13, 2014

regression in performance of counting loops #5469

regression in performance of counting loops #5469

Comments

mlubin commented Jan 22, 2014

JeffBezanson commented Jan 22, 2014

mlubin commented Jan 22, 2014

JeffBezanson commented Jan 22, 2014

JeffBezanson commented Mar 13, 2014

simonster commented Mar 13, 2014

mlubin commented Mar 13, 2014

JeffBezanson commented Mar 13, 2014

mlubin commented Mar 13, 2014

timholy commented Mar 13, 2014