Modernize sampler in testrender #1534

fpsunflower · 2022-07-17T00:04:27Z

The sampler implementation in testrender was very close to the 2020 article from Brent Burley. This patch
catches up to best practices, including a faster owen scrambler from Nathan Vegdahl and recognizing that
the index scrambling can be achived by the same owen scrambler method (which saves a bunch of logic and runs
faster). Finally, avoid the redundant bit reversals and use compiler intrinsics when available to benefit
from hardware instructions (for ARM or GPUs). Finally the integer hash constants were updated to the latest
from the hash prospector project.

While more optimizations are certainly possible, this is a reasonable balance between quality and speed and saves a few lines of code over the previous implementation.

On my laptop, running the microfacet sample from the testsuite at 1024x768 with 4x4 samples goes from 10.2s down to 8.9s (which is roughly 12% faster).

lgritz · 2022-07-17T00:13:50Z

Wow, the sample scrambler itself was accounting for 10% of runtime??

fpsunflower · 2022-07-17T00:17:50Z

Yep! Now that the faster methods are well documented in the literature, might as well use them :)

lgritz · 2022-07-18T15:53:20Z

It's failing one of the unit tests. I know you're looking into that separately, but in the mean time, should we update the ref images (you can have more than one ref image for a test -- the test passes if it matches any of the choices, so sometimes we have multiple that we consider "correct" if we just can't make all platforms match exactly)? Or if it's just one or two pixels we can adjust the amount of allowable difference? Or would you like to simply hold off on merging this until you fully solve #1535?

fpsunflower · 2022-07-19T04:43:33Z

It seems its always the same test giving us trouble. I have one or two more things to try, otherwise I'll just commit this with an extra reference image.

The sampler implementation in testrender was very close to the 2020 article from Brent Burley. This patch catches up to best practices, including a faster owen scrambler from Nathan Vegdahl and recognizing that the index scrambling can be achived by the same owen scrambler method (which saves a bunch of logic and runs faster). Finally, avoid the redundant bit reversals and use compiler intrinsics when available to benefit from hardware instructions (for ARM or GPUs). Finally the integer hash constants were updated to the latest from the hash prospector project. While more optimizations are certainly possible, this is a reasonable balance between quality and speed. Update reference images for new sampler Signed-off-by: Chris Kulla <[email protected]>

lgritz · 2022-07-19T05:43:48Z

I've had trouble with that test before, too. I was never able to figure out why that one, in particular, was so brittle to minor platform differences. And not just a couple LSB errors, it was something that was changing where they rays were going.

The sampler implementation in testrender was very close to the 2020 article from Brent Burley. This patch catches up to best practices, including a faster owen scrambler from Nathan Vegdahl and recognizing that the index scrambling can be achived by the same owen scrambler method (which saves a bunch of logic and runs faster). Finally, avoid the redundant bit reversals and use compiler intrinsics when available to benefit from hardware instructions (for ARM or GPUs). Finally the integer hash constants were updated to the latest from the hash prospector project. While more optimizations are certainly possible, this is a reasonable balance between quality and speed. Update reference images for new sampler Signed-off-by: Chris Kulla <[email protected]>

Relevant changes: * GPU string fix -- needed missing cast (AcademySoftwareFoundation#1553) * Fix typo error that prevented correct typecheck of ternary (AcademySoftwareFoundation#1552) * testrender now supports the standard MaterialX closure: * testrender improvements: modernize sampler (AcademySoftwareFoundation#1534), switch to cone tracing (AcademySoftwareFoundation#1543), support MaterialX closures (AcademySoftwareFoundation#1533, AcademySoftwareFoundation#1536, AcademySoftwareFoundation#1537, AcademySoftwareFoundation#1538, AcademySoftwareFoundation#1541, AcademySoftwareFoundation#1542, AcademySoftwareFoundation#1547) * Bump LLVM to 14.0 * Help reduce PTX nondeterminism (AcademySoftwareFoundation#1570) See merge request spi/dev/3rd-party/osl-feedstock!30

fpsunflower force-pushed the new-sampler branch from b0d18b5 to 5af2b7f Compare July 17, 2022 00:16

aconty approved these changes Jul 18, 2022

View reviewed changes

fpsunflower force-pushed the new-sampler branch from 5af2b7f to 48e7b2b Compare July 19, 2022 05:52

fpsunflower merged commit 95fa78d into AcademySoftwareFoundation:main Jul 19, 2022

fpsunflower deleted the new-sampler branch October 28, 2024 03:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Modernize sampler in testrender #1534

Modernize sampler in testrender #1534

fpsunflower commented Jul 17, 2022

lgritz commented Jul 17, 2022

fpsunflower commented Jul 17, 2022

lgritz commented Jul 18, 2022

fpsunflower commented Jul 19, 2022

lgritz commented Jul 19, 2022

Modernize sampler in testrender #1534

Modernize sampler in testrender #1534

Conversation

fpsunflower commented Jul 17, 2022

lgritz commented Jul 17, 2022

fpsunflower commented Jul 17, 2022

lgritz commented Jul 18, 2022

fpsunflower commented Jul 19, 2022

lgritz commented Jul 19, 2022