
Solari: Dynamic realtime global illumination #10000

Status: Closed (wanted to merge 104 commits)

JMS55 (Contributor) commented Oct 2, 2023

WARNING: Highly experimental, will not be merged anytime soon.

Here's a pretty image so the PR isn't entirely text, but keep in mind there's still a lot of work needed to get this to a generally usable and artifact/bug-free state.

Overview

Solari is an implementation of fully dynamic, fully realtime raytraced global illumination. All lights and objects can move and mutate, contribute indirect lighting (including emissive meshes), and receive indirect light bounces. It's comparable in scope to Unreal Engine 5's Lumen (Fortnite, Lumen in the Land of Nanite), or Nvidia's RTXGI (Cyberpunk 2077).

This is a high end GPU-intensive rendering technique. All testing was done on an RTX 3080 GPU, at both 1080p and 4k, with an initial target budget of around 4ms of GPU time (ideally we can get this down to ~2ms with further optimizations). As it relies on hardware raytracing support, it only works on GPUs such as Nvidia's RTX 2000 series+, and AMD's RX 6000 series+. Currently, this PR is using an experimental fork of wgpu that implements hardware raytracing on the Vulkan backend only. Metal and DirectX12 are not currently supported.

Note that Solari currently only does indirect diffuse lighting. Indirect specular lighting (reflections) will come later, and raytraced direct diffuse and specular lighting much later, as part of a separate sub-plugin of Solari, likely using a different technique.

Current Technique

The following is a simple high-level overview of Solari's current rendering, as details are extremely subject to change and I don't want to have to rewrite this frequently. A detailed breakdown will probably come once everything is actually finished. See the literature section of this PR description for links to each of the techniques.

Like Lumen and specifically GI-1.0, Solari uses a multi-level radiance/irradiance cache. The high level process is as follows:

  • Light probes capture the incoming radiance from all directions at points directly visible from the camera. These probes are reprojected between frames, forming the "screen cache".
  • To get incoming radiance for a probe, a ray is traced out of the probe to a hit point elsewhere in the world. The irradiance at that hit point is obtained by querying the "world cache".
  • The world cache is a pre-allocated, persistent hashmap, where each entry (a "cell") stores irradiance for a large chunk of the world. To determine a cell's irradiance, rays are traced per-cell towards light sources (and optionally other cells).
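To make the world cache concrete, here is a minimal CPU-side sketch of the idea: a hashmap keyed by quantized world position, where each cell holds blended irradiance and a life counter that is reset on access and decayed each frame. All names and constants here (`WorldCache`, `CELL_SIZE`, `CELL_LIFE`) are illustrative assumptions, not Solari's actual implementation, which lives in a pre-allocated GPU storage buffer.

```rust
use std::collections::HashMap;

const CELL_SIZE: f32 = 1.0; // world-space extent of one cell (assumed)
const CELL_LIFE: u32 = 30;  // frames a cell survives without access (assumed)

struct Cell {
    irradiance: [f32; 3], // temporally blended RGB irradiance
    life: u32,            // reset on access, decayed once per frame
}

struct WorldCache {
    cells: HashMap<[i32; 3], Cell>,
}

impl WorldCache {
    fn new() -> Self {
        Self { cells: HashMap::new() }
    }

    /// Quantize a world-space position to a cell key.
    fn key(pos: [f32; 3]) -> [i32; 3] {
        [
            (pos[0] / CELL_SIZE).floor() as i32,
            (pos[1] / CELL_SIZE).floor() as i32,
            (pos[2] / CELL_SIZE).floor() as i32,
        ]
    }

    /// Query the cache at a ray hit point; touching a cell resets its life.
    fn query(&mut self, pos: [f32; 3]) -> [f32; 3] {
        let cell = self.cells.entry(Self::key(pos)).or_insert(Cell {
            irradiance: [0.0; 3],
            life: CELL_LIFE,
        });
        cell.life = CELL_LIFE;
        cell.irradiance
    }

    /// Per-frame decay: cells untouched for CELL_LIFE frames become empty.
    fn decay(&mut self) {
        self.cells.retain(|_, cell| {
            cell.life -= 1;
            cell.life > 0
        });
    }
}
```

The real cache additionally traces rays per cell to update `irradiance` and compacts alive cells for GPU occupancy, as described in the breakdown below; this sketch only models the allocation, lookup, and eviction behavior.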

The final path is as follows: camera -> visible point on screen (probe placed) -> a different point in the world (world cache queried) -> light sources. GI-1.0 has a great illustration of this:

[Image: GI-1.0's illustration of the screen-probe / world-cache gather path]

A more detailed, technical breakdown is as follows:

  1. Screen probes are allocated as a set of 4 cascades of octahedral probes. Relative to the cascade below it, each cascade places probes twice as far apart, doubles the directional resolution (so bigger probes), has half as many total probes, and traces a twice-as-long radiance interval.
  2. The world cache is allocated as a single persistent storage buffer.
  3. Each frame:
     1. Go over the active world cache cells and decay each cell's life value (reset on each access) by 1. If a cell's life reaches 0, it is turned back into an empty cell.
     2. A series of passes compacts the alive world cache cells into a buffer, in order to improve occupancy for the next step.
     3. Each alive world cache cell traces a ray towards light sources, and optionally an extra ray in a random direction. While the first ray gathers light from an analytic light source or emissive mesh, the optional extra ray queries the world cache itself (other cells) for radiance, forming a multi-bounce feedback effect that is particularly important for indoor scenes.
     4. The contribution of the newly traced rays is then temporally blended into the current irradiance of each associated cell.
     5. Screen probes (for each cascade) are reprojected using motion vectors and the current and previous frames' depth buffers. Screen probes are placed directly on visible points in the world, reconstructed from the rasterized depth buffer.
     6. Per cell (octahedral map texel) of each probe, for each cascade, a new ray is traced, and radiance is computed by querying the world cache at the hit point. Each probe traces within a specific radiance interval. The new radiance values are temporally blended into the probes.
     7. Starting from the highest cascade, each cascade recursively merges downwards onto cascade 0, the lowest directional resolution but highest density cascade.
     8. Each probe in the merged cascade 0 is converted to spherical harmonics, which filters out higher (noisy) frequencies and makes the probes cheaper to read in the next pass.
     9. Finally, each pixel on screen interpolates from the spherical harmonics to form the final GI texture. The main lighting passes read from this texture per-pixel as an additional form of indirect light.
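The cascade relationships in step 1 (spacing and directional resolution double, probe count halves, radiance intervals double and stack end-to-end) can be sketched as a small parameter function. The base constants here (`BASE_SPACING`, `BASE_OCT`, `BASE_COUNT`, `BASE_INTERVAL`) are invented for illustration; only the doubling/halving relations come from the description above.

```rust
#[derive(Debug, PartialEq)]
struct CascadeParams {
    probe_spacing: u32,   // pixels between probes (doubles per cascade)
    octahedral_size: u32, // per-probe directional resolution (doubles)
    probe_count: u32,     // total probes (halves per cascade)
    interval_start: f32,  // start of this cascade's radiance interval
    interval_end: f32,    // each cascade's interval is twice as long
}

fn cascade_params(cascade: u32) -> CascadeParams {
    const BASE_SPACING: u32 = 8; // assumed base values, not Solari's
    const BASE_OCT: u32 = 8;
    const BASE_COUNT: u32 = 1 << 15;
    const BASE_INTERVAL: f32 = 1.0;
    // Intervals stack end-to-end: start = 1 + 2 + ... + 2^(c-1) = 2^c - 1
    // (in units of BASE_INTERVAL), and this cascade covers a length of 2^c.
    let start = BASE_INTERVAL * ((1u32 << cascade) as f32 - 1.0);
    let len = BASE_INTERVAL * (1u32 << cascade) as f32;
    CascadeParams {
        probe_spacing: BASE_SPACING << cascade,
        octahedral_size: BASE_OCT << cascade,
        probe_count: BASE_COUNT >> cascade,
        interval_start: start,
        interval_end: start + len,
    }
}
```

The end-to-end stacking of intervals is why the merge in step 3.7 proceeds from the highest cascade downwards: each cascade contributes radiance only for its own distance range, and merging concatenates those ranges onto cascade 0's short near-field interval.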

TODOs

See crates/bevy_pbr/src/solari/todo.txt.

Expect lots of bugs, ghosting, light leaks, quality issues, etc. Especially in non-toy scenes. Anything in bevy_pbr::solari::scene should be treated as temporary, and only written in order to let me work on the actual shader techniques.

This is a draft PR, and will likely remain that way for a long while. I've been working on this basically solo for months already, and expect to continue doing so for many more months. That said, I'm putting this out there in the hopes of raising additional help developing Solari. If you're interested in helping with the development, please reach out on the rendering-dev channel in Bevy's discord!

Blockers

All of these are currently either worked around in Bevy (along with other hacks) or hacked around in forked dependencies.

Other missing nice-to-haves for better performance:

  • Subgroup operation support in wgsl
  • Async compute for building acceleration structures
  • Async compute

Literature

Thanks

Finally, I'd like to thank Alexander Sannikov for his help understanding and implementing his radiance cascades technique, @daniel-keitel for their work implementing raytracing support in wgpu, and countless others in the Bevy discord, wgpu/naga communities, and other graphics forums that have given me advice on this project. Far too many people to name individually :)

@alice-i-cecile alice-i-cecile added C-Feature A new feature, making something new possible A-Rendering Drawing game state to the screen D-Complex Quite challenging from either a design or technical perspective. Ask for help! labels Oct 2, 2023
@JMS55 JMS55 added the S-Blocked This cannot move forward until something else changes label Oct 3, 2023
@JMS55 JMS55 mentioned this pull request Oct 3, 2023
@entropylost (Contributor) commented:
@JMS55 What happened? Did it just get too far behind or something?

@JMS55 (Contributor, Author) commented Aug 9, 2024:

A couple of things:

  • wgpu raytracing support never got finished, and maintaining forks of wgpu/naga/naga_oil/bevy got more and more painful.
  • The algorithms/implementations I was trying out were flawed. Nowadays I feel like ReSTIR-based methods have a lot more promise than screen-space probes, radiance cascades or otherwise. The probes were just too finicky. Additionally, my world-space radiance cache was poorly/incorrectly implemented. If I were to start the project over, I would forgo the world-space cache until I was confident in the final gather and denoising, and only then start working on it.
  • I ended up devoting 95% of my Bevy development time to meshlets, which is making a lot more progress and isn't blocked (anymore) on wgpu features.

I am really looking forward to coming back to this in the future, but wgpu needs to support raytracing first. Realtime RT-GI is super cool, and since I started this project there's been even more exciting papers, but I don't have the time or motivation to keep trying to patch wgpu and bevy.

If anyone would like to see this work continue, please contribute raytracing support to wgpu along with the tests it needs to get merged.
