New inflation layer with optional OpenMP acceleration #51
Closed
tonynajjar wants to merge 1 commit into main_dexory
Signed-off-by: Tony Najjar <tony.najjar@dexory.com>
4ce992e to c3e0f67
Benchmark Comparison Summary
Test Environments
Dev Machine (ubuntu@dexory)
Robot (arri-74)
Performance Comparison (Key Benchmarks)
1000×1000 Grid (1M cells, 50% occupancy, 2 m inflation radius)
2000×2000 Grid (4M cells, 50% occupancy, 2 m inflation radius)
3333×3333 Grid (11.1M cells, 50% occupancy, 2 m inflation radius)
4000×4000 Grid (16M cells, 30% occupancy, 1 m inflation radius)
Key Findings
1. New Implementation Impact (OpenMP disabled)
2. OpenMP Parallelization Impact
3. Grid Size Scaling
4. Occupancy Impact (1500×1500 tests)
5. Inflation Radius Impact
Detailed Results by Parameter
Varying Occupancy (1500×1500 grid, 2 m inflation)
Key Observation: The old implementation degrades sharply as occupancy rises (8→75 ms on dev, roughly 9× slower), while the new implementation stays stable (14-16 ms without OpenMP, 4-5 ms with OpenMP).
Varying Inflation Radius (1000×1000 grid, 50% occupancy)
Key Observation: The old implementation slows down from the smallest to the largest radius (11.2→17.6 ms on dev). That is about 57% more time per update, or equivalently about 36% lower throughput. The new implementation shows minimal variation (<3% difference).
Varying Cost Scale (1000×1000 grid, 50% occupancy, 2 m radius)
Key Observation: Cost scale factor has negligible impact on performance across all implementations.
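One reading that is consistent with this observation: in the usual Nav2-style inflation formula, the cost scaling factor only changes the values in a distance-to-cost lookup table, not the number of cells visited. The sketch below illustrates that idea. It is not code from this PR: the function names, the free-space cutoff beyond the inflation radius, and the resolution and inscribed-radius values used with it are assumptions for illustration.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Cost values following the common Nav2 costmap convention.
constexpr std::uint8_t kLethal = 254;
constexpr std::uint8_t kInscribed = 253;
constexpr std::uint8_t kFree = 0;

// Maps the distance (in meters) to the nearest obstacle onto a cost.
// Exponential decay outside the inscribed radius; hypothetical helper.
std::uint8_t inflationCost(double distance_m, double inscribed_radius_m,
                           double inflation_radius_m, double cost_scaling_factor)
{
  if (distance_m <= 0.0) return kLethal;
  if (distance_m <= inscribed_radius_m) return kInscribed;
  if (distance_m > inflation_radius_m) return kFree;  // assumption: no cost past the radius
  const double factor =
      std::exp(-cost_scaling_factor * (distance_m - inscribed_radius_m));
  return static_cast<std::uint8_t>((kInscribed - 1) * factor);
}

// Precomputes one cost per integer cell distance. The table length depends
// on radius and resolution only; the scaling factor changes the stored values.
std::vector<std::uint8_t> buildCostTable(double resolution_m, double inscribed_radius_m,
                                         double inflation_radius_m,
                                         double cost_scaling_factor)
{
  assert(resolution_m > 0.0);
  const auto max_cells =
      static_cast<std::size_t>(std::ceil(inflation_radius_m / resolution_m));
  std::vector<std::uint8_t> table(max_cells + 1);
  for (std::size_t d = 0; d <= max_cells; ++d) {
    table[d] = inflationCost(static_cast<double>(d) * resolution_m, inscribed_radius_m,
                             inflation_radius_m, cost_scaling_factor);
  }
  return table;
}
```

With a table like this, changing the scaling factor changes the costs written into the grid but leaves the per-cell work (one lookup) unchanged.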
Recommendations
✅ Use the new implementation with OpenMP enabled: it provides a 6.5-15.3× speedup
✅ Even without OpenMP, the new implementation is 1.1-3.5× faster than the old one
✅ Performance is more predictable and scales better with grid size
✅ Robot shows excellent speedup despite lower CPU frequency
✅ New implementation handles varying occupancy and inflation radii efficiently
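For readers unfamiliar with how OpenMP can be made optional, here is a minimal sketch of the general pattern. It is an illustration under assumptions, not the code from this PR: `applyCostTable` and `availableThreads` are hypothetical names, and the PR's actual build switch is not shown in this page. The idea is that the `#pragma omp` line is honored only when the compiler is invoked with OpenMP support (for example `-fopenmp` with GCC) and is otherwise ignored, so one source file serves both builds.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

#ifdef _OPENMP
#include <omp.h>  // only available and needed when OpenMP is enabled
#endif

// Reports how many threads the loop below may use; 1 in a non-OpenMP build.
int availableThreads()
{
#ifdef _OPENMP
  return omp_get_max_threads();
#else
  return 1;
#endif
}

// Converts a grid of per-cell obstacle distances (in cells) into costs via a
// lookup table. Each output cell depends only on its own input cell, so rows
// can be processed independently and in any order.
std::vector<std::uint8_t> applyCostTable(const std::vector<std::uint16_t>& distance_cells,
                                         std::size_t width, std::size_t height,
                                         const std::vector<std::uint8_t>& cost_table)
{
  assert(distance_cells.size() == width * height);
  std::vector<std::uint8_t> costs(width * height, 0);
  const long rows = static_cast<long>(height);  // signed index for older OpenMP versions

  // Parallel over rows when built with OpenMP; a plain serial loop otherwise.
  #pragma omp parallel for schedule(static)
  for (long y = 0; y < rows; ++y) {
    for (std::size_t x = 0; x < width; ++x) {
      const std::size_t i = static_cast<std::size_t>(y) * width + x;
      const std::uint16_t d = distance_cells[i];
      // Distances beyond the table are treated as free space (cost 0).
      costs[i] = d < cost_table.size() ? cost_table[d] : 0;
    }
  }
  return costs;
}
```

Because the threads write to disjoint cells, the parallel and serial builds produce identical output, which makes the two configurations easy to compare in a benchmark.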
Performance Summary Chart