Skip to content

Speed up calculation of subcell bounds#2727

Merged
DanielDoehring merged 2 commits intotrixi-framework:mainfrom
bennibolm:speed-up-calc-bounds
Jan 8, 2026
Merged

Speed up calculation of subcell bounds#2727
DanielDoehring merged 2 commits intotrixi-framework:mainfrom
bennibolm:speed-up-calc-bounds

Conversation

@bennibolm
Copy link
Copy Markdown
Contributor

@bennibolm bennibolm commented Jan 8, 2026

Similar to #2644, this PR refactors the computation of subcell bounds in calc_bounds_twosided!. Instead of going through all nodes and decide whether there is an available neighbor (if i>1, if i<nnodes, ...), we run over all subcell interfaces (for i in 2:nnodes(dg), ...).
main

julia> @benchmark Trixi.calc_bounds_twosided!($var_min, $var_max, $variable, $u, $t, $semi, $equations)
BenchmarkTools.Trial: 10000 samples with 1 evaluation per sample.
 Range (min … max):  236.021 μs … 318.680 μs  ┊ GC (min … max): 0.00% … 0.00%
 Time  (median):     238.703 μs               ┊ GC (median):    0.00%
 Time  (mean ± σ):   239.852 μs ±   3.522 μs  ┊ GC (mean ± σ):  0.00% ± 0.00%

        ▄▇█▆▄▅▅▅▃▂▂▂▁                         ▁                 ▂
  ▂▂▃▅▆██████████████████▇▇█▇▇█▇▇▆▇▆▆▆▇▆▅▆▇▆▆███▇▇▆▆▅▆▆▆▆▆▅▄▅▄▄ █
  236 μs        Histogram: log(frequency) by time        255 μs <

 Memory estimate: 0 bytes, allocs estimate: 0.

this PR

julia> @benchmark Trixi.calc_bounds_twosided!($var_min, $var_max, $variable, $u, $t, $semi, $equations)
BenchmarkTools.Trial: 10000 samples with 1 evaluation per sample.
 Range (min … max):  128.819 μs … 174.041 μs  ┊ GC (min … max): 0.00% … 0.00%
 Time  (median):     131.858 μs               ┊ GC (median):    0.00%
 Time  (mean ± σ):   132.633 μs ±   2.932 μs  ┊ GC (mean ± σ):  0.00% ± 0.00%

     ▁▂▃▄▅▆▇██▇▇▆▅▅▄▄▃▂▂▁▁ ▁      ▁                             ▃
  ▄▆▇███████████████████████████████▇█▇████▆▇█▇▇█▇▇▇▇▆▆▇▇▆▆▆▆▆▆ █
  129 μs        Histogram: log(frequency) by time        145 μs <

 Memory estimate: 0 bytes, allocs estimate: 0.

I tried the same for the computation of one-sided bounds (nonlinear variables), but it makes the routine slower due to more evaluations of variable(..., equations).

@bennibolm bennibolm added performance We are greedy refactoring Refactoring code without functional changes labels Jan 8, 2026
@github-actions
Copy link
Copy Markdown
Contributor

github-actions bot commented Jan 8, 2026

Review checklist

This checklist is meant to assist creators of PRs (to let them know what reviewers will typically look for) and reviewers (to guide them in a structured review process). Items do not need to be checked explicitly for a PR to be eligible for merging.

Purpose and scope

  • The PR has a single goal that is clear from the PR title and/or description.
  • All code changes represent a single set of modifications that logically belong together.
  • No more than 500 lines of code are changed or there is no obvious way to split the PR into multiple PRs.

Code quality

  • The code can be understood easily.
  • Newly introduced names for variables etc. are self-descriptive and consistent with existing naming conventions.
  • There are no redundancies that can be removed by simple modularization/refactoring.
  • There are no leftover debug statements or commented code sections.
  • The code adheres to our conventions and style guide, and to the Julia guidelines.

Documentation

  • New functions and types are documented with a docstring or top-level comment.
  • Relevant publications are referenced in docstrings (see example for formatting).
  • Inline comments are used to document longer or unusual code sections.
  • Comments describe intent ("why?") and not just functionality ("what?").
  • If the PR introduces a significant change or new feature, it is documented in NEWS.md with its PR number.

Testing

  • The PR passes all tests.
  • New or modified lines of code are covered by tests.
  • New or modified tests run in less then 10 seconds.

Performance

  • There are no type instabilities or memory allocations in performance-critical parts.
  • If the PR intent is to improve performance, before/after time measurements are posted in the PR.

Verification

  • The correctness of the code was verified using appropriate tests.
  • If new equations/methods are added, a convergence test has been run and the results
    are posted in the PR.

Created with ❤️ by the Trixi.jl community.

@codecov
Copy link
Copy Markdown

codecov bot commented Jan 8, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 97.01%. Comparing base (a499f99) to head (437ed1b).
⚠️ Report is 1 commits behind head on main.

Additional details and impacted files
@@           Coverage Diff           @@
##             main    #2727   +/-   ##
=======================================
  Coverage   97.01%   97.01%           
=======================================
  Files         556      556           
  Lines       43958    43960    +2     
=======================================
+ Hits        42645    42647    +2     
  Misses       1313     1313           
Flag Coverage Δ
unittests 97.01% <100.00%> (+<0.01%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@bennibolm bennibolm marked this pull request as ready for review January 8, 2026 10:57
@DanielDoehring DanielDoehring merged commit cd3bca0 into trixi-framework:main Jan 8, 2026
39 checks passed
@bennibolm bennibolm deleted the speed-up-calc-bounds branch January 8, 2026 12:09
DanielDoehring pushed a commit to DanielDoehring/Trixi.jl that referenced this pull request Feb 19, 2026
* Revise calculation of two-sided bounds

* typo
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

performance We are greedy refactoring Refactoring code without functional changes

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants