Hi folks!
The SimpleKernelTimer is very handy for quick profiling, but I find the output hard to read when profiling regions are nested. Consider the following code example:
The output currently shows all the regions as one big flat list, which is hard to analyze. It would be much clearer if the formatting took into account the hierarchical structure of the regions. The output could look something like the following:
(Type) Total Time, Call Count, Avg. Time per Call, %Total Time in Kernels, %Total Program Time
-------------------------------------------------------------------------
Regions:
+ MyKokkosApp
| (REGION) ...
| + Solve
| | (REGION) ...
| | + Bar
| | | (REGION) ...
| | | - BarLoop
| | | (REGION) ...
| | | - Other
| | | (REGION) ...
| | - Foo
| | (REGION) ...
| | - Other
| | (REGION) ...
| - Initialization
| (REGION) ...
| - Other
| (REGION) ...
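To make the idea more concrete, here is a minimal sketch of how the nesting could be tracked and printed. It is only an illustration under my own assumptions: `RegionNode`, `RegionTree`, and `render_demo` are hypothetical names, not part of SimpleKernelTimer, and the sketch leaves out the timings and the "Other" entries. In the real tool, `push` and `pop` would presumably be driven by the region push/pop profiling callbacks.

```cpp
#include <cassert>
#include <memory>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical node: one per distinct region path (not a Kokkos Tools type).
struct RegionNode {
  std::string name;
  RegionNode* parent = nullptr;
  std::vector<std::unique_ptr<RegionNode>> children;
};

// Hypothetical tracker that mirrors region push/pop events as a tree.
class RegionTree {
 public:
  RegionTree() : current_(&root_) {}
  RegionTree(const RegionTree&) = delete;  // current_ points into this object
  RegionTree& operator=(const RegionTree&) = delete;

  // Enter a region: reuse the child with this name or create a new one.
  void push(const std::string& name) {
    assert(!name.empty());
    for (auto& child : current_->children) {
      if (child->name == name) {
        current_ = child.get();
        return;
      }
    }
    auto node = std::make_unique<RegionNode>();
    node->name = name;
    node->parent = current_;
    current_->children.push_back(std::move(node));
    current_ = current_->children.back().get();
  }

  // Leave the current region; an unmatched pop at the root is ignored.
  void pop() {
    if (current_->parent != nullptr) current_ = current_->parent;
  }

  // Render the tree in the proposed layout, in order of first appearance.
  std::string render() const {
    std::ostringstream out;
    for (const auto& child : root_.children) print(*child, 0, out);
    return out.str();
  }

 private:
  static void print(const RegionNode& node, int depth, std::ostringstream& out) {
    std::string indent;
    for (int i = 0; i < depth; ++i) indent += "| ";
    const bool leaf = node.children.empty();
    // '+' marks a region with nested regions, '-' marks a leaf region.
    out << indent << (leaf ? "- " : "+ ") << node.name << "\n";
    // Placeholder for the statistics line; a real tool would print timings.
    out << indent << (leaf ? "" : "| ") << "(REGION) ...\n";
    for (const auto& child : node.children) print(*child, depth + 1, out);
  }

  RegionNode root_;
  RegionNode* current_;
};

// Example: replay a nested push/pop sequence and return the rendering.
std::string render_demo() {
  RegionTree tree;
  tree.push("MyKokkosApp");
  tree.push("Initialization");
  tree.pop();
  tree.push("Solve");
  for (int i = 0; i < 2; ++i) {  // repeated calls map to the same nodes
    tree.push("Foo");
    tree.pop();
    tree.push("Bar");
    tree.push("BarLoop");
    tree.pop();
    tree.pop();
  }
  tree.pop();  // Solve
  tree.pop();  // MyKokkosApp
  return tree.render();
}
```

Keying each node by its name under its parent means that a region entered repeatedly (for example inside a loop) shows up once per call path, which is where the per-region statistics could be accumulated.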
What do you think of this?