-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add benchmark tests for TMVA Sofie #239
Open
lmoneta
wants to merge
35
commits into
root-project:master
Choose a base branch
from
lmoneta:tmva_sofie_benchmark
base: master
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
4 tasks
sitongan
approved these changes
Dec 2, 2021
lmoneta
force-pushed
the
tmva_sofie_benchmark
branch
from
March 8, 2022 09:37
11d7d82
to
5018380
Compare
lmoneta
force-pushed
the
tmva_sofie_benchmark
branch
from
April 3, 2022 10:10
03cf673
to
8f79b34
Compare
lmoneta
force-pushed
the
tmva_sofie_benchmark
branch
2 times, most recently
from
July 4, 2022 17:14
0e87df1
to
bebe103
Compare
lmoneta
force-pushed
the
tmva_sofie_benchmark
branch
from
March 6, 2024 09:47
babcd71
to
9c562ba
Compare
Disable automatic generation of test code, using template benchmark is trivial to add a new test
Add corresponding files (including new .dat wiles for weights in compiled_models)
Use also a given number of events (e.g. 64)
Add also all needed input files
for SOFIE tests Remove then header file from repository Remove also obsolate onnx file that cannot be parsed by SOFIE Add new FindONNXRuntime and a new FindLWTNN to find the required dependency package. Use minimial FIndModule file, which requires only to define an env variable to find the package
Add optimization flags for auto-vectorization : -mavx2 -fno-signaling-nans -fno-trapping-math. With autovec, COnv1D test is 3 times faster. These changes are used to produce the benchmark results presented ad ACAT.
…sage of state.thread_index
Add json file for Generator model
Modifiy noth ONNXRuntimeinference and SOFIEINference to support models with multiple inputs
- set in ONNXRuntime also the interopnumthreads to 1
lmoneta
force-pushed
the
tmva_sofie_benchmark
branch
from
June 4, 2024 14:26
9c562ba
to
0f948b0
Compare
Do not use empty path when creating Session classes Improve tests by adding possibility to pass location of input files
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Ann new tests for TMVA SOFIE including tests using ONNXRuntime and LWTNN
Add corresponding Find.. cmake modules to find the needed package
Make also use of FindBLAS to find teh correct BLAS version. Need to use -DBLA_VENDOR to configure to the right BLAS type installed in the system.
For example -DBLA_VENDOR= -DBLA_VENDOR=Intel10_64lp_seq to use the sequential MKL library that is used in the benchmark presented at ACAT
This PR is built on top of #236 tnaks to Federico Sossai