-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
added benchmark results of keccak256 based binary merklization on mul…
…tiple platforms ( cpu, gpu etc. )
- Loading branch information
1 parent
f03707f
commit 3a5fa29
Showing
4 changed files
with
165 additions
and
2 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,109 @@ | ||
### Binary Merklization using KECCAK-256 on Intel CPU(s) | ||
|
||
Compiling with | ||
|
||
```bash | ||
SHA=keccak_256 make aot_cpu | ||
``` | ||
|
||
### On `Intel(R) Xeon(R) CPU E5-2686 v4 @ 2.30GHz` | ||
|
||
```bash | ||
$ lscpu | grep -i cpu\(s\) | ||
|
||
CPU(s): 4 | ||
On-line CPU(s) list: 0-3 | ||
NUMA node0 CPU(s): 0-3 | ||
``` | ||
|
||
```bash | ||
running on Intel(R) Xeon(R) CPU E5-2686 v4 @ 2.30GHz | ||
|
||
|
||
Benchmarking Binary Merklization using KECCAK-256 | ||
|
||
leaf count execution time host-to-device tx time device-to-host tx time | ||
2 ^ 20 466.478477 ms 3.288778 ms 3.442020 ms | ||
2 ^ 21 898.963977 ms 6.508914 ms 6.558546 ms | ||
2 ^ 22 1.797621 s 13.061319 ms 13.172746 ms | ||
2 ^ 23 3.591501 s 27.324937 ms 27.123078 ms | ||
2 ^ 24 7.186666 s 54.148528 ms 54.237210 ms | ||
2 ^ 25 14.404052 s 123.865217 ms 108.246855 ms | ||
``` | ||
|
||
### On `Intel(R) Xeon(R) Platinum 8358 CPU @ 2.60GHz` | ||
|
||
```bash | ||
$ lscpu | grep -i cpu\(s\) | ||
|
||
CPU(s): 128 | ||
On-line CPU(s) list: 0-127 | ||
NUMA node0 CPU(s): 0-31,64-95 | ||
NUMA node1 CPU(s): 32-63,96-127 | ||
``` | ||
|
||
```bash | ||
running on Intel(R) Xeon(R) Platinum 8358 CPU @ 2.60GHz | ||
|
||
|
||
Benchmarking Binary Merklization using KECCAK-256 | ||
|
||
leaf count execution time host-to-device tx time device-to-host tx time | ||
2 ^ 20 13.362355 ms 1.821476 ms 1.326708 ms | ||
2 ^ 21 20.922397 ms 3.589614 ms 2.430955 ms | ||
2 ^ 22 33.674320 ms 6.493885 ms 4.294246 ms | ||
2 ^ 23 106.859444 ms 11.947260 ms 8.593155 ms | ||
2 ^ 24 117.165222 ms 23.851139 ms 8.417020 ms | ||
2 ^ 25 233.647003 ms 25.051263 ms 16.673447 ms | ||
``` | ||
|
||
### On `Intel(R) Xeon(R) Gold 6128 CPU @ 3.40GHz` | ||
|
||
```bash | ||
$ lscpu | grep -i cpu\(s\) | ||
|
||
CPU(s): 24 | ||
On-line CPU(s) list: 0-23 | ||
NUMA node0 CPU(s): 0-5,12-17 | ||
NUMA node1 CPU(s): 6-11,18-23 | ||
``` | ||
|
||
```bash | ||
running on Intel(R) Xeon(R) Gold 6128 CPU @ 3.40GHz | ||
|
||
|
||
Benchmarking Binary Merklization using KECCAK-256 | ||
|
||
leaf count execution time host-to-device tx time device-to-host tx time | ||
2 ^ 20 34.571529 ms 1.809763 ms 897.616875 us | ||
2 ^ 21 61.404680 ms 3.326612 ms 1.588368 ms | ||
2 ^ 22 117.968746 ms 5.674248 ms 7.157974 ms | ||
2 ^ 23 231.852088 ms 9.238144 ms 13.273680 ms | ||
2 ^ 24 462.241001 ms 20.315251 ms 12.602417 ms | ||
2 ^ 25 924.972606 ms 31.446401 ms 24.707977 ms | ||
``` | ||
|
||
### On `Intel(R) Xeon(R) E-2176G CPU @ 3.70GHz` | ||
|
||
```bash | ||
$ lscpu | grep -i cpu\(s\) | ||
|
||
CPU(s): 12 | ||
On-line CPU(s) list: 0-11 | ||
NUMA node0 CPU(s): 0-11 | ||
``` | ||
|
||
```bash | ||
running on Intel(R) Xeon(R) E-2176G CPU @ 3.70GHz | ||
|
||
|
||
Benchmarking Binary Merklization using KECCAK-256 | ||
|
||
leaf count execution time host-to-device tx time device-to-host tx time | ||
2 ^ 20 73.894415 ms 932.138625 us 850.445250 us | ||
2 ^ 21 109.423621 ms 1.782943 ms 1.715456 ms | ||
2 ^ 22 218.244072 ms 3.493360 ms 3.446031 ms | ||
2 ^ 23 436.918616 ms 6.905427 ms 6.842661 ms | ||
2 ^ 24 883.594877 ms 13.812258 ms 13.749230 ms | ||
2 ^ 25 1.930962 s 27.554382 ms 27.591307 ms | ||
``` |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,24 @@ | ||
### Binary Merklization using KECCAK-256 on Intel GPU(s) | ||
|
||
Compiling with | ||
|
||
```bash | ||
SHA=keccak_256 make aot_gpu | ||
``` | ||
|
||
### On `Intel(R) UHD Graphics P630 [0x3e96]` | ||
|
||
```bash | ||
running on Intel(R) UHD Graphics P630 [0x3e96] | ||
|
||
|
||
Benchmarking Binary Merklization using KECCAK-256 | ||
|
||
leaf count execution time host-to-device tx time device-to-host tx time | ||
2 ^ 20 108.488926 ms 1.332275 ms 745.381500 us | ||
2 ^ 21 212.384799 ms 1.497735 ms 1.454533 ms | ||
2 ^ 22 422.459127 ms 5.289694 ms 2.832562 ms | ||
2 ^ 23 841.035348 ms 5.684048 ms 5.597084 ms | ||
2 ^ 24 1.679276 s 11.176738 ms 11.080438 ms | ||
2 ^ 25 3.355854 s 22.150604 ms 22.356589 ms | ||
``` |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,24 @@ | ||
### Binary Merklization using KECCAK-256 on Nvidia GPU(s) | ||
|
||
Compile with | ||
|
||
```bash | ||
SHA=keccak_256 make cuda | ||
``` | ||
|
||
### On `Tesla V100-SXM2-16GB` | ||
|
||
```bash | ||
running on Tesla V100-SXM2-16GB | ||
|
||
|
||
Benchmarking Binary Merklization using KECCAK-256 | ||
|
||
leaf count execution time host-to-device tx time device-to-host tx time | ||
2 ^ 20 751.924875 us 1.167792 ms 1.005363 ms | ||
2 ^ 21 1.344910 ms 2.304931 ms 2.016678 ms | ||
2 ^ 22 2.517974 ms 4.593017 ms 4.025208 ms | ||
2 ^ 23 4.864380 ms 9.128906 ms 8.053345 ms | ||
2 ^ 24 8.179686 ms 18.250488 ms 16.049194 ms | ||
2 ^ 25 16.144776 ms 36.534668 ms 32.099121 ms | ||
``` |