-
Notifications
You must be signed in to change notification settings - Fork 799
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Introduce subsystem benchmarking tool (#2528)
This tool makes it easy to run parachain consensus stress/performance testing on your development machine or in CI. ## Motivation The parachain consensus node implementation spans across many modules which we call subsystems. Each subsystem is responsible for a small part of logic of the parachain consensus pipeline, but in general the most load and performance issues are localized in just a few core subsystems like `availability-recovery`, `approval-voting` or `dispute-coordinator`. In the absence of such a tool, we would run large test nets to load/stress test these parts of the system. Setting up and making sense of the amount of data produced by such a large test is very expensive, hard to orchestrate and is a huge development time sink. ## PR contents - CLI tool - Data Availability Read test - reusable mockups and components needed so far - Documentation on how to get started ### Data Availability Read test An overseer is built with using a real `availability-recovery` susbsytem instance while dependent subsystems like `av-store`, `network-bridge` and `runtime-api` are mocked. The network bridge will emulate all the network peers and their answering to requests. The test is going to be run for a number of blocks. For each block it will generate send a “RecoverAvailableData” request for an arbitrary number of candidates. We wait for the subsystem to respond to all requests before moving to the next block. At the same time we collect the usual subsystem metrics and task CPU metrics and show some nice progress reports while running. ### Here is how the CLI looks like: ``` [2023-11-28T13:06:27Z INFO subsystem_bench::core::display] n_validators = 1000, n_cores = 20, pov_size = 5120 - 5120, error = 3, latency = Some(PeerLatency { min_latency: 1ms, max_latency: 100ms }) [2023-11-28T13:06:27Z INFO subsystem-bench::availability] Generating template candidate index=0 pov_size=5242880 [2023-11-28T13:06:27Z INFO subsystem-bench::availability] Created test environment. [2023-11-28T13:06:27Z INFO subsystem-bench::availability] Pre-generating 60 candidates. [2023-11-28T13:06:30Z INFO subsystem-bench::core] Initializing network emulation for 1000 peers. [2023-11-28T13:06:30Z INFO subsystem-bench::availability] Current block 1/3 [2023-11-28T13:06:30Z INFO substrate_prometheus_endpoint] 〽️ Prometheus exporter started at 127.0.0.1:9999 [2023-11-28T13:06:30Z INFO subsystem_bench::availability] 20 recoveries pending [2023-11-28T13:06:37Z INFO subsystem_bench::availability] Block time 6262ms [2023-11-28T13:06:37Z INFO subsystem-bench::availability] Sleeping till end of block (0ms) [2023-11-28T13:06:37Z INFO subsystem-bench::availability] Current block 2/3 [2023-11-28T13:06:37Z INFO subsystem_bench::availability] 20 recoveries pending [2023-11-28T13:06:43Z INFO subsystem_bench::availability] Block time 6369ms [2023-11-28T13:06:43Z INFO subsystem-bench::availability] Sleeping till end of block (0ms) [2023-11-28T13:06:43Z INFO subsystem-bench::availability] Current block 3/3 [2023-11-28T13:06:43Z INFO subsystem_bench::availability] 20 recoveries pending [2023-11-28T13:06:49Z INFO subsystem_bench::availability] Block time 6194ms [2023-11-28T13:06:49Z INFO subsystem-bench::availability] Sleeping till end of block (0ms) [2023-11-28T13:06:49Z INFO subsystem_bench::availability] All blocks processed in 18829ms [2023-11-28T13:06:49Z INFO subsystem_bench::availability] Throughput: 102400 KiB/block [2023-11-28T13:06:49Z INFO subsystem_bench::availability] Block time: 6276 ms [2023-11-28T13:06:49Z INFO subsystem_bench::availability] Total received from network: 415 MiB Total sent to network: 724 KiB Total subsystem CPU usage 24.00s CPU usage per block 8.00s Total test environment CPU usage 0.15s CPU usage per block 0.05s ``` ### Prometheus/Grafana stack in action <img width="1246" alt="Screenshot 2023-11-28 at 15 11 10" src="https://github.com/paritytech/polkadot-sdk/assets/54316454/eaa47422-4a5e-4a3a-aaef-14ca644c1574"> <img width="1246" alt="Screenshot 2023-11-28 at 15 12 01" src="https://github.com/paritytech/polkadot-sdk/assets/54316454/237329d6-1710-4c27-8f67-5fb11d7f66ea"> <img width="1246" alt="Screenshot 2023-11-28 at 15 12 38" src="https://github.com/paritytech/polkadot-sdk/assets/54316454/a07119e8-c9f1-4810-a1b3-f1b7b01cf357"> --------- Signed-off-by: Andrei Sandu <[email protected]>
- Loading branch information
Showing
31 changed files
with
5,829 additions
and
38 deletions.
There are no files selected for viewing
Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,61 @@ | ||
[package] | ||
name = "polkadot-subsystem-bench" | ||
description = "Subsystem performance benchmark client" | ||
version = "1.0.0" | ||
authors.workspace = true | ||
edition.workspace = true | ||
license.workspace = true | ||
readme = "README.md" | ||
publish = false | ||
|
||
[[bin]] | ||
name = "subsystem-bench" | ||
path = "src/subsystem-bench.rs" | ||
|
||
# Prevent rustdoc error. Already documented from top-level Cargo.toml. | ||
doc = false | ||
|
||
[dependencies] | ||
polkadot-node-subsystem = { path = "../subsystem" } | ||
polkadot-node-subsystem-util = { path = "../subsystem-util" } | ||
polkadot-node-subsystem-types = { path = "../subsystem-types" } | ||
polkadot-node-primitives = { path = "../primitives" } | ||
polkadot-primitives = { path = "../../primitives" } | ||
polkadot-node-network-protocol = { path = "../network/protocol" } | ||
polkadot-availability-recovery = { path = "../network/availability-recovery", features = ["subsystem-benchmarks"] } | ||
color-eyre = { version = "0.6.1", default-features = false } | ||
polkadot-overseer = { path = "../overseer" } | ||
colored = "2.0.4" | ||
assert_matches = "1.5" | ||
async-trait = "0.1.57" | ||
sp-keystore = { path = "../../../substrate/primitives/keystore" } | ||
sc-keystore = { path = "../../../substrate/client/keystore" } | ||
sp-core = { path = "../../../substrate/primitives/core" } | ||
clap = { version = "4.4.6", features = ["derive"] } | ||
futures = "0.3.21" | ||
futures-timer = "3.0.2" | ||
gum = { package = "tracing-gum", path = "../gum" } | ||
polkadot-erasure-coding = { package = "polkadot-erasure-coding", path = "../../erasure-coding" } | ||
log = "0.4.17" | ||
env_logger = "0.9.0" | ||
rand = "0.8.5" | ||
parity-scale-codec = { version = "3.6.1", features = ["derive", "std"] } | ||
tokio = "1.24.2" | ||
clap-num = "1.0.2" | ||
polkadot-node-subsystem-test-helpers = { path = "../subsystem-test-helpers" } | ||
sp-keyring = { path = "../../../substrate/primitives/keyring" } | ||
sp-application-crypto = { path = "../../../substrate/primitives/application-crypto" } | ||
sc-network = { path = "../../../substrate/client/network" } | ||
sc-service = { path = "../../../substrate/client/service" } | ||
polkadot-node-metrics = { path = "../metrics" } | ||
itertools = "0.11.0" | ||
polkadot-primitives-test-helpers = { path = "../../primitives/test-helpers" } | ||
prometheus_endpoint = { package = "substrate-prometheus-endpoint", path = "../../../substrate/utils/prometheus" } | ||
prometheus = { version = "0.13.0", default-features = false } | ||
serde = "1.0.192" | ||
serde_yaml = "0.9" | ||
paste = "1.0.14" | ||
orchestra = { version = "0.3.3", default-features = false, features = ["futures_channel"] } | ||
|
||
[features] | ||
default = [] |
Oops, something went wrong.