This repository has been archived by the owner on Feb 1, 2023. It is now read-only.

More stats, knobs and tunings #514

Merged
merged 29 commits into from
Aug 18, 2021

Conversation

@petar (Contributor) commented Jul 16, 2021

[x] Prometheus metric for how long it takes to send messages
[x] option for maximum outstanding bytes per peer
[x] tighter send timeouts
- Minimum timeout of 10s.
- Add 2s to account for latency.
- Assume a minimum bandwidth of 100kbit/s.
- Maximum message send time of 2 minutes (far more time than necessary).
[x] option for number of task workers
[x] option for number of engine task workers
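
The send-timeout rules in the checklist above can be sketched roughly as follows. This is a minimal illustration and not the code from this PR: the function and constant names are hypothetical, the maximum is assumed to be 2 minutes, and the 2s is assumed to be added to the transfer time implied by the message size at the minimum bandwidth.

```go
package main

import (
	"fmt"
	"time"
)

// Hypothetical constants mirroring the numbers in the PR description.
const (
	minTimeout = 10 * time.Second // floor applied to every send
	maxTimeout = 2 * time.Minute  // ceiling (assumed to be 2 minutes)
	latency    = 2 * time.Second  // fixed allowance added for latencies
	minRate    = 100 * 1000 / 8   // 100 kbit/s expressed in bytes per second
)

// sendTimeout estimates how long a message of sizeBytes may take to send
// at the minimum bandwidth, adds the latency allowance, and clamps the
// result to [minTimeout, maxTimeout]. sizeBytes is assumed non-negative.
func sendTimeout(sizeBytes int) time.Duration {
	transfer := time.Duration(uint64(time.Second) * uint64(sizeBytes) / uint64(minRate))
	timeout := latency + transfer
	if timeout < minTimeout {
		return minTimeout
	}
	if timeout > maxTimeout {
		return maxTimeout
	}
	return timeout
}

func main() {
	for _, size := range []int{0, 125_000, 1_000_000, 100_000_000} {
		fmt.Printf("%d bytes -> %s\n", size, sendTimeout(size))
	}
}
```

Under these assumptions a 1 MB message gets 80s of transfer time plus 2s of latency, while very small and very large messages hit the 10s floor and the 2 minute ceiling respectively.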

Depends on: https://github.com/ipfs/go-peertaskqueue/pull/10/commits
Part of ipfs/kubo#8233

@petar petar requested a review from aschmahmann July 20, 2021 20:07
@petar petar marked this pull request as ready for review July 22, 2021 12:38
@petar (Contributor, Author) commented Jul 22, 2021

@aschmahmann this is ready for review. The tests have passed at least once on each OS; the ones still failing are all documented as flaky.

dupMetric: dupHist,
allMetric: allHist,
sentHistogram: sentHistogram,
sendTimeHistogram: sendTimeHistogram,
Contributor:

Is this still useful? If it's not documented then it's not useful and we should remove it.

Contributor (Author):

I think it is useful to know if your outbound connection is slow. Let's keep it. Where is the right place to document it? The metric itself is documented when it is initialized, so the doc shows up on the Grafana dashboard.

Contributor:

OK, that seems fine. We should try to track this internally on our infra (and ask Pinata to try on theirs) so we can get an understanding of what these numbers look like and whether the time resolution is anywhere close to correct.
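
Whether the time resolution is adequate comes down to where the histogram's bucket boundaries sit relative to real send times. As a rough, standard-library-only sketch (this is not the Prometheus client or the metrics wrapper used in this PR; the type, its methods and the bucket bounds are all hypothetical), a send-time histogram can be modeled like this:

```go
package main

import (
	"fmt"
	"sort"
	"time"
)

// sendTimeHistogram is a hypothetical stand-in for a metrics histogram.
// It is not safe for concurrent use.
type sendTimeHistogram struct {
	bounds []float64 // bucket upper bounds in seconds, ascending
	counts []uint64  // one count per bound, plus a final overflow bucket
}

func newSendTimeHistogram(bounds []float64) *sendTimeHistogram {
	return &sendTimeHistogram{bounds: bounds, counts: make([]uint64, len(bounds)+1)}
}

// observe records one send duration, given in seconds, in the first
// bucket whose upper bound is >= the value (or in the overflow bucket).
func (h *sendTimeHistogram) observe(seconds float64) {
	i := sort.SearchFloat64s(h.bounds, seconds)
	h.counts[i]++
}

func main() {
	// Illustrative bounds only; real buckets should be chosen from observed data.
	h := newSendTimeHistogram([]float64{0.01, 0.1, 1, 10, 120})

	start := time.Now()
	time.Sleep(5 * time.Millisecond) // stands in for sending a message
	h.observe(time.Since(start).Seconds())

	fmt.Println(h.counts)
}
```

If nearly all observations fall into one bucket, the bounds are too coarse for the send times actually seen in production and should be adjusted.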

bitswap_test.go (resolved)
go.mod (outdated, resolved)
internal/decision/blockstoremanager.go (outdated, resolved)
internal/decision/engine_test.go (outdated, resolved)
internal/decision/engine.go (outdated, resolved)
internal/decision/engine.go (outdated, resolved)
@petar (Contributor, Author) commented Aug 5, 2021

@aschmahmann ready for another pass.

@aschmahmann aschmahmann merged commit 2b51297 into master Aug 18, 2021
@aschmahmann aschmahmann deleted the feat/more-stats branch August 18, 2021 12:03
@aschmahmann aschmahmann mentioned this pull request Aug 23, 2021
62 tasks