
ConsolidateNudge: nudge without reducing capacity of underlying slice, to keep pointslicepool effective #1923

Merged
Dieterbe merged 1 commit into master from psp-stats-with-nudge-fix on Oct 15, 2020

Conversation

@Dieterbe (Contributor) commented on Oct 14, 2020

Note: this builds on top of #1922, review/merge that first.

Without this fix, you would commonly see that slices going into the pointslicepool have a cap that is one or two points less than what is needed on subsequent reads.

E.g. I added some debug lines and saw this:

metrictank-q1_1  | PSP.GetMin(2880) from github.com/grafana/metrictank/vendor/github.com/tinylib/msgp/msgp.Decode->github.com/grafana/metrictank/api/models.(*GetDataRespV1).DecodeMsg
metrictank-q1_1  | candidate has cap 2877 we want 2880
metrictank-q1_1  | PSP.GetMin(2880) from github.com/grafana/metrictank/api.(*Server).renderMetrics->github.com/grafana/metrictank/api.(*Server).executePlan
metrictank-q1_1  | candidate has cap 2877 we want 2880
metrictank-q1_1  | PSP.Put(2880) from github.com/grafana/metrictank/api.(*Server).renderMetrics->github.com/grafana/metrictank/expr.Plan.Clean
metrictank-q1_1  | PSP.Put(2877) from github.com/grafana/metrictank/api.(*Server).renderMetrics->github.com/grafana/metrictank/expr.Plan.Clean

This fix increases the efficacy of the pool.
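For readers less familiar with Go slice mechanics, here is a small standalone sketch of why this happens and one way to preserve capacity. It is not the actual ConsolidateNudge code; `Point`, `nudgeReslice` and `nudgeCopy` are hypothetical stand-ins, and 2880/2877 mirror the numbers in the logs above. Re-slicing away the leading points shrinks the cap that later reaches the pool, whereas shifting the points down with `copy` keeps the full backing array usable:

```go
package main

import "fmt"

// Point stands in for metrictank's schema.Point; assumed here for illustration.
type Point struct {
	Val float64
	Ts  uint32
}

// nudgeReslice drops the first n points by re-slicing. The result shares the
// backing array, but its capacity shrinks by n, so once it is Put back into
// the pool it can no longer serve a full-size Get.
func nudgeReslice(points []Point, n int) []Point {
	return points[n:]
}

// nudgeCopy drops the first n points by shifting the tail to the front and
// truncating the length, keeping the original capacity intact.
func nudgeCopy(points []Point, n int) []Point {
	kept := copy(points, points[n:])
	return points[:kept]
}

func main() {
	a := make([]Point, 2880)
	b := make([]Point, 2880)

	a = nudgeReslice(a, 3)
	b = nudgeCopy(b, 3)

	fmt.Println(len(a), cap(a)) // 2877 2877: a later full-size pool Get misses
	fmt.Println(len(b), cap(b)) // 2877 2880: the slice still serves a 2880-point Get
}
```

Either way the nudged data is the same; only the capacity that eventually gets Put back into the pool differs.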

Here is a comparison of two query nodes, q0 and q1.
q0 runs master + stats (#1922).
q1 runs the code from this branch.
As time advances, there are periods where nudging happens, which breaks down the efficacy of the pointslicepool on q0.
q1 doesn't have this problem: we see an increased hit rate, though sadly it doesn't translate into memory savings.

[screenshots: pointslicepool hit-rate comparison, q1 (this branch) vs q0 (master + stats)]

@shanson7 (Collaborator) left a comment


The changes look good to me, but I'm wondering how effective the PSP really is. It would be nice if we could show a definitive workload that is improved, in either memory usage or query execution speed, by use of the PSP. I expect a workload with many requests for points >= DefaultPointSliceSize might see an allocation rate improvement within GC cycles.
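As a side note on how such a comparison could be demonstrated: the benchmark below is a minimal sketch, not metrictank code; the `sync.Pool`-backed pool, the `Point` type and the `sink` variable are assumptions for illustration, and 2880 mirrors the slice size in the logs above.

```go
package pool_test

import (
	"sync"
	"testing"
)

type Point struct {
	Val float64
	Ts  uint32
}

var (
	// a stand-in pool: hands out *[]Point with cap 2880, reused across iterations
	psp  = sync.Pool{New: func() interface{} { s := make([]Point, 0, 2880); return &s }}
	sink []Point // keeps slices reachable so allocations are not optimized away
)

// BenchmarkNoPool allocates a fresh slice per simulated render request.
func BenchmarkNoPool(b *testing.B) {
	b.ReportAllocs()
	for i := 0; i < b.N; i++ {
		s := make([]Point, 0, 2880)
		s = append(s, Point{Val: 1, Ts: uint32(i)})
		sink = s
	}
}

// BenchmarkPool reuses slices through a sync.Pool, approximating the PSP.
func BenchmarkPool(b *testing.B) {
	b.ReportAllocs()
	for i := 0; i < b.N; i++ {
		sp := psp.Get().(*[]Point)
		s := (*sp)[:0]
		s = append(s, Point{Val: 1, Ts: uint32(i)})
		sink = s
		*sp = s
		psp.Put(sp)
	}
}
```

Running it with `go test -bench=. -benchmem` reports allocs/op with and without reuse, which is the allocation-rate signal this comment is after.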

@Dieterbe (Contributor, Author) commented on Oct 15, 2020

I've been thinking of making the PSP configurable and providing a few options: "null", "traditional", "size-classes x/y/z", etc. We can then easily try different options on real workloads.

@robert-milan (Contributor) left a comment


I agree with the other comments. I would not expect to see memory savings, but probably a reduced allocation rate / GC improvements on heavy workloads.

without this fix, you would commonly see that slices going into the
pointslicepool would have a cap that is one or two points less than
what you need on subsequent reads. This fix increases efficacy of
the pool.
@Dieterbe force-pushed the psp-stats-with-nudge-fix branch from d8b5ee5 to e45c4ac on October 15, 2020, 21:24
@Dieterbe merged commit 2edab76 into master on Oct 15, 2020
@Dieterbe deleted the psp-stats-with-nudge-fix branch on October 15, 2020, 21:25
3 participants