This repository was archived by the owner on Aug 23, 2023. It is now read-only.

Speculative queries #956

Merged
merged 9 commits into grafana:master from the speculativeQueries branch on Jul 25, 2018

Conversation

@shanson7 (Collaborator) commented Jun 28, 2018

For #954

I made speculation configurable, but one thing still changes even with speculation disabled: local requests now go via HTTP rather than being handled as a special case. This is both good and bad (good: they happen in parallel with peer requests and tracing just works; bad: a bit of extra overhead).

TODO:

  • Figure out partitions / shard group problem
  • Some functions (e.g. findSeries) don't use peerQuery; fix that
  • Try to more efficiently pre-allocate memory for initial results
  • Documentation

api/cluster.go Outdated
// metric api.cluster.speculative.requests is how many peer queries resulted in speculation
speculativeAttempts = stats.NewCounter32("api.cluster.speculative.attempts")

// metric api.cluster.speculative.requests is how many peer queries were improved due to speculation

@replay (Contributor) commented Jun 29, 2018

the metric name in the comment is not right

api/cluster.go Outdated

// peerQuerySpeculative takes a request and the path to request it on, then fans it out
// across the cluster, except to the local peer. If any peer fails requests to
// other peers are aborted. If 95% of peers have been heard from, and we are missing

Contributor:

speculationThreshold is configurable now
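
For illustration, here is a minimal, hedged sketch of the kind of speculative fan-out described in that doc comment. This is not the actual metrictank peerQuerySpeculative implementation; the names speculativeFanout and queryFn are made up, and error handling is reduced to "any peer failure aborts the query":

```go
// Sketch only: one request per shard group, and once `threshold` of the
// groups has answered, re-issue the still-missing requests speculatively
// (in the real code this would target another replica of the same group).
package main

import (
	"context"
	"fmt"
	"math/rand"
	"time"
)

// queryFn stands in for the per-peer HTTP call.
type queryFn func(ctx context.Context, group int) (string, error)

func speculativeFanout(ctx context.Context, groups []int, threshold float64, query queryFn) (map[int]string, error) {
	type result struct {
		group int
		data  string
		err   error
	}
	results := make(chan result, 2*len(groups)) // room for original + speculative answers
	ctx, cancel := context.WithCancel(ctx)
	defer cancel() // any failure or early return aborts the in-flight requests

	ask := func(g int) {
		go func() {
			data, err := query(ctx, g)
			select {
			case results <- result{g, data, err}:
			case <-ctx.Done():
			}
		}()
	}
	for _, g := range groups {
		ask(g)
	}

	out := make(map[int]string, len(groups))
	speculated := make(map[int]bool)
	for len(out) < len(groups) {
		select {
		case r := <-results:
			if r.err != nil {
				return nil, r.err // a hard failure from any peer aborts the whole query
			}
			out[r.group] = r.data
		case <-ctx.Done():
			return nil, ctx.Err()
		}
		// enough groups have answered: speculatively re-ask the stragglers once
		if float64(len(out)) >= threshold*float64(len(groups)) {
			for _, g := range groups {
				if _, done := out[g]; !done && !speculated[g] {
					speculated[g] = true
					ask(g)
				}
			}
		}
	}
	return out, nil
}

func main() {
	groups := []int{0, 1, 2, 3, 4}
	slowPeer := func(ctx context.Context, g int) (string, error) {
		time.Sleep(time.Duration(rand.Intn(100)) * time.Millisecond) // simulate a slow / GC-ing peer
		return fmt.Sprintf("data for group %d", g), nil
	}
	out, err := speculativeFanout(context.Background(), groups, 0.8, slowPeer)
	fmt.Println(out, err)
}
```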

}
memberStartPartition := member.GetPartitions()[0]

if _, ok := membersMap[memberStartPartition]; !ok {

@replay (Contributor) commented Jun 29, 2018

This seems to determine whether a member is already in the map or not based on its first partition. But I can't see where we are sorting the partitions. If two MTs of a shard are configured to have the same partitions, but they are specified in a different order, wouldn't this break?

Collaborator Author:

That's true. IMO, it seems like the partitions should be sorted at startup. I could add a sort to the SetPartitions function.

Contributor:

I agree that sorting at startup makes the most sense; that way it only needs to be done once.

Contributor:

I just noticed that in cases where the partition IDs are not sorted in the config, it is important to first update to this version of MT without activating speculative queries, and only enable speculative queries once all MTs are on this version. Otherwise querying might temporarily be broken until all MTs are updated, because the older ones still return their partition IDs unsorted. But I guess there is nothing we can do about that.

Collaborator Author:

Well, we could sort the partitions of the cluster peers when we get them. As it stands right now, with or without speculation enabled, the partitions will need to be sorted. That is, unless I change the code as I mentioned in #956 (comment)
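
To make the sort-at-startup idea from this thread concrete, here is a hedged sketch (hypothetical member type, not the actual metrictank SetPartitions) of sorting the partitions when they are set, so that keying a map by GetPartitions()[0] no longer depends on config order:

```go
// Sketch only: sort partitions once, when they are set, so that later
// comparisons (e.g. keying a map by each shard group's first partition)
// are independent of the order used in the config.
package main

import (
	"fmt"
	"sort"
)

type member struct {
	partitions []int32
}

func (m *member) SetPartitions(parts []int32) {
	sorted := make([]int32, len(parts))
	copy(sorted, parts)
	sort.Slice(sorted, func(i, j int) bool { return sorted[i] < sorted[j] })
	m.partitions = sorted
}

func (m *member) GetPartitions() []int32 {
	return m.partitions
}

func main() {
	a, b := &member{}, &member{}
	a.SetPartitions([]int32{4, 0, 2})
	b.SetPartitions([]int32{2, 4, 0}) // same partitions, different config order
	// both now report the same first partition, so the map keys line up
	fmt.Println(a.GetPartitions()[0], b.GetPartitions()[0])
}
```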

@shanson7 force-pushed the speculativeQueries branch from fc5823a to ca4439c on June 29, 2018 20:54

@shanson7 (Collaborator Author):

Status update:

I'm using this on our prod setup. We get about 67 render requests/sec. We have 120 shard groups * 2 replicas (240 total peers). This results in about 8000 peer requests/sec.

Speculation kicks in on darn near all requests; with that many peers, it's highly likely that at least one of them is undergoing a GC pause at any given time.

Here's a snapshot with some of the data:
https://snapshot.raintank.io/dashboard/snapshot/Yy5adwlVIpEo7IyoS3QZj7QuikTsCw27

Of note:

  1. Speculation "win" percentage is frequently in the high 90s, indicating that most requests are aided by speculation.
  2. "Additional" requests to peers (requests that wouldn't be made if speculation were disabled) are frequently less than 5%. (Our speculation-threshold is set at 94%, so at most 7 additional HTTP requests are made per speculative query, making our worst case here ~5.8%; see the sketch below.)
  3. "Win %" seems to increase under load (but so does the p90 response time).

You can see in this snapshot when we upped the load:
https://snapshot.raintank.io/dashboard/snapshot/YAifupVWdmQ8nuIDa58qo4ciEmIWAgwk

Here's a comparison of our render response times before and after rollout (under very light load, 4 render reqs/sec):
https://snapshot.raintank.io/dashboard/snapshot/wBWTIbLGkF2GXiQIEjAjyzRFsG5HMqmB

You can see how much smoother and improved the median and p90 response times are.

@shanson7 changed the title from "WIP - Speculative queries" to "Speculative queries" on Jul 13, 2018

@replay (Contributor) commented Jul 20, 2018

looks great, but could you fix the test please? this might be what you want: afc60d2

@shanson7 (Collaborator Author):

I haven't verified it, but I'm pretty sure that the existing peer query mechanism required peers to have the same partitions as their comrades. Speculative peer queries absolutely require it. Is it worth supporting something like:

```go
if speculation-threshold < 1 {
   peerQuerySpeculatively
} else {
   peerQueryOldWay
}
```

@replay (Contributor) commented Jul 24, 2018

So far I've not heard of any case where people configured their MT partitions such that the partitions are not the same for all instances in a shard (that's what you mean, right?). I think it's probably not worth the additional complexity just to support this very rare edge case. What do you think, @woodsaj?

@woodsaj (Member) commented Jul 24, 2018

the docs state that when you have multiple replicas, partitions must be assigned in groups.
https://github.com/grafana/metrictank/blob/master/docs/clustering.md#combining-metrictanks-horizontal-scaling-plus-high-availability

@replay (Contributor) left a comment:

LGTM

@replay merged commit a7f011d into grafana:master on Jul 25, 2018
@shanson7 deleted the speculativeQueries branch on July 30, 2018 19:36

@Dieterbe (Contributor) commented Aug 6, 2018

> Here's a comparison of our render response times before and after rollout (under very light load, 4 render reqs/sec):
> https://snapshot.raintank.io/dashboard/snapshot/wBWTIbLGkF2GXiQIEjAjyzRFsG5HMqmB
> You can see how much smoother and improved the median and p90 response times are.

Interestingly:

  • Win % is way lower than before. Probably due to the very light load?
  • I see that there were 2 periods wherein speculation was enabled. In the first period, the latencies were still elevated. Is this because speculation wasn't fully rolled out across the cluster? (Additional HTTP is lower in the first period compared to the second.)

Otherwise, all these numbers look great :)

@shanson7 (Collaborator Author) commented Aug 6, 2018

> Win % is way lower than before. Probably due to the very light load?

Correct. Here's a snapshot under heavier load (with real user queries):
https://snapshot.raintank.io/dashboard/snapshot/IoV6Z91WYgh3VFmlB133fr2xVxnH9B4m

> I see that there were 2 periods wherein speculation was enabled. In the first period, the latencies were still elevated. Is this because speculation wasn't fully rolled out across the cluster? (Additional HTTP is lower in the first period compared to the second.)

Yes, this was still during the rollout.

@shanson7 (Collaborator Author) commented Aug 6, 2018

Of note, this optimization likely provides more value the larger the cluster is. With our 120-shard-group cluster, it's been great.
