Reduce amount of caches POSITIONS we send #16561

erikjohnston · 2023-10-27T14:16:03Z

Follow on from / actually correctly does #16557

erikjohnston · 2023-10-27T14:17:28Z

synapse/replication/tcp/streams/_base.py

@@ -489,6 +497,8 @@ def current_token(self, instance_name: str) -> Token:
        return self.store.get_cache_stream_token_for_writer(instance_name)

    def minimal_local_current_token(self) -> Token:
+        if self.store._cache_id_gen:
+            return self.store._cache_id_gen.get_minimal_local_current_token()


_cache_id_gen is None on SQLite

Why is the cache stream special?

I think at the time we didn't have a good way of doing it on SQLite, or something?

Sorry, I think I mean two questions:

Why are we handling the caches stream separately to the others? Presumably b/c it is especially chatty and we want to ratelimit the updates sent out?

What's the difference between self.store._cache_id_gen.get_minimal_local_current_token() and self.current_token(self.local_instance_name)?

Why are we handling the caches stream separately to the others? Presumably b/c it is especially chatty and we want to ratelimit the updates sent out?

This seems to be the case, by my reading of https://github.com/matrix-org/synapse/pull/16557/files#diff-844ba8f7be8c32eb75cc8092e1c48528797f6a8e1eeada942724ba6f73923a9cR214-R218

Ahh, I'd missed that this was inside the CachesStream class, I thought this was base logic for all streams.

I guess the point is that if there's no _cache_id_gen then there are no other workers to worry about and so the distinction between minimum and current tokens is moot?

Are there any other streams whose minimal_local_current_token impl we should sanity check?

I guess the point is that if there's no _cache_id_gen then there are no other workers to worry about and so the distinction between minimum and current tokens is moot?

Yup

Are there any other streams whose minimal_local_current_token impl we should sanity check?

I don't think so. Most of the others can just rely directly on the ID gens (and there's a helper class to do that)

I mean more that e.g. the function bodies here don't call something with the word "minimal" in:

synapse/synapse/replication/tcp/streams/federation.py

Lines 74 to 75 in 8f35f81

def minimal_local_current_token(self) -> Token:

return self.current_token(self.local_instance_name)

synapse/synapse/replication/tcp/streams/_base.py

Lines 389 to 390 in 8f35f81

def minimal_local_current_token(self) -> Token:

return self.current_token_function()

synapse/synapse/replication/tcp/streams/_base.py

Lines 343 to 344 in 8f35f81

def minimal_local_current_token(self) -> Token:

return self._federation_queue.get_current_token(self.local_instance_name)

Right, so yeah you're right that they're suboptimal implementations, but they are also valid implementations. We mostly care about the difference for caches as a) its high traffic, and b) we have an extra check for it that wants minimal_local_current_token to return the actual minimum

synapse/replication/tcp/streams/_base.py

erikjohnston added 2 commits October 27, 2023 15:15

Reduce amount of caches POSITIONS we send

166848a

Newsfile

d4a7aa4

erikjohnston commented Oct 27, 2023

View reviewed changes

DMRobertson reviewed Oct 27, 2023

View reviewed changes

synapse/replication/tcp/streams/_base.py Show resolved Hide resolved

erikjohnston marked this pull request as ready for review October 27, 2023 14:22

erikjohnston requested a review from a team as a code owner October 27, 2023 14:22

DMRobertson approved these changes Oct 27, 2023

View reviewed changes

erikjohnston enabled auto-merge (squash) October 27, 2023 14:48

erikjohnston disabled auto-merge October 27, 2023 15:07

erikjohnston merged commit 5413cef into develop Oct 27, 2023
39 of 41 checks passed

erikjohnston deleted the erikj/less_replication_traffic branch October 27, 2023 15:07

clokep mentioned this pull request Nov 17, 2023

Also discard 'caches' and 'backfill' stream POSITIONS #16655

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce amount of caches POSITIONS we send #16561

Reduce amount of caches POSITIONS we send #16561

erikjohnston commented Oct 27, 2023 •

edited

Loading

erikjohnston Oct 27, 2023

DMRobertson Oct 27, 2023

erikjohnston Oct 27, 2023

DMRobertson Oct 27, 2023 •

edited

Loading

DMRobertson Oct 27, 2023

DMRobertson Oct 27, 2023

DMRobertson Oct 27, 2023

erikjohnston Oct 27, 2023

DMRobertson Oct 27, 2023

erikjohnston Oct 27, 2023

	def minimal_local_current_token(self) -> Token:
	return self.current_token(self.local_instance_name)

	def minimal_local_current_token(self) -> Token:
	return self.current_token_function()

	def minimal_local_current_token(self) -> Token:
	return self._federation_queue.get_current_token(self.local_instance_name)

Reduce amount of caches POSITIONS we send #16561

Reduce amount of caches POSITIONS we send #16561

Conversation

erikjohnston commented Oct 27, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

DMRobertson Oct 27, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

erikjohnston commented Oct 27, 2023 •

edited

Loading

DMRobertson Oct 27, 2023 •

edited

Loading