This repository has been archived by the owner on Apr 26, 2024. It is now read-only.
Waiting for streams to catch up when processing HTTP replication can get stuck timing out #15136
Labels
A-Workers: Problems related to running Synapse in Worker Mode (or replication)
O-Uncommon: Most users are unlikely to come across this or unexpected workflow
S-Major: Major functionality / product severely impaired, no satisfactory workaround.
T-Defect: Bugs, crashes, hangs, security vulnerabilities, or other reported issues.
#14820 (introduced in v1.76.0) added code to wait for streams to catch up when receiving HTTP replication responses.
We've seen that code hit a failure mode where all waits on the `backfill` stream start timing out, on a deployment where backfill is very rare. It's not yet clear how Synapse entered that state or how to reproduce it.
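
To make the failure mode concrete, here is a minimal sketch of a "wait for a stream to catch up, with a timeout" helper. It is not Synapse's code: Synapse is built on Twisted, whereas this uses `asyncio`, and `StreamPositionTracker`, `advance_to`, `wait_for_position` and the positions and timeouts used are all hypothetical. It only illustrates why a wait on a rarely-advancing stream can end solely by timing out if the awaited position never arrives.

```python
import asyncio
from typing import Tuple


class StreamPositionTracker:
    """Tracks the latest position seen on one stream (hypothetical helper)."""

    def __init__(self, name: str, position: int = 0) -> None:
        self.name = name
        self.position = position
        self._changed = asyncio.Condition()

    async def advance_to(self, position: int) -> None:
        # Called when an update for this stream arrives; positions only move forward.
        async with self._changed:
            if position > self.position:
                self.position = position
                self._changed.notify_all()

    async def wait_for_position(self, target: int, timeout: float) -> bool:
        # Returns True once the stream has reached `target`, False on timeout.
        async def _wait() -> None:
            async with self._changed:
                await self._changed.wait_for(lambda: self.position >= target)

        try:
            await asyncio.wait_for(_wait(), timeout)
            return True
        except asyncio.TimeoutError:
            return False


async def demo() -> Tuple[bool, bool]:
    tracker = StreamPositionTracker("backfill")

    async def late_update() -> None:
        # Simulates an update arriving shortly after the wait begins.
        await asyncio.sleep(0.01)
        await tracker.advance_to(5)

    task = asyncio.create_task(late_update())
    # The awaited position arrives, so this wait succeeds.
    caught_up = await tracker.wait_for_position(5, timeout=1.0)
    await task
    # Nothing ever advances the stream to 6, so this wait can only time out.
    stuck = await tracker.wait_for_position(6, timeout=0.05)
    return caught_up, stuck
```

In this sketch, `asyncio.run(demo())` returns one successful wait followed by one timed-out wait. If a caller is handed a target position that the local stream never reaches, every subsequent wait on that stream behaves like the second call.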