
Await for a chain worker to be available for eviction #3037

Open · jvff wants to merge 9 commits into main from the remove-worker-cache-eviction-delays branch
Conversation

@jvff (Contributor) commented Dec 16, 2024

Motivation

The WorkerState contains an LRU cache of chain workers (more specifically, it keeps open channels to the ChainWorkerActors so that they keep running). Once the cache becomes full, requests for new chain workers randomly attempt to evict an idle chain worker.

Eviction was done by looking for an idle chain worker, iterating over the endpoints in least-recently-used order: starting with the endpoint that was used longest ago, the code checked whether it had a strong count of one or less. If it did, every clone of that endpoint held by a requester had already been dropped, which only happens after the response is received, so the worker was idle.

If no idle worker was found, there was no eviction candidate, because every worker appeared to be handling a request. Evicting a chain worker before it finishes handling a request and later starting another chain worker for the same chain while that request is still executing creates a race condition that could corrupt the chain state data. Therefore, if there was no eviction candidate, the code waited 250 milliseconds and retried, repeatedly, for up to three seconds before giving up with an error.

This whole eviction process could run concurrently among many requesters, which led to possible unfairness, since there was no guarantee which one would succeed. It also introduced undesirable delays between retries.
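For context, the old flow looked roughly like the sketch below. This is a minimal illustration only, not the actual linera-core code: the function name and error message are hypothetical, and try_evict_idle_worker stands in for the scan over the LRU cache described above.

    use std::time::{Duration, Instant};

    /// Hypothetical sketch of the old polling behaviour: poll for an idle
    /// chain worker to evict, sleeping 250 ms between attempts, for up to
    /// three seconds before giving up with an error.
    async fn acquire_slot_with_polling(
        mut try_evict_idle_worker: impl FnMut() -> bool,
    ) -> Result<(), &'static str> {
        const RETRY_DELAY: Duration = Duration::from_millis(250);
        const TIMEOUT: Duration = Duration::from_secs(3);

        let start = Instant::now();
        loop {
            // Walk the LRU cache from least- to most-recently used and evict
            // the first endpoint whose strong count shows it is idle.
            if try_evict_idle_worker() {
                return Ok(());
            }
            if start.elapsed() >= TIMEOUT {
                return Err("no idle chain worker could be evicted");
            }
            tokio::time::sleep(RETRY_DELAY).await;
        }
    }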

Proposal

Use a Semaphore to establish a queue among the chain worker requests. This way there's certainty that, once a permit is acquired, there will be at least one idle chain worker ready to be evicted.

To implement this change, a new ChainWorkers type was created to manage the cache, and ChainActorEndpoint became a new type so that it can hold the semaphore permit.
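A minimal sketch of how these pieces could fit together is shown below. Apart from the ChainWorkers, ChainActorEndpoint and ChainRequestSender names, everything is a simplified assumption (a u64 chain ID, senders wrapped in an Arc, a placeholder message type), so this illustrates the idea rather than the actual linera-core implementation.

    use std::{
        num::NonZeroUsize,
        sync::{Arc, Mutex},
    };

    use lru::LruCache;
    use tokio::sync::{mpsc, OwnedSemaphorePermit, Semaphore};

    type ChainId = u64; // placeholder for the real chain ID type
    type ChainRequestSender = Arc<mpsc::UnboundedSender<String>>; // placeholder message type

    // Holding the permit for as long as the endpoint is alive ties cache
    // occupancy to semaphore capacity.
    struct ChainActorEndpoint {
        sender: ChainRequestSender,
        _permit: OwnedSemaphorePermit,
    }

    struct ChainWorkers {
        cache: Arc<Mutex<LruCache<ChainId, ChainRequestSender>>>,
        active_endpoints: Arc<Semaphore>,
    }

    impl ChainWorkers {
        fn new(limit: NonZeroUsize) -> Self {
            Self {
                cache: Arc::new(Mutex::new(LruCache::new(limit))),
                active_endpoints: Arc::new(Semaphore::new(limit.get())),
            }
        }

        async fn get_endpoint(&self, chain_id: ChainId) -> ChainActorEndpoint {
            // Waiting on the semaphore forms a queue among requesters; once a
            // permit is held, at least one cached worker must be idle.
            let permit = self
                .active_endpoints
                .clone()
                .acquire_owned()
                .await
                .expect("the semaphore is never closed");
            let mut cache = self.cache.lock().unwrap();
            let sender = if let Some(sender) = cache.get(&chain_id) {
                sender.clone()
            } else {
                if cache.len() >= cache.cap().get() {
                    // Evict the least-recently used worker with no request in
                    // flight: only the cache itself still holds its sender.
                    let (&victim, _) = cache
                        .iter()
                        .rev()
                        .find(|(_, sender)| Arc::strong_count(sender) <= 1)
                        .expect("a permit is held, so an idle worker must exist");
                    cache.pop(&victim).expect("victim was just found in the cache");
                }
                // In the real code a ChainWorkerActor task would be spawned
                // and given the receiving end of this channel.
                let (sender, _receiver) = mpsc::unbounded_channel();
                let sender = Arc::new(sender);
                cache.push(chain_id, sender.clone());
                sender
            };
            ChainActorEndpoint { sender, _permit: permit }
        }
    }

Dropping a ChainActorEndpoint releases its permit, so the next queued requester wakes up knowing that at least one cache slot is idle again.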

Test Plan

CI should catch any regressions to the happy path, and #3052 should add a new test for the extreme load case.

Release Plan

  • Nothing to do, because this is just a refactor that should follow the normal release cycle.

Links

@jvff jvff added the enhancement New feature or request label Dec 16, 2024
@jvff jvff added this to the Testnet #2 milestone Dec 16, 2024
@jvff jvff self-assigned this Dec 16, 2024
@jvff jvff force-pushed the remove-worker-cache-eviction-delays branch 2 times, most recently from 598e667 to 43fb237, on December 17, 2024 at 14:23
jvff added 9 commits December 17, 2024 18:31
  • Prepare to move the chain worker cache handling out of the `WorkerState` type.
  • Simplify `WorkerState` by using the new type to handle how the chain worker cache is evicted.
  • Create a separate `handle_request` method with the code to handle a single request.
  • Prepare to change the eviction procedure.
  • Split the previous method to make it more readable.
  • Ensure that the synchronous `Mutex` is locked and used in a separate synchronous method.
  • Replace the old alias of the same name with `ChainRequestSender`; the goal is for the new type to also hold a permit to indicate it is using a chain worker.
  • Ensure that attempts to use chain workers when the cache is full are fair.
  • Remove the retry delays, which are no longer needed: eviction is now only performed once a permit is held, guaranteeing that there's at least one chain worker that's idle.
@jvff jvff force-pushed the remove-worker-cache-eviction-delays branch from 43fb237 to e5ffc04 on December 17, 2024 at 19:08
@jvff jvff changed the title from "[WIP] Await for a chain worker to be available for eviction" to "Await for a chain worker to be available for eviction" on Dec 17, 2024
@jvff jvff marked this pull request as ready for review December 17, 2024 19:59
@jvff jvff requested review from afck, Twey, ma2bd and deuszx December 17, 2024 19:59
linera-core/src/chain_worker/actor.rs
Comment on lines +59 to +60
cache: Arc::new(Mutex::new(LruCache::new(limit))),
active_endpoints: Arc::new(Semaphore::new(limit.get())),
A reviewer (Contributor) commented:
Does using the same limit here mean that if limit requests are in flight to a single chain worker, and there's another incoming request for a different worker, we will now block that new request even though a slot would be free?

Another reviewer (Contributor) replied:

Good catch! It does seem like we're first trying to acquire a self.active_endpoints Semaphore even if there already exists a chain worker for the same chain_id.

@jvff (Contributor, Author) replied on Dec 18, 2024:

Yes, that is true. I thought about having a map of permits, so that requests that get queued anyway would reuse the existing permit. However, that may lead to starvation because a chain worker that is heavily used will never release its permit.

The solution I then considered was to have another semaphore for acquiring each permit, but I thought the resulting "permit-ception" would be over-engineering. I'm not sure how to proceed here, and would love some feedback and ideas.

I still think it's okay to improve (fix?) this in a follow-up PR because:

  • this impacts a (hopefully) edge case (lots of requests for a single chain);
  • the worst outcome is degraded performance;
  • and it comes with the benefit of being fairer in extreme scenarios (all chains getting many requests).

A reviewer (Contributor) replied:

I don't think it's an edge case to have several requests for the same chain worker, and every time that happens we underutilize at least one other chain worker.

But I don't have a better idea at the moment either.

let mut cache = self.cache.lock().unwrap();

if let Some(sender) = cache.get(&chain_id) {
Ok(ChainActorEndpoint::new(sender.clone(), permit))
A reviewer (Contributor) commented:

(See my previous comment.) Would it be better to clone the existing permit here?

.iter()
.rev()
.find(|(_, candidate_endpoint)| candidate_endpoint.strong_count() <= 1)
.expect("`stop_one` should only be called while holding a permit for an endpoint");
A reviewer (Contributor) commented:

Can there be a race condition where the permit is already freed but the reference count hasn't been set to 0 yet?

@jvff (Contributor, Author) replied:

Good point! I was lucky that I declared the fields in ChainActorEndpoint in the right order, so that the channel sender is always dropped before the permit 😅
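For reference, Rust drops struct fields in declaration order, so a layout like the following guarantees that the sender, and with it the strong count, goes away before the permit is released back to the semaphore. This is a sketch with illustrative field and message types, not necessarily the actual definition.

    use tokio::sync::{mpsc, OwnedSemaphorePermit};

    // Fields are dropped top to bottom: the sender is dropped first, lowering
    // the strong count, and only then is the permit returned to the semaphore.
    pub struct ChainActorEndpoint {
        sender: mpsc::UnboundedSender<String>, // dropped first
        permit: OwnedSemaphorePermit,          // dropped second, freeing the slot
    }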

@deuszx (Contributor) left a comment:

How does this PR affect the semantics of getting the chain worker endpoint? Previously we'd try to find a chain worker (for eviction) and, if none was available, we'd wait 250 ms and repeat that for 3 seconds (approximately 12 times). Can you put that into the PR description?

@ma2bd (Contributor) commented Dec 18, 2024:

Ok so since #3052 is pending, we don't know how much this helps, do we?

@jvff (Contributor, Author) commented Dec 18, 2024:

> Ok so since #3052 is pending, we don't know how much this helps, do we?

I don't think we need to measure that now. The goal is to remove the delays and to make it fairer.
