Lint for holding locks across await points #5439

rokob · 2020-04-08T17:49:21Z

This introduces the lint await_holding_lock. For async functions, we iterate
over all types in generator_interior_types and look for types named MutexGuard,
RwLockReadGuard, or RwLockWriteGuard. If we find one then we emit a lint.

changelog: introduce the await_holding_lock lint

yaahc

Looks great!

yaahc · 2020-04-08T23:44:15Z

tests/ui/await_holding_lock.stderr

+LL | |
+LL | |     let second = baz().await;
+LL | |
+...  |


the ... here from it skipping all but one of the awaits this is held thru is particularly hilarious

Manishearth · 2020-04-09T04:57:09Z

clippy_lints/src/await_holding_lock.rs

+    "Inside an async function, holding a MutexGuard while calling await"
+}
+
+const MUTEX_GUARD_TYPES: [&str; 3] = ["MutexGuard", "RwLockReadGuard", "RwLockWriteGuard"];


These should be full paths in util/paths.rs

Then we can just use match_def_path

I went this route to capture:

std::sync::mutex::MutexGuard futures_locks::MutexGuard parking_lot::MutexGuard

with one check. Similarly for the others.

I will change to using full paths as that certainly reduces false positives, but at the cost of false negatives for custom guard types.

Do we want this in the first place for custom guard types from async executors (?)

I initially thought yes you wanted this for async-aware locks too, but that seems to be an incorrect assumption on my part. We can leave it to std::sync and parking_lot:: as there does not appear to be any disagreement on that being wrong.

Manishearth · 2020-04-09T04:58:48Z

clippy_lints/src/await_holding_lock.rs

+            return;
+        }
+
+        for ty_clause in &cx.tables.generator_interior_types {


how does this work?

My understanding based on the way check_fn gets called is that cx.tables gets set up here: https://github.com/rust-lang/rust/blob/58dd1ce8383aaebcad9b6027b89a316fd868b35c/src/librustc_lint/late.rs#L184

I am pretty sure these are the same tables as:

let def_id = cx.tcx.hir().local_def_id(hir_id); let tables = cx.tcx.typeck_tables_of(def_id);

where hir_id is passed into check_fn and cx is the LateContext.

Okay so that explains the where the TypeckTables come from. Then generator_interior_types is a vector of GeneratorInteriorTypeCause. The docs for that type says

Whenever a value may be live across a generator yield, the type of that value winds up in the GeneratorInteriorTypeCause struct.

So if a type ends up inside one of these TypeCause structs inside said vector then it is being held across a yield (which I am assuming is coming from an await).

Reading this now I realize I used the variable name "ty_clause" when I probably should have used "ty_cause".

Ah, TIL cx.tables gets updated per function, it used to work differently

jimblandy · 2020-04-12T20:31:50Z

The original post in #4226 talks about the reactor running other code while the mutex is locked, and while this is true, it's misleading about why that's a problem.

In an ordinary multi-threaded program, other code runs while locks are held, too. Nothing else can lock the mutex, so the data it owns is safe. So this isn't a concern.
If your task is spawned to another thread, or on a thread pool, then the guard could be dropped on a different thread than it was acquired on, which is UB for most mutex implementations. But MutexGuard isn't Send, so you'd get an error trying to spawn the future on such an executor anyway. So this isn't a concern.
Even if your task always stays on the same thread, it might race the lock-holding future against another future that tries to lock the same mutex. std::sync::Mutex panics or deadlocks if the thread that owns it tries to lock it again. This is a legitimate concern.

However, async-std and tokio both provide their own versions of Mutex that should be fine to use in asynchronous code. Their lock operations block asynchronously, they work properly when the same thread attempts to re-lock, and their guards are Send, so futures of code that holds them across awaits can be spawned onto other threads. The lint should recommend using these, instead of std::async.

This is another reason for matching the full path of the type: just because it's named Mutex doesn't mean you shouldn't hold it across an await.

rokob · 2020-04-14T16:03:26Z

@jimblandy Thank you for that extra detail. Everything you say is correct, but I want to add the additional issue as I understand it based on #4893. Most often one does not want to hold a lock across an await even if it is "safe" to do so.

My initial impression was that usually one should not be using std::sync::Mutex in async code AND not holding a MutexGuard across an await point. The former is a technical issue which can be solved with a different mutex, but the latter is a logic bug that could lead to deadlock. Maybe that is not the intention here.

For my own understanding, do we want to lint against holding a guard across an await point because it is similar to holding a lock and calling out to arbitrary code in non-async code? Or do we want to lint against it because the lock implementation itself has reentrancy issues?

At the very least we should say you don't want to use std::sync::Mutex in this context and put something like you suggested in rust-lang/rust#71072:

help: If you need to hold a mutex guard while you're awaiting, you must use an async-aware version of the Mutex type.
help: Many asynchronous foundation crates provide such a Mutex type.

But should we make the stronger statement that holding any MutexGuard is a sign of a potential logic issue?

jimblandy · 2020-04-14T21:32:55Z

For my own understanding, do we want to lint against holding a guard across an await point because it is similar to holding a lock and calling out to arbitrary code in non-async code? Or do we want to lint against it because the lock implementation itself has reentrancy issues?

This is a good question - Clippy is meant to give advice, so it makes sense for Clippy to report usage that is usually ill-advised, even if not necessarily wrong.

One of the things users of mutexes are supposed to know is that they are intended for low-contention use. Mutexes are great for short, get-in-and-get-out critical sections. Threads waiting on a mutex aren't woken up in any particular order, so a highly-contended mutex can cause starvation, or unreasonable latency, at least. I think there are other reasons, too.

Awaiting is often/usually waiting for I/O or other external activities to complete. If you're holding a mutex across such an an await, that lock may be held for a long time, and you are likely to end up with a highly contended mutex, and experience avoidable sadnesses.

So there are several different potential topics for lints here:

Using a std::sync::Mutex in async code:

a) If the MutexGuard is held across an await, this is certainly a bug, because std::sync::Mutex isn't prepared for other tasks on the same thread to try to lock it. Definite lint.

b) If the MutexGuard is not held across an await, this could be fine. If the Mutex is used in the classic get-in-and-get-out pattern, this won't even violate the principle that async tasks should not run for a long time. Unclear whether a lint is helpful or noise.
Using an async-aware Mutex, like async_std::sync::Mutex or tokio::sync::Mutex:

a) If it's not held across an await, it's not clear what the point of using this type is, instead of the standard library's. But seems barely worth linting - maybe they just want to not get in trouble if they add an await in the future.

b) If it is held across an await, then that's not incorrect, but it suggests that you might end up contending more than you'd like. Maybe lint, pointing out that Mutexes are best for low-contention use?

(Maybe the async-aware Mutexes have additional features to mitigate the consequences of contention? Fair wakeups? If so, I don't see it in the docs.)

But should we make the stronger statement that holding any MutexGuard is a sign of a potential logic issue?

If it's an async-aware MutexGuard, I don't think it is a logic issue. At least, I don't see one. No other task (or thread) is going to get access to the data it owns until it's unlocked, and nobody's going to deadlock/panic trying to take the lock.

jimblandy · 2020-04-14T21:45:35Z

"Get in, get out": https://www.youtube.com/watch?v=dht_3NziwSw&feature=youtu.be&t=111

Manishearth · 2020-04-14T22:01:27Z

a) If it's not held across an await, it's not clear what the point of using this type is, instead of the standard library's

backpressure, for one. plus the runtime's scheduler can be smarter about it

tmandry · 2020-04-14T23:51:55Z

I agree we should limit the lint to non async aware Mutex types (and sorry for leading you down the wrong path before, somehow it didn't occur to me that these had the same name as their non-async counterparts!) I think there are plenty of legitimate use cases for these in async code and am worried about introducing a lint which is too noisy.

More experience may prove me wrong, but I personally don't have data to support the idea that we should lint against using async-aware mutexes in async code.

rokob · 2020-04-17T06:56:11Z

I updated the language here and changed the method for finding what to lint so this also captures blocks and closures. I was trying to make the diagnostics more specific and came across rust-lang/rust#71137 and there is rust-lang/rust#71203 open to fix that. Having that extra span would make the diagnostics here nicer but its okay without it too.

Manishearth · 2020-04-22T00:28:00Z

@bors r+

thanks!

bors · 2020-04-22T00:28:01Z

📌 Commit ba18dde has been approved by Manishearth

bors · 2020-04-22T00:28:11Z

⌛ Testing commit ba18dde with merge 2861c78...

Lint for holding locks across await points Fixes #4226 This introduces the lint await_holding_lock. For async functions, we iterate over all types in generator_interior_types and look for types named MutexGuard, RwLockReadGuard, or RwLockWriteGuard. If we find one then we emit a lint. changelog: introduce the await_holding_lock lint

bors · 2020-04-22T00:31:08Z

💔 Test failed - checks-action_test

Fixes rust-lang#4226 This introduces the lint await_holding_lock. For async functions, we iterate over all types in generator_interior_types and look for types named MutexGuard, RwLockReadGuard, or RwLockWriteGuard. If we find one then we emit a lint.

…t of the path

…n other mutex types

…d of just a span

flip1995 · 2020-04-22T15:50:28Z

@bors r=Manishearth

bors · 2020-04-22T15:50:30Z

📌 Commit 8b052d3 has been approved by Manishearth

bors · 2020-04-22T15:50:37Z

⌛ Testing commit 8b052d3 with merge 1d4dd3d...

bors · 2020-04-22T16:10:47Z

☀️ Test successful - checks-action_dev_test, checks-action_remark_test, checks-action_test
Approved by: Manishearth
Pushing 1d4dd3d to master...

Add lint for holding RefCell Ref across an await Fixes #6008 This introduces the lint await_holding_refcell_ref. For async functions, we iterate over all types in generator_interior_types and look for `core::cell::Ref` or `core::cell::RefMut`. If we find one then we emit a lint. Heavily cribs from: #5439 changelog: introduce the await_holding_refcell_ref lint

flip1995 requested a review from yaahc April 8, 2020 20:25

flip1995 added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties label Apr 8, 2020

yaahc approved these changes Apr 8, 2020

View reviewed changes

yaahc requested a review from Manishearth April 8, 2020 23:44

Manishearth reviewed Apr 9, 2020

View reviewed changes

DevinR528 mentioned this pull request Apr 14, 2020

lazy_static rhai Engine casbin/casbin-rs#112

Closed

rokob force-pushed the lock-await branch 7 times, most recently from 4bbecf4 to ba18dde Compare April 17, 2020 06:53

flip1995 requested a review from Manishearth April 17, 2020 14:16

Manishearth approved these changes Apr 22, 2020

View reviewed changes

rokob added 3 commits April 21, 2020 21:07

Switch to matching against full paths instead of just the last elemen…

2dc8c08

…t of the path

don't test the code in the lint docs

54e7f7e

rokob added 2 commits April 21, 2020 21:07

Make lint also capture blocks and closures, adjust language to mentio…

d6e55e9

…n other mutex types

span_lint_and_note now takes an Option<Span> for the note_span instea…

8b052d3

…d of just a span

rokob force-pushed the lock-await branch from ba18dde to 8b052d3 Compare April 22, 2020 04:29

bors merged commit 1d4dd3d into rust-lang:master Apr 22, 2020

tmandry mentioned this pull request Jul 10, 2020

Draft RFC to add a must_not_await lint rust-lang/wg-async#16

Merged

This was referenced Sep 5, 2020

Lint for using await while holding RefCell Ref/RefMut #6008

Closed

Add lint for holding RefCell Ref across an await #6029

Merged

jamesbornholt mentioned this pull request Sep 17, 2021

Fix waker behavior when invoked before a poll finishes awslabs/shuttle#54

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lint for holding locks across await points #5439

Lint for holding locks across await points #5439

rokob commented Apr 8, 2020

yaahc left a comment

yaahc Apr 8, 2020

Manishearth Apr 9, 2020

Manishearth Apr 9, 2020

rokob Apr 10, 2020

Manishearth Apr 13, 2020

rokob Apr 14, 2020

Manishearth Apr 9, 2020

rokob Apr 10, 2020

Manishearth Apr 13, 2020

jimblandy commented Apr 12, 2020 •

edited

Loading

rokob commented Apr 14, 2020

jimblandy commented Apr 14, 2020 •

edited

Loading

jimblandy commented Apr 14, 2020

Manishearth commented Apr 14, 2020

tmandry commented Apr 14, 2020

rokob commented Apr 17, 2020

Manishearth commented Apr 22, 2020

bors commented Apr 22, 2020

bors commented Apr 22, 2020

bors commented Apr 22, 2020

flip1995 commented Apr 22, 2020

bors commented Apr 22, 2020

bors commented Apr 22, 2020

bors commented Apr 22, 2020

Lint for holding locks across await points #5439

Lint for holding locks across await points #5439

Conversation

rokob commented Apr 8, 2020

yaahc left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jimblandy commented Apr 12, 2020 • edited Loading

rokob commented Apr 14, 2020

jimblandy commented Apr 14, 2020 • edited Loading

jimblandy commented Apr 14, 2020

Manishearth commented Apr 14, 2020

tmandry commented Apr 14, 2020

rokob commented Apr 17, 2020

Manishearth commented Apr 22, 2020

bors commented Apr 22, 2020

bors commented Apr 22, 2020

bors commented Apr 22, 2020

flip1995 commented Apr 22, 2020

bors commented Apr 22, 2020

bors commented Apr 22, 2020

bors commented Apr 22, 2020

jimblandy commented Apr 12, 2020 •

edited

Loading

jimblandy commented Apr 14, 2020 •

edited

Loading