Replace RwLock by a futex based one on Linux. #95762

m-ou-se · 2022-04-07T10:38:17Z

This replaces the pthread-based RwLock on Linux with a futex-based one.

This implementation is similar to the algorithm suggested by @kprotty, but modified to prefer writers and to spin before sleeping. It uses two futexes: one for the readers to wait on, and one for the writers to wait on. The readers futex contains the state of the RwLock: the number of readers, a bit indicating whether writers are waiting, and a bit indicating whether readers are waiting. The writers futex is used as a simple condition variable and its contents are meaningless; it just needs to change on every notification.
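As a rough illustration of the layout described above, here is a minimal sketch of how the two futex words and the state bits could be arranged. The exact bit positions, the `MASK` constant, the struct name `RwLockSketch`, the field name `writer_notify`, and the helper functions are assumptions made for illustration; only `READ_LOCKED`, `WRITE_LOCKED`, `MAX_READERS`, `READERS_WAITING` and `readers()` are names that appear in this thread.

```rust
use std::sync::atomic::AtomicU32;

// Assumed layout: the low 30 bits count readers, and the all-ones pattern
// in those bits is reserved to mean "write locked".
const READ_LOCKED: u32 = 1;
const MASK: u32 = (1 << 30) - 1;
const WRITE_LOCKED: u32 = MASK;
const MAX_READERS: u32 = MASK - 1;
// Assumed positions for the two "someone is sleeping" bits.
const READERS_WAITING: u32 = 1 << 30;
const WRITERS_WAITING: u32 = 1 << 31;

/// Hypothetical container for the two futex words.
pub struct RwLockSketch {
    /// Readers wait on this word; it encodes the whole lock state.
    pub state: AtomicU32,
    /// Writers wait on this word; only the fact that it changes matters.
    pub writer_notify: AtomicU32,
}

impl RwLockSketch {
    pub const fn new() -> Self {
        Self { state: AtomicU32::new(0), writer_notify: AtomicU32::new(0) }
    }
}

/// Number of readers (or `WRITE_LOCKED` if a writer holds the lock).
fn readers(state: u32) -> u32 {
    state & MASK
}

fn is_write_locked(state: u32) -> bool {
    state & MASK == WRITE_LOCKED
}

fn has_readers_waiting(state: u32) -> bool {
    state & READERS_WAITING != 0
}

fn has_writers_waiting(state: u32) -> bool {
    state & WRITERS_WAITING != 0
}
```

With this encoding, a state of `(3 * READ_LOCKED) | WRITERS_WAITING` would mean three active readers with at least one writer asleep.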

Using two futexes rather than one has the obvious advantage of allowing separate queues for readers and writers. It also avoids a problem a single-futex RwLock would have: a writer would find it hard to go to sleep while the number of readers is rapidly changing up and down, because a futex wait returns immediately when the word no longer holds the expected value. Here, the writers futex is only changed when we actually want to wake up a writer.

It always prefers writers, as we decided here.

It relies on futex_wake to return the number of awoken threads to be able to handle write-unlocking while both the readers-waiting and writers-waiting bits are set. Instead of waking both and letting them race, it first wakes writers and only continues to wake the readers too if futex_wake reported there were no writers to wake up.
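The wake-up order described above can be sketched roughly as follows. This is not the code from the PR: the function name, the closure parameters standing in for the two futex wake calls, and the bit positions are all assumptions, and the real implementation would issue futex syscalls instead of calling closures. The `wake_writer` closure is assumed to return `true` only if a sleeping writer was actually woken.

```rust
use std::sync::atomic::{AtomicU32, Ordering::Relaxed};

// Assumed bit positions for the two "waiting" flags.
const READERS_WAITING: u32 = 1 << 30;
const WRITERS_WAITING: u32 = 1 << 31;

/// Sketch of the wake-up step after a write-unlock has left the lock free.
/// `wake_writer` and `wake_readers` are hypothetical stand-ins for futex wakes.
pub fn wake_writer_or_readers(
    state: &AtomicU32,
    mut wake_writer: impl FnMut() -> bool,
    mut wake_readers: impl FnMut(),
) {
    let mut s = state.load(Relaxed);

    // Only writers are waiting: clear the bit and wake one of them.
    if s == WRITERS_WAITING {
        match state.compare_exchange(s, 0, Relaxed, Relaxed) {
            Ok(_) => {
                wake_writer();
                return;
            }
            Err(current) => s = current,
        }
    }

    // Both bits are set: try a writer first instead of letting both race.
    if s == (READERS_WAITING | WRITERS_WAITING) {
        if state.compare_exchange(s, READERS_WAITING, Relaxed, Relaxed).is_err() {
            // Someone else changed the state; they are responsible for waking.
            return;
        }
        if wake_writer() {
            // A writer was woken; the readers stay asleep for now.
            return;
        }
        // No writer was actually sleeping, so fall through to the readers.
        s = READERS_WAITING;
    }

    // Only readers are waiting: clear the bit and wake them.
    if s == READERS_WAITING && state.compare_exchange(s, 0, Relaxed, Relaxed).is_ok() {
        wake_readers();
    }
}
```

The design point is that readers are only woken when the wake call reports that no writer was there to take the lock, which is why the wake operation needs to return a status.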

r? @Amanieu

m-ou-se · 2022-04-07T12:10:00Z

test process::tests::test_finish_twice has been running for over 60 seconds
test process::tests::test_interior_nul_in_env_key_is_error has been running for over 60 seconds
test process::tests::test_interior_nul_in_env_value_is_error has been running for over 60 seconds
test process::tests::test_override_env has been running for over 60 seconds
test sync::rwlock::tests::frob has been running for over 60 seconds

Looks like something is wrong. :)

m-ou-se · 2022-04-07T12:28:02Z

Ah, looks like I somehow forgot to ever set the READERS_WAITING bit. Well that's an easy fix. :)

m-ou-se · 2022-04-07T12:52:21Z

Going to do some more testing first. :)

Amanieu · 2022-04-07T14:52:01Z

library/std/src/sys/unix/locks/futex_rwlock.rs

+    #[inline]
+    pub unsafe fn try_write(&self) -> bool {
+        self.state
+            .fetch_update(Acquire, Relaxed, |s| (readers(s) == 0).then(|| s + WRITE_LOCKED))


I don't think this correctly handles the case where there is an existing write lock but no waiting threads or readers.

kprotty · 2022-04-07

The readers() function looks like it extracts the readers count, meaning the value doesn't consider existing waiting threads. Having a writer locked means the readers count would be the maximum value, so the .then() clause would short-circuit early and return None. The use of + over | is a bit misleading, but that just transitions from 0 readers to max readers (i.e. a writer).
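The behaviour discussed in this review comment can be checked with a small standalone sketch. The constants and the `readers()` body are assumptions based on the description above (a held write lock is encoded as the maximum reader count); the wrapper type `WriteSide` and its `with_state` and `raw` helpers are hypothetical and exist only to make the example testable.

```rust
use std::sync::atomic::{AtomicU32, Ordering::{Acquire, Relaxed}};

// Assumed encoding: low 30 bits hold the reader count, all ones = write locked.
const MASK: u32 = (1 << 30) - 1;
const WRITE_LOCKED: u32 = MASK;
const READERS_WAITING: u32 = 1 << 30;

/// Extracts the reader count, ignoring the waiting bits.
fn readers(s: u32) -> u32 {
    s & MASK
}

/// Hypothetical wrapper around the state word.
pub struct WriteSide {
    state: AtomicU32,
}

impl WriteSide {
    pub fn with_state(raw: u32) -> Self {
        Self { state: AtomicU32::new(raw) }
    }

    pub fn raw(&self) -> u32 {
        self.state.load(Relaxed)
    }

    /// Same shape as the snippet under review: succeeds only when the
    /// reader count is zero, which also excludes an existing write lock.
    pub fn try_write(&self) -> bool {
        self.state
            .fetch_update(Acquire, Relaxed, |s| (readers(s) == 0).then(|| s + WRITE_LOCKED))
            .is_ok()
    }
}
```

Under these assumptions, a second `try_write` on an already write-locked state sees `readers(s) == WRITE_LOCKED`, so the closure returns `None` and the call fails, while any waiting bits are preserved by the addition.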

Amanieu · 2022-04-07T14:52:14Z

library/std/src/sys/unix/locks/futex_rwlock.rs

+    #[inline]
+    pub unsafe fn try_read(&self) -> bool {
+        self.state
+            .fetch_update(Acquire, Relaxed, |s| read_lockable(s).then(|| s + READ_LOCKED))


What happens if the reader count overflows?

kprotty · 2022-04-07

read_lockable() checks for MAX_READERS, which looks like it is the writer value minus one. If the reader count would overflow into a writer, it looks like that would be detected by read_lockable() and short-circuit to None.
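The overflow question can be illustrated with a similar sketch. The body of `read_lockable()` below is a guess at what such a check could look like (reader count below `MAX_READERS` and nobody waiting), not the code from the PR; the constants, the wrapper type `ReadSide`, and its `with_state` and `raw` helpers are likewise assumptions.

```rust
use std::sync::atomic::{AtomicU32, Ordering::{Acquire, Relaxed}};

// Assumed encoding: low 30 bits hold the reader count, all ones = write locked.
const READ_LOCKED: u32 = 1;
const MASK: u32 = (1 << 30) - 1;
const WRITE_LOCKED: u32 = MASK;
const MAX_READERS: u32 = MASK - 1;
const READERS_WAITING: u32 = 1 << 30;
const WRITERS_WAITING: u32 = 1 << 31;

/// Assumed check: room for one more reader and no thread is waiting.
/// A write-locked state fails the first condition because MASK > MAX_READERS.
fn read_lockable(s: u32) -> bool {
    (s & MASK) < MAX_READERS && s & (READERS_WAITING | WRITERS_WAITING) == 0
}

/// Hypothetical wrapper around the state word.
pub struct ReadSide {
    state: AtomicU32,
}

impl ReadSide {
    pub fn with_state(raw: u32) -> Self {
        Self { state: AtomicU32::new(raw) }
    }

    pub fn raw(&self) -> u32 {
        self.state.load(Relaxed)
    }

    /// Same shape as the snippet under review.
    pub fn try_read(&self) -> bool {
        self.state
            .fetch_update(Acquire, Relaxed, |s| read_lockable(s).then(|| s + READ_LOCKED))
            .is_ok()
    }
}
```

With this check, the count can reach `MAX_READERS` but never `WRITE_LOCKED`, so a reader increment cannot turn the state into what looks like a write lock.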

m-ou-se · 2022-04-07T18:24:28Z

(Found a few more issues, so this is back to the drawing board for now. Will send a better tested PR soon. :) )

m-ou-se · 2022-04-08T12:10:41Z

Sorry for the noise :)

Here's the new PR, this time properly stress tested: #95801

Return status from futex_wake().

f1a4041

m-ou-se added the O-linux, A-concurrency, and T-libs labels Apr 7, 2022

rust-highfive assigned Amanieu Apr 7, 2022

rust-highfive added the S-waiting-on-review label Apr 7, 2022

m-ou-se mentioned this pull request Apr 7, 2022

Tracking issue for improving std::sync::{Mutex, RwLock, Condvar} #93740

Closed

63 tasks

m-ou-se added the S-waiting-on-author label and removed the S-waiting-on-review label Apr 7, 2022

m-ou-se marked this pull request as draft April 7, 2022 12:10

m-ou-se added 4 commits April 7, 2022 14:51

Add futex-based RwLock on Linux.

6b2344b

Add some comments to futex rwlock impl.

faa9279

Spin in futex rwlock.

de4a290

Don't make writers spin when #readers changes in futex RwLock.

b656db2

m-ou-se force-pushed the futex-rwlock branch from 3c528db to b656db2 April 7, 2022 12:51

m-ou-se closed this Apr 7, 2022

Amanieu reviewed Apr 7, 2022
