Upgrades to sync::rwlock - fix a race and improve performance, now with 100% less 'incoming' #7109

bblum · 2013-06-13T21:18:20Z

links to issues: #7065 the race that's fixed; #7066 the perf improvement I added. There are also some minor cleanup commits here.

To measure the performance improvement from replacing the exclusive with an atomic uint, I edited the msgsend-ring-rw-arcs bench test to do a write_downgrade instead of just a write, so that it stressed the code paths that accessed read_count. (At first I was still using write and saw no performance difference whatsoever, whoooops.)

The bench test measures how long it takes to send 1,000,000 messages by using rwarcs to emulate pipes. I also measured the performance difference imposed by the fix to the access_lock race (which involves taking an extra semaphore in the cond.wait() path). The net result is that fixing the race imposes a 4% to 5% slowdown, but doing the atomic uint optimization gives a 6% to 8% speedup.

Note that this speedup will be most visible in read- or downgrade-heavy workloads. If an RWARC's only users are writers, the optimization doesn't matter. All the same, I think this more than justifies the extra complexity I mentioned in #7066.

The raw numbers are:

with xadd read count
        before write_cond fix
                4.18 to 4.26 us/message
        with write_cond fix
                4.35 to 4.39 us/message

with exclusive read count
        before write_cond fix
                4.41 to 4.47 us/message
        with write_cond fix
                4.65 to 4.76 us/message

…ces)

…Fixes rust-lang#7065.

…usive, for performance. Close rust-lang#7066.

@brson

r? @brson links to issues: #7065 the race that's fixed; #7066 the perf improvement I added. There are also some minor cleanup commits here. To measure the performance improvement from replacing the exclusive with an atomic uint, I edited the ```msgsend-ring-rw-arcs``` bench test to do a ```write_downgrade``` instead of just a ```write```, so that it stressed the code paths that accessed ```read_count```. (At first I was still using ```write``` and saw no performance difference whatsoever, whoooops.) The bench test measures how long it takes to send 1,000,000 messages by using rwarcs to emulate pipes. I also measured the performance difference imposed by the fix to the ```access_lock``` race (which involves taking an extra semaphore in the ```cond.wait()``` path). The net result is that fixing the race imposes a 4% to 5% slowdown, but doing the atomic uint optimization gives a 6% to 8% speedup. Note that this speedup will be most visible in read- or downgrade-heavy workloads. If an RWARC's only users are writers, the optimization doesn't matter. All the same, I think this more than justifies the extra complexity I mentioned in #7066. The raw numbers are: ``` with xadd read count before write_cond fix 4.18 to 4.26 us/message with write_cond fix 4.35 to 4.39 us/message with exclusive read count before write_cond fix 4.41 to 4.47 us/message with write_cond fix 4.65 to 4.76 us/message ```

Ignore aarch64 for this test as it's x86 assembly only. Fixes rust-lang#7091 fixes rust-lang#7091 - asm_syntax lint test will not compile on aarch64 changelog: none

bblum added 7 commits June 12, 2013 20:53

make util::NonCopyable a unit struct instead of a struct with a unit

6b22c09

remove bitrotted cant_nest field from RWARC (the #[mutable] tag suffi…

d809f54

…ces)

Document unstable::atomics fetch_* return values

0ca2056

Thread order_lock through rwlock condvars for reacquiring access_lock. …

bd019c4

…Fixes rust-lang#7065.

Add a test case for rust-lang#7065.

68e8fe9

Change sync::RWlock implementation to use atomic uint instead of excl…

57cb44d

…usive, for performance. Close rust-lang#7066.

Improve comments in sync and arc a bit more.

2ef8774

bors closed this Jun 15, 2013

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Upgrades to sync::rwlock - fix a race and improve performance, now with 100% less 'incoming' #7109

Upgrades to sync::rwlock - fix a race and improve performance, now with 100% less 'incoming' #7109

bblum commented Jun 13, 2013

Upgrades to sync::rwlock - fix a race and improve performance, now with 100% less 'incoming' #7109

Upgrades to sync::rwlock - fix a race and improve performance, now with 100% less 'incoming' #7109

Conversation

bblum commented Jun 13, 2013