Check for lock existence before attempting tx execution. #6129

mystenmark · 2022-11-15T18:03:07Z

This fixes a race where this error could trigger:

https://github.com/MystenLabs/sui/blob/main/crates/sui-storage/src/lock_service.rs#L185-L188

The race, briefly explained:

1. Tx A writes object (O1, v1), then creates the lock for that object.
2. Tx B consumes (O1, v1) as an input.
3. The tx manager notices that the object exists before the lock has been written.
4. It attempts to execute tx B.
5. Tx B makes it through transaction_input_checker (which doesn't actually check for lock existence) and then produces this strange error.
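
The window described in the steps above can be illustrated with a toy model. This is only a sketch under stated assumptions: `ToyStore`, `ObjectRef`, and every method name here are hypothetical stand-ins, not the real sui-storage or lock service types. It shows a store where the object and its lock are written in two separate steps, and compares a readiness check that looks only at objects with one that also requires the lock.

```rust
use std::collections::{HashMap, HashSet};

/// Hypothetical stand-in for an object reference: (object id, version).
type ObjectRef = (u64, u64);

/// Toy store in which objects and their locks are written in two separate
/// steps. Illustrative only; not the real storage types.
#[derive(Default)]
struct ToyStore {
    objects: HashSet<ObjectRef>,
    /// `None` means the lock exists but no transaction holds it yet.
    locks: HashMap<ObjectRef, Option<u64>>,
}

impl ToyStore {
    /// Step 1 of committing tx A: the output object becomes visible.
    fn write_object(&mut self, obj: ObjectRef) {
        self.objects.insert(obj);
    }

    /// Step 2 of committing tx A: the (unset) lock is created.
    fn init_lock(&mut self, obj: ObjectRef) {
        self.locks.entry(obj).or_insert(None);
    }

    /// Readiness check that only looks at object existence.
    /// It reports "ready" inside the window between step 1 and step 2.
    fn ready_by_object_only(&self, inputs: &[ObjectRef]) -> bool {
        inputs.iter().all(|o| self.objects.contains(o))
    }

    /// Readiness check that requires both the object and its lock.
    fn ready_with_locks(&self, inputs: &[ObjectRef]) -> bool {
        inputs
            .iter()
            .all(|o| self.objects.contains(o) && self.locks.contains_key(o))
    }
}
```

If tx B's input is checked after `write_object` but before `init_lock`, the object-only check says the input is ready while the lock-aware check does not, which is the gap this PR closes.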

Also included are changes I used to repro the problem in the simulator.

mwtian

I'm still thinking about removing shared lock. Left some comments on other parts of the change.

mwtian · 2022-11-15T23:51:07Z

crates/sui-core/src/authority.rs

@@ -1024,6 +1029,9 @@ impl AuthorityState {
        let (gas_status, input_objects) =
            transaction_input_checker::check_certificate_input(&self.database, certificate).await?;

+        let owned_object_refs = input_objects.filter_owned_objects();
+        self.check_owned_locks(&owned_object_refs).await?;


Do transactions set None locks first, or write output objects first? TransactionManager publishes transactions once their input objects are available, but does not check the corresponding locks. It may be worth a comment on what is required for this check to succeed after TransactionManager.

We write objects first, then set None locks, which is the source of the race I am fixing here.

We don't strictly need this check here for transactions executed from TransactionManager, because it already verifies this now. But we could still hit the race from the authority server path. I hope to clean up and simplify a lot of this once we are able to remove the sequence tables.
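
For readers following the thread, here is a rough sketch of what an owned-lock check on the execution path could look like. It is an assumption-laden illustration, not the code in this PR: `InputKind`, `CheckError`, `filter_owned`, and this synchronous `check_owned_locks` are hypothetical, loosely named after `filter_owned_objects()` and `check_owned_locks()` in the diff above, and the real functions operate on different types and are async.

```rust
use std::collections::HashSet;

/// Hypothetical stand-in for an object reference: (object id, version).
type ObjectRef = (u64, u64);

/// Hypothetical input kinds; the real input object type is richer.
#[derive(Clone, Copy, Debug, PartialEq)]
enum InputKind {
    Owned(ObjectRef),
    Shared(u64),
}

#[derive(Debug, PartialEq)]
enum CheckError {
    /// Locks for these owned inputs have not been written yet.
    LocksNotYetWritten(Vec<ObjectRef>),
}

/// Keep only the owned inputs; shared inputs are not covered by this check.
fn filter_owned(inputs: &[InputKind]) -> Vec<ObjectRef> {
    inputs
        .iter()
        .filter_map(|input| match input {
            InputKind::Owned(obj_ref) => Some(*obj_ref),
            InputKind::Shared(_) => None,
        })
        .collect()
}

/// Fail early if any owned input has no lock entry yet, instead of
/// proceeding to execution and failing later with a confusing error.
fn check_owned_locks(
    existing_locks: &HashSet<ObjectRef>,
    owned: &[ObjectRef],
) -> Result<(), CheckError> {
    let missing: Vec<ObjectRef> = owned
        .iter()
        .copied()
        .filter(|obj_ref| !existing_locks.contains(obj_ref))
        .collect();
    if missing.is_empty() {
        Ok(())
    } else {
        Err(CheckError::LocksNotYetWritten(missing))
    }
}
```

In this sketch a caller on the authority server path would treat `LocksNotYetWritten` as "not ready yet" rather than as a permanent failure.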

mwtian · 2022-11-15T23:56:28Z

crates/sui-core/src/authority/authority_store.rs

                        missing.push(ObjectKey::from(objref));
                    }
                }
            };
        }

+        if !probe_lock_exists.is_empty() {
+            match self.lock_service.locks_exist(probe_lock_exists).await {


Why do we need to use lock service in get_missing_input_objects()? Can the validation in lock service be done on the missing objects after get_missing_input_objects()?

Because if the lock does not yet exist, the transaction can't be executed yet, so effectively it is the same as if the object didn't exist.

Can the validation in lock service be done on the missing objects after get_missing_input_objects()?

I don't think I understand this question

Basically, I want to keep the semantics of get_missing_input_objects() simple. Intuitively, missing locks are different from missing objects. If Sui is supposed to guarantee that a lock exists whenever its object exists, treating a nonexistent lock as a missing input object can complicate other parts of the system. For example, waiters on a missing input object are only notified when the object commits. Is there the same guarantee for locks?

I know we have talked about the order of writing locks vs. objects after a transaction finishes. If you think missing locks are expected behavior, and treating them as missing input objects is correct (waiters will eventually be notified), can you leave a detailed comment in the code?

The second question: to keep the logic simple, it seems better to call get_missing_input_objects() first, then check the objects that are not missing for locks.

I verified that object-exists notifications are sent after the locks are written. So I think what we need here is a comment explaining why locks may not exist yet, and that TransactionManager gets notified after the locks are written.

got it - left a comment!
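
The conclusion of this thread can be summarized in a small model. This is a sketch under assumptions, not the actual authority_store.rs code: `ToyAuthorityStore`, `missing_inputs`, and `commit_outputs` are hypothetical names. It treats an input whose lock is not yet written the same as a missing object, and it sends notifications only after both objects and locks are written, which is the ordering verified above.

```rust
use std::collections::HashSet;

/// Hypothetical stand-in for an object reference: (object id, version).
type ObjectRef = (u64, u64);

#[derive(Default)]
struct ToyAuthorityStore {
    objects: HashSet<ObjectRef>,
    locks: HashSet<ObjectRef>,
    /// Refs for which an "input available" notification has been sent.
    notified: Vec<ObjectRef>,
}

impl ToyAuthorityStore {
    /// Inputs that cannot be used yet: absent objects, plus objects whose
    /// lock has not been written. A transaction cannot execute in either
    /// case, so both are reported as missing.
    fn missing_inputs(&self, inputs: &[ObjectRef]) -> Vec<ObjectRef> {
        inputs
            .iter()
            .copied()
            .filter(|r| !self.objects.contains(r) || !self.locks.contains(r))
            .collect()
    }

    /// Commit outputs in the order discussed: objects, then locks, then
    /// notifications.
    fn commit_outputs(&mut self, outputs: &[ObjectRef]) {
        for r in outputs {
            self.objects.insert(*r);
        }
        for r in outputs {
            self.locks.insert(*r);
        }
        // Notify last, so a woken waiter always finds both object and lock.
        self.notified.extend_from_slice(outputs);
    }
}
```

Because the notification comes after the lock write, a waiter that was told an input is missing for lack of a lock is still woken once the input is fully usable.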

mwtian

This PR looks good overall, after test failures are fixed.

For setting and removing shared locks, I believe shared locks now must exist when trying to remove them, which is great. We can also add back the logic to skip writing shared locks when effects exist, in both functions that acquire shared locks.

mystenmark requested a review from mwtian November 15, 2022 18:03

mwtian reviewed Nov 16, 2022

mystenmark force-pushed the mlogan-sim-debug branch from 379afbe to 0003ad1 Compare November 17, 2022 05:14

mystenmark requested a review from mwtian November 17, 2022 05:14

mwtian approved these changes Nov 17, 2022

mystenmark force-pushed the mlogan-sim-debug branch from 0003ad1 to f6ecb68 Compare November 18, 2022 20:26

mystenmark added 8 commits November 21, 2022 20:43

Remove shared locks after execution

138fc93

Support for setting committee size and number of shared objects

b363afb

Verify locks exist before attempting to execute transactions.

b42d8d9

PR comments

5e4f012

fix rebase problems

12fd466

Add debug logs to TransactionManager

e8e6bed

reinitialize old locks when reverting state update

93a6e09

rebase fixes

3f12d8d

mystenmark force-pushed the mlogan-sim-debug branch from f6ecb68 to 3f12d8d Compare November 22, 2022 05:24

add comment

e140264

mystenmark enabled auto-merge (squash) November 22, 2022 05:32

mystenmark merged commit 616d228 into main Nov 22, 2022

mystenmark deleted the mlogan-sim-debug branch November 22, 2022 05:45

mwtian added a commit that referenced this pull request Nov 30, 2022

Revert "Check for lock existence before attempting tx execution. (#6129…

79100a5

…)" This reverts commit 616d228.
