Decide whether to lock function arguments at compile time #720

oremanj · 2024-09-16T20:57:53Z

An attempt at the static locking determination that I suggested in #695 (comment). Note this PR is against the free-threaded branch.

Behavior differences from the free-threaded branch without this change:

it is no longer possible to do nb::arg().lock(false) or .lock(runtime_determined_value); this could be re-added by restoring cast_flags::lock and checking the arg_flags at runtime, but I didn't think it was worth the complexity
we no longer prohibit locking self in __init__; changing this would also require restoring cast_flags::lock, and it's not clear what benefit it would have (sure the lock is somewhat superfluous but do we really care?)

wjakob · 2024-09-18T02:42:30Z

This is great! It is certainly better than my runtime solution. Two thoughts:

You had mentioned in the prior PR that it's unfortunate if nb::arg().lock requires the complex function dispatch loop. Does this adress that Issue? I am thinking that some runtime check in nb_func_new is still needed to discover that the simple dispatch base case still applies despite argument annotations being present.
All the duplication between arg, arg_v, arg_locked, and arg_locked_v seems a bit much with this further compile-time distinction. Would it be better to have single a template class?

template <bool IsLocked, bool HasValue> struct arg {
    const char *name = nullptr;
    PyObject *value = nullptr;
    // ...

   arg</* IsLocked = */ true, HasValue> lock() { return { name, value, ... }; }
   template <typename T>
   arg<IsLocked, /* HasValue = */ true> operator=(T&&x) {... }
};

All the code processing the various args could then match on the arguments.

wjakob · 2024-09-18T02:46:14Z

After thinking more about it, perhaps having non-template arg variants is also better. This class is used very often in bindings, and having a template here might affect compile time costs. I am not sure.

wjakob

OOps, I forgot to send off these comments.

include/nanobind/nb_func.h

oremanj · 2024-09-18T16:58:31Z

You had mentioned in the prior PR that it's unfortunate if nb::arg().lock requires the complex function dispatch loop. Does this adress that Issue? I am thinking that some runtime check in nb_func_new is still needed to discover that the simple dispatch base case still applies despite argument annotations being present.

Correct, that's separate. This PR makes that improvement possible, but doesn't implement it.

All the duplication between arg, arg_v, arg_locked, and arg_locked_v seems a bit much with this further compile-time distinction. Would it be better to have single a template class?

If we had another axis of variation I would definitely want to go for the template. I think the current case is borderline, and don't mind changing it if you prefer the template, but I think the duplication still winds up a little easier to understand. For backcompat we would need to keep the arg and arg_v names regardless, so the template would be something like arg_t with arg/arg_v as aliases for particular instantiations; then we'd need a metafunction to detect "any instantiation of arg_t", etc.

wjakob · 2024-09-18T23:20:18Z

Thanks a lot!

This commit refactors argument the locking locking so that it occurs at compile-time without imposing runtime overheads. The change applies to free-threaded extensions. Behavior differences compared to the prior approach: - it is no longer possible to do ``nb::arg().lock(false)`` or ``.lock(runtime_determined_value)`` - we no longer prohibit locking self in ``__init__``; changing this would also require restoring ``cast_flags::lock``, and it's not clear that the benefit outweighs the complexity.

Decide whether to lock function arguments at compile time

8155d6c

oremanj force-pushed the free-threaded branch from 70622d1 to 8155d6c Compare September 16, 2024 21:00

Clarify docs

168a10f

oremanj force-pushed the free-threaded branch from 70a9493 to 20ae577 Compare September 16, 2024 22:36

Attempt to workaround MSVC bug

0ad9042

oremanj force-pushed the free-threaded branch from 20ae577 to 0ad9042 Compare September 16, 2024 22:40

Less satisfying MSVC workaround

0b1feea

oremanj force-pushed the free-threaded branch from c3b0558 to 0b1feea Compare September 16, 2024 22:46

oremanj mentioned this pull request Sep 16, 2024

Support for free-threaded Python #695

Merged

6 tasks

wjakob reviewed Sep 18, 2024

View reviewed changes

include/nanobind/nb_func.h Outdated Show resolved Hide resolved

include/nanobind/nb_func.h Outdated Show resolved Hide resolved

include/nanobind/nb_func.h Outdated Show resolved Hide resolved

oremanj force-pushed the free-threaded branch from 82f76aa to 0bbb9dc Compare September 18, 2024 17:06

Code review

babe47d

oremanj force-pushed the free-threaded branch from 0bbb9dc to babe47d Compare September 18, 2024 17:39

wjakob merged commit 7ccb5e5 into wjakob:free-threaded Sep 18, 2024
26 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Decide whether to lock function arguments at compile time #720

Decide whether to lock function arguments at compile time #720

oremanj commented Sep 16, 2024 •

edited

Loading

wjakob commented Sep 18, 2024 •

edited

Loading

wjakob commented Sep 18, 2024

wjakob left a comment

oremanj commented Sep 18, 2024

wjakob commented Sep 18, 2024

Decide whether to lock function arguments at compile time #720

Decide whether to lock function arguments at compile time #720

Conversation

oremanj commented Sep 16, 2024 • edited Loading

wjakob commented Sep 18, 2024 • edited Loading

wjakob commented Sep 18, 2024

wjakob left a comment

Choose a reason for hiding this comment

oremanj commented Sep 18, 2024

wjakob commented Sep 18, 2024

oremanj commented Sep 16, 2024 •

edited

Loading

wjakob commented Sep 18, 2024 •

edited

Loading