RFC: result_ffi_guarantees #3391

Lokathor · 2023-02-21T15:55:16Z

text/0000-result_ffi_guarantees.md

clarfonthey · 2023-02-21T19:13:24Z

Is there any particular downside to just making this guarantee for all enums similar to Option and Result in layout, like Poll for instance?

Lokathor · 2023-02-21T19:15:11Z

Yes that was mentioned in the Future Work portion. Basically it can probably become a semver hazard sort of thing if it's automatically done for all enums and people aren't opting in to the behavior. Keeping the RFC scoped small makes it more likely to be accepted and completed on the types people use most often.

clarfonthey · 2023-02-21T20:01:33Z

Ahh, I see; I completely missed that part. I guess that what I'm failing to understand is how you can actually run into that kind of hazard unless you use something like #[non_exhaustive], since any changes to the type would be breaking anyway.

afetisov · 2023-02-28T13:56:11Z

The examples need a bit more diversity. They are all of the form Result<T, ()>, while the proposal also covers Result<(), E> or Result<T, ZST>.

This needs some more discussion on which exact zero-sized types should have layout optimization guarantees, because it could be a semver hazard, particularly in FFI. A type struct Error {} could have new fields added in the future. Perhaps limit only to #[repr(C)] ZSTs? Or to structs of the form struct NoParams;, where any field addition is obviously a breaking change?

I have doubts whether the layout guarantees would be useful in practice, though. The primary motivation in the RFC isn't memory usage but FFI ergonomics. With Option<&T>, for example, a raw pointer is already treated as "either nothing or a valid *mut T" in C function APIs, so the guarantee is reasonable. But with fallible APIs, the conventions are all over the place. Some functions return negative integers on error, some - positive ones, some do both of those, some even return 0, or return garbage and set ERRNO. In principle, you could cover those cases, e.g. Result<(), NonZeroI32> covers the "0 for success, nonzero for error" part. But in practice, you would expect specific error codes and not just arbitrary integers, so you'd have to do some error classification on the Rust side anyway.

This means that a separate ABI-stable struct as the return type, with functions success(self) -> T and error(self) -> Error would likely be a better choice. Once the Try-trait gets stable, you could also use all the usual ? sugar with those types.

nbdd0121 · 2023-02-28T15:29:27Z

The point about ZST is also brought up last week by @tmandry during lang team triage meeting (notes). We could use limit it to #[repr(transparent)] or #[repr(C)] ZSTs.

ChrisDenton · 2023-02-28T15:42:52Z

Tbh, I'm not particularly keen on having guarentees on repr(C) ZSTs. They are not a thing in ISO C and compiler extensions may (or may not) do different things with them which makes them potentially an FFI hazzard.

afetisov · 2023-02-28T15:47:26Z

@ChrisDenton Rust guarantees their specific behaviour, so this seems like a non-issue. On the FFI side none of those types would exist anyway, just like Result<_, _> doesn't. Though #[repr(C)] is likely indeed a poor choice for layout optimization guarantees, since one could just as easily add private fields as to a #[repr(Rust)] type. #[repr(transparent)] indeed looks like a better option.

Lokathor · 2023-02-28T15:48:57Z

repr(transparent) newtypes of () would be sufficient to allow for different errors, but a lot of people would probably ask why they have to put a "useless field" in their error type to make it work right.

afetisov · 2023-02-28T17:34:37Z

Why would you make it a newtype of ()? I'm thinking of the following definitions:

struct ZST;
#[repr(transparent)]
struct ZST();
#[repr(transparent)]
struct ZST {}
#[repr(transparent)]
struct ZST<T>(PhantomData<T>);

Lokathor · 2023-02-28T18:11:50Z

I had thought that a repr(transparent) needed to have a single non-zero sized field, but double checking using your example.. it seems that there can be one or zero non-zero sized fields.

So sure, repr(transparent) (or fieldless) seems a fine requirement on the ZST. One moment and I'll make an edit.

joshtriplett · 2023-02-28T19:45:12Z

@rfcbot merge

rfcbot · 2023-02-28T19:45:13Z

Team member @joshtriplett has proposed to merge this. The next step is review by the rest of the tagged team members:

Concerns:

~~needs-to-mention-non_exhaustive~~ resolved by RFC: result_ffi_guarantees #3391 (comment)

Once a majority of reviewers approve (and at most 2 approvals are outstanding), this will enter its final comment period. If you spot a major issue that hasn't been raised at any point in this process, please speak up!

cc @rust-lang/lang-advisors: FCP proposed for lang, please feel free to register concerns.
See this document for info about what commands tagged team members can give me.

scottmcm · 2023-02-28T19:51:28Z

Overall seems reasonable.

@rfcbot concern needs-to-mention-non_exhaustive

It's implied by the definition of non_exhaustive that a non-exhaustive ZST can't be guaranteed here, but since we've had so many issues about it (rust-lang/rust#78586), I'd like to see it called out explicitly in here to make sure we get tests about it.

scottmcm · 2023-02-28T19:53:09Z

it seems that there can be one or zero non-zero sized fields.

Zero fields was FCPed in rust-lang/rust#77841

CryZe · 2023-02-28T19:57:42Z

There's been some ideas of potentially modifying the calling convention to support Result more cheaply. Does this RFC interfere with that?

Lokathor · 2023-02-28T20:28:03Z

There's been some ideas of potentially modifying the calling convention to support Result more cheaply. Does this RFC interfere with that?

This RFC requires that the combined type be treated as the primitive type, so I guess "yes, it could interfere with that". However, since the primitive types are generally single-register (except and wide pointers and ints bigger than a machine register), this doesn't seem like a strong problem. Other forms of Result (eg: Result<(),()>) could still (in the future) be optimized differently (eg: have the function "return the value" via status flag being set or not).

Zero fields was FCPed in rust-lang/rust#77841

It seems that the reference didn't get an update.

concern needs-to-mention-non_exhaustive

Updated.

programmerjake · 2023-03-01T08:21:09Z

one thing i'd specifically want to reserve space for is representing Result errors with unwinding (distinct from panicking) when the user opts into it, probably through an attribute on the error type or on the function. the idea is this is more efficient for functions that very rarely fail, because the generated LLVM IR don't have to construct Result values or match on them.

this may need a specific carve-out exception in the defined ABI for Result, so we should carve it out now before it becomes a breaking change.

Lokathor · 2023-03-01T15:18:35Z

Hm, could you say more about what you'd expect to be changed within the RFC to have that carved out? Some mentioning in Future Possibilities? I'm completely unfamiliar with how unwinding interacts with the foreign ABI modes.

RalfJung · 2023-03-01T15:33:53Z

text/0000-result_ffi_guarantees.md

+* `Result<T, E>` where either `T` or `E`:
+  * Are a zero-sized type with alignment 1 (a "1-ZST").
+  * Either have no fields (eg: `()` or `struct Foo;`) or have `repr(transparent)` if there are fields.
+  * Do not have the `#[non_exhaustive]` attribute.


I assume the nested list of requirements is a conjunction, but that should be made explicit.

I am surprised to see repr(transparent) here. In the past we stated that a repr(transparent) in the standard library does not mean this type will not change in the future -- basically repr(transparent) is a private attribute that the library may use, but not clients of the library. If this is intended to deal with issues like repr(transparent) should not consider non_exhaustive types as 1-ZSTs outside their crate rust#78586, I would have expected "has no private field" (which is then trivially true for fieldless types)

yeah it's a "must meet all of these", i can clarify the wording.

I'd be fine changing to "must have no fields" if necessary. Since it's in FCP though maybe a T-lang member can make a clear call here before I touch that part?

The issue is that it's very easy to add a private field and accidentally break the guaranteed layout. Imho there should be some explicit way to opt into such optimizations for ZSTs which are guaranteed to remain ZST. I don't think #[repr(transparent)] is a good solution to the issue, but it's the closest approximation in current Rust, so it's at least something to consider.

Is the "private attribute" opinion documented somewhere? Afaik it isn't, and it's not uncommon to treat it as an API guarantee.

I asked Josh via Zulip and he said it would be fine to further restrict the rules here even while FCP is active.

I will make an update later today to simply require no fields at all (and no non_exhaustive). Then it will be the usual semver breaking change if a field is added. I believe that should satisfy things?

No. The thing that really worries me is that, being a guaranteed optimization, breaking the layout won't produce any errors. The optimization will just silently stop applying. This will lead to UB on the FFI side, and I don't see any reliable way to check for such breakage.

An upstream crate makes a semver breaking change by adding some inner field (possibly even zero-sized one, but perhaps not in a guaranteed way). A downstream crate which used the changed type in extern fn() -> Result<&T, ZST> now silently gets UB, and there is no way for them to avoid it other than tracking all changes in all dependencies.

Currently the solution would involve explicitly ABI-stable types, and conversions between them and native Rust types. Such conversions would either work correctly, fail to compile or fail at runtime (assuming they are properly written and verify their preconditions).

Also note that this isn't an issue for current types with guaranteed layout optimizations, since their number is very small and they are all provided by the language. There is only a finite number of non-generic NonZeroX types, and they are all guaranteed by the language. If user-defined niches become a thing, I would be just as worried.

For this reason I would prefer to have an explicit way to specify that the layout optimization must apply, rather than deduce it from implicit properties of code, which may be changed by the programmer without even considering that someone could rely on them.

If the crate you're getting a type from puts some attribute on their type, can't they just remove that attribute (presumably as a semver break) in a future version? Then isn't it just as bad for the downstream crates relying on it?

Honestly I don't think people should do this in the first place with any type that's not their own type or (). However, I don't want to disallow user ZSTs entirely because it makes for much better internal error handling compared to having () as the error type.

the point of having an attribute is that explicitly having to remove it makes people consider it a semver break since it's obvious, whereas having it completely implicit makes it much more likely people won't realize they need a semvar break to change their library type -- most people aren't going to think about Result's ABI when they change their library type.

An upstream crate makes a semver breaking change by adding some inner field (possibly even zero-sized one, but perhaps not in a guaranteed way). A downstream crate which used the changed type in extern fn() -> Result<&T, ZST> now silently gets UB, and there is no way for them to avoid it other than tracking all changes in all dependencies.

We do have improper_ctypes warning though.

improper_ctypes is far too noisy to be useful, so we shouldn't rely on a fairly substandard lint.

joshtriplett · 2023-03-01T15:56:20Z

one thing i'd specifically want to reserve space for is representing Result errors with unwinding (distinct from panicking) when the user opts into it,

This RFC covers the FFI case, which wouldn't use that optimisation. And if it did then the future RFC proposing that would have a lot of specifying to do, and I think that specifying belongs there rather than here.

programmerjake · 2023-03-01T19:19:44Z

Hm, could you say more about what you'd expect to be changed within the RFC to have that carved out? Some mentioning in Future Possibilities? I'm completely unfamiliar with how unwinding interacts with the foreign ABI modes.

I'm basically expecting that the list of requirements to have Result have a defined ABI would have an entry kinda like this:

Future RFCs may introduce additional requirements, e.g. some new 1-ZST error types may be marked such that they opt out of stable ABI for Result<T, MyErr> e.g. by representing the Err variant by unwinding instead of returning normally.

this way we tell users they can't just just check the current list of requirements for any arbitrary type and be guaranteed to always forevermore have stable ABI when those requirements are met, future RFCs may introduce exceptions for types using new features.

nikomatsakis · 2023-03-14T18:22:13Z

@rfcbot fcp reviewed

This makes sense to me.

rfcbot · 2023-03-14T18:22:18Z

🔔 This is now entering its final comment period, as per the review above. 🔔

riking · 2023-03-14T23:31:33Z

The primitive ZSTs include (), [T; 0], PhantomData<T>, and function pointer types

Slight correction: function item types. Function pointers have sizes.

rfcbot · 2023-03-24T18:25:12Z

The final comment period, with a disposition to merge, as per the review above, is now complete.

As the automated representative of the governance process, I would like to thank the author for their work and everyone else who contributed.

This will be merged soon.

dtolnay · 2023-04-05T20:09:17Z

This RFC was discussed in this week's lang team triage meeting. We (I) weren't sure whether the compiler already treats types like Result<NonZeroI32, ()> with the same ABI as C int32_t, vs with the same ABI as C struct { int32_t; }. (The distinction between these 2 in extern "C", unlike in extern "Rust", is why RFC 1758 repr(transparent) was needed.)

lokathor: does compiler already do this?

nikomatsakis: Pretty sure it does.

cramertj: is this insta-stable?

nikomatsakis: I think so, or else we can fcp merge the PR to the reference.

If Result<NonZeroI32, ()> already works like int32_t on all platforms then implementing this RFC is a matter of exempting the appropriate types from improper_ctypes / improper_ctypes_definitions, and testing; if it works like struct { int32_t; } there would need to be codegen changes.

As far as I could tell, the former is the case, at least in my usual suspect ABI (32-bit x86).

// cargo asm --target i686-unknown-linux-gnu

use std::num::NonZeroI32;

#[repr(C)]
pub struct Wrapper {
    pub primitive: i32,
}

pub extern "C" fn primitive() -> i32 {
    1
}

// different code than `primitive`
pub extern "C" fn wrapper() -> Wrapper {
    Wrapper { primitive: 1 }
}

// same code as `primitive`
pub extern "C" fn result_nonzero() -> Result<NonZeroI32, ()> {
    Ok(unsafe { NonZeroI32::new_unchecked(1) })
}

tmandry · 2023-04-18T19:48:30Z

The @rust-lang/lang team has decided to accept this RFC.

To track further discussion, subscribe to the tracking issue here:
rust-lang/rust#110503

Support Result<T, E> across FFI when niche optimization can be used Allow allow enums like `Result<T, E>` to be used across FFI if the T/E can be niche optimized and the non-niche-optimized type is FFI safe. Implementation of rust-lang/rfcs#3391 Tracking issue: rust-lang#110503 Additional ABI and codegen tests were added in rust-lang#115372

Support Result<T, E> across FFI when niche optimization can be used (v2) This PR is identical to rust-lang#122253, which was approved and merged but then removed from master by a force-push due to a [CI bug](https://rust-lang.zulipchat.com/#narrow/stream/242791-t-infra/topic/ci.20broken.3F). r? ghost Original PR description: --- Allow allow enums like `Result<T, E>` to be used across FFI if the T/E can be niche optimized and the non-niche-optimized type is FFI safe. Implementation of rust-lang/rfcs#3391 Tracking issue: rust-lang#110503 Additional ABI and codegen tests were added in rust-lang#115372

Support Result<T, E> across FFI when niche optimization can be used (v2) This PR is identical to #122253, which was approved and merged but then removed from master by a force-push due to a [CI bug](https://rust-lang.zulipchat.com/#narrow/stream/242791-t-infra/topic/ci.20broken.3F). r? ghost Original PR description: --- Allow allow enums like `Result<T, E>` to be used across FFI if the T/E can be niche optimized and the non-niche-optimized type is FFI safe. Implementation of rust-lang/rfcs#3391 Tracking issue: rust-lang/rust#110503 Additional ABI and codegen tests were added in rust-lang/rust#115372

rfc text

4b5a891

joshtriplett reviewed Feb 21, 2023

View reviewed changes

text/0000-result_ffi_guarantees.md Outdated Show resolved Hide resolved

joshtriplett added T-lang Relevant to the language team, which will review and decide on the RFC. I-lang-nominated Indicates that an issue has been nominated for prioritizing at the next lang team meeting. labels Feb 21, 2023

restrict the ZST rules, provide further explanation.

f647e37

rfcbot added proposed-final-comment-period Currently awaiting signoff of all team members in order to enter the final comment period. disposition-merge This RFC is in PFCP or FCP with a disposition to merge it. labels Feb 28, 2023

require that there NOT be the non_exhaustive attribute on the ZST

e729531

RalfJung reviewed Mar 1, 2023

View reviewed changes

clarify that the condition list is an "all" list.

2c258cc

rfcbot added final-comment-period Will be merged/postponed/closed in ~10 calendar days unless new substational objections are raised. and removed proposed-final-comment-period Currently awaiting signoff of all team members in order to enter the final comment period. labels Mar 14, 2023

missing comma

40b65f3

rfcbot added finished-final-comment-period The final comment period is finished for this RFC. to-announce and removed final-comment-period Will be merged/postponed/closed in ~10 calendar days unless new substational objections are raised. labels Mar 24, 2023

tmandry mentioned this pull request Apr 18, 2023

Tracking Issue for result_ffi_guarantees (RFC 3391) rust-lang/rust#110503

Closed

7 tasks

Add RFC number and tracking issue

7320c01

tmandry merged commit d1df23c into rust-lang:master Apr 18, 2023

Lokathor deleted the optional branch April 18, 2023 20:39

zacklukem mentioned this pull request Apr 28, 2023

Guarantee result nonzero for FFI rust-lang/rust#110968

Closed

MasterAwesome mentioned this pull request Mar 9, 2024

Support Result<T, E> across FFI when niche optimization can be used rust-lang/rust#122253

Merged

dtolnay mentioned this pull request May 5, 2024

Support Result<T, E> across FFI when niche optimization can be used (v2) rust-lang/rust#124747

Merged

RalfJung mentioned this pull request Aug 7, 2024

Can CFI be made compatible with type erasure schemes? rust-lang/rust#128728

Open

workingjubilee mentioned this pull request Sep 20, 2024

Finish stabilization of result_ffi_guarantees rust-lang/rust#130628

Merged

RFC: result_ffi_guarantees #3391

RFC: result_ffi_guarantees #3391

Conversation

Lokathor commented Feb 21, 2023 • edited by tmandry Loading

clarfonthey commented Feb 21, 2023

Lokathor commented Feb 21, 2023

clarfonthey commented Feb 21, 2023

afetisov commented Feb 28, 2023 • edited Loading

nbdd0121 commented Feb 28, 2023

ChrisDenton commented Feb 28, 2023

afetisov commented Feb 28, 2023

Lokathor commented Feb 28, 2023

afetisov commented Feb 28, 2023 • edited Loading

Lokathor commented Feb 28, 2023 • edited Loading

joshtriplett commented Feb 28, 2023

rfcbot commented Feb 28, 2023 • edited Loading

scottmcm commented Feb 28, 2023

scottmcm commented Feb 28, 2023

CryZe commented Feb 28, 2023 • edited Loading

Lokathor commented Feb 28, 2023

programmerjake commented Mar 1, 2023 • edited Loading

Lokathor commented Mar 1, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

joshtriplett commented Mar 1, 2023

programmerjake commented Mar 1, 2023

nikomatsakis commented Mar 14, 2023

rfcbot commented Mar 14, 2023

riking commented Mar 14, 2023

rfcbot commented Mar 24, 2023

dtolnay commented Apr 5, 2023

tmandry commented Apr 18, 2023

Lokathor commented Feb 21, 2023 •

edited by tmandry

Loading

afetisov commented Feb 28, 2023 •

edited

Loading

afetisov commented Feb 28, 2023 •

edited

Loading

Lokathor commented Feb 28, 2023 •

edited

Loading

rfcbot commented Feb 28, 2023 •

edited

Loading

CryZe commented Feb 28, 2023 •

edited

Loading

programmerjake commented Mar 1, 2023 •

edited

Loading