-
Notifications
You must be signed in to change notification settings - Fork 13k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error reporting from attribute macros regressed in 1.46.0 #76360
Comments
If I remember correctly, we should still be able to use |
Hey Cleanup Crew ICE-breakers! This bug has been identified as a good cc @AminArria @camelid @chrissimpkins @contrun @DutchGhost @elshize @ethanboxx @h-michael @HallerPatrick @hdhoang @hellow554 @imtsuki @kanru @KarlK90 @LeSeulArtichaut @MAdrianMattocks @matheus-consoli @mental32 @nmccarty @Noah-Kennedy @pard68 @PeytonT @pierreN @Redblueflame @RobbieClarken @RobertoSnap @robjtede @SarthakSingh31 @senden9 @shekohex @sinato @spastorino @turboladen @woshilapin @yerke |
Assigning |
This is caused by #73345 - since we no longer pass a This is a very interesting corner case - the I think there are two possible ways to fix this:
I'm leaning towards option 2 (assuming that it doesn't cause significant regressions in practice). Bang-proc-macros are the way to invoke a proc-macro on an arbitrary On the other hand, it's possible that option 2 could lead to a large number of spurious errors in practice - if the user introduces a syntax error when modifying a commonly-used struct (which has an attribute macro applied), removing this struct entirely could cause many spurious resolution errors. Invoking the attribute macro might lead to better (but still invalid) tokens being produced, leading to cleaner error messages. |
@ahl Thanks for providing such a nicer reproducer! |
@Aaron1011 thanks for the quick diagnosis. I agree with your assessment that both options have downsides (I think in your final paragraph you're describing option 2 rather than option 1 vis-a-vis removing the struct entirely). Option 1 seems like a bit of a mess; option 2 seems more "philosophically pure" while being a little less pragmatic. I agree that option 2 is probably preferable and that a crisp semantic like "only valid items will cause the invocation of attribute macros" is a very clear delineation between |
@Aaron1011 I agree with your assessment and think that the second option is the obviously correct choice for us. This might cause some weird errors for proc macros with pseudo DSLs that are no longer turned into valid code, but given that we are already recovering the parse most of the body will be treated as empty anyways, so it should be fine™. |
Could we also add something where the compiler emits one of its “this is a bug” messages if an error doesn’t show span information? Or is that not feasible? |
@camelid: Right now, we deliberately create these 'dummy spans' due to issue #43081. PR #76130 makes progress towards eliminating these cases, but there's still more work to be done. In the future, I think it would definitely make sense to emit a warning, and direct users to open a bug report. However, I don't think we'd want to emit an actual ICE - it would just create noise, since the query stack and call stack aren't useful for debugging missing spans. |
@Aaron1011 Okay, thanks! |
The pretty-print/reparse check has been removed on the latest Nighty, leading to a better error message:
However, we should still come to a decision about whether or not proc-macros should be invoked when we've managed to recover a syntactically invalid attribute target. |
From the discussion above, three people have already voiced in favor of option 2 but I don't think I understand what downside is being claimed for option 1:
"Parser recovery behavior is now part of the stable surface of the compiler" — I don't see how, since I thought error diagnostics get emitted whenever a recovery takes place. Rustc sees invalid syntax, thinks it knows roughly what you mean or what the surrounding code means, emits diagnostics to that effect, and continues on with a salvaged input. Anything later in the compilation process, including name resolution or attribute macros being run, is a trick for producing more meaningful error messages, because compilation of the enclosing crate is already guaranteed not to succeed due to the errors emitted at the parser recovery. Under option 1 how would anyone end up with a stable working crate that would be broken by future changes to parser recovery?
This is saying we run the risk of an unprincipled list of special cases on which contemporary Rust versions produce good diagnostics, as an argument to produce poor diagnostics consistently in all situations instead. It's hard for me to see this as an argument against option 1.
This is the clear and succinct argument in favor of option 1. |
I ended up here from #90256, in which I reported that the 1.51+ behavior of passing original syntactically invalid input into attribute macros after a parser recovery is a diagnostics regression from older compilers that would only pass syntactically valid (possibly recovered) input to macros. Doing recovery, emitting diagnostics about the recovery, and then running macros on the original unrecovered input means that macros must reimplement their own recovery independent and inconsistent with rustc — so rustc will diagnose what it thinks you meant, and the macro will diagnose what it thinks, and probably they won't align. The correct behavior in my opinion is for rustc to perform its high-effort syntax recovery and report diagnostics on it as already done, then pass the recovered (pretty printed if necessary) input for the attribute to proceed on. I believe this is consistent with the option 1 proposal. |
A lot of work has been put in to get rid of pretty-printing/reparsing, mainly because it requires throwing away hygiene information. While you're correct that this would only apply when compilation is guaranteed to fail, I think that spurious hygiene-related errors ( e.g. an 'not found' error that goes away when you fix a seemingly unrelated syntax error) would be extremely confusing. It might not be too difficult to perform some kind of 'token-based recovery' (deleting the tokens for the syntactically invalid node being recovered). I'll look into that. |
A token-based recovery / deleting tokens of the nearest invalid node sounds awesome! Thanks. |
For the sake of completeness - a macro could perform a side effect using the input token stream. However, I do t think that's a use case we want to support (and such a macro would already need to handle being invoked multiple times across different compilation sessions). |
This seems like a real problem. I'm asking @estebank and @compiler-errors to try to dedicate some brain cycles to looking over the suggestions here and trying to drive progress towards a solution. |
2023 Q1 P-high triage: Assigning to WG-diagnostics to drive aforementioned followup work. |
Do not provide suggestions when the spans come from expanded code that doesn't point at user code Hide invalid proc-macro suggestions and track spans coming from proc-macros pointing at attribute. Effectively, unless the proc-macro keeps user spans, suggestions will not be produced for the code they produce. r? `@ghost` Fix rust-lang#107113, fix rust-lang#107976, fix rust-lang#107977, fix rust-lang#108748, fix rust-lang#106720, fix rust-lang#90557. Could potentially address rust-lang#50141, rust-lang#67373, rust-lang#55146, rust-lang#78862, rust-lang#74043, rust-lang#88514, rust-lang#83320, rust-lang#91520, rust-lang#104071. CC rust-lang#50122, rust-lang#76360.
Do not provide suggestions when the spans come from expanded code that doesn't point at user code Hide invalid proc-macro suggestions and track spans coming from proc-macros pointing at attribute. Effectively, unless the proc-macro keeps user spans, suggestions will not be produced for the code they produce. r? ``@ghost`` Fix rust-lang#107113, fix rust-lang#107976, fix rust-lang#107977, fix rust-lang#108748, fix rust-lang#106720, fix rust-lang#90557. Could potentially address rust-lang#50141, rust-lang#67373, rust-lang#55146, rust-lang#78862, rust-lang#74043, rust-lang#88514, rust-lang#83320, rust-lang#91520, rust-lang#104071. CC rust-lang#50122, rust-lang#76360.
We take particular care to make sure that compilation errors associated with the items to which our attribute macros are applied are comprehensible. To that end we have tests to check the output of a variety of expected illegal programs to make sure developers could reasonably be expected to understand and correct those errors. In 1.46.0 we noticed a change in that behavior. While there were some beneficial changes to the way errors qualify structs, traits, etc. there seems to be a regression where 1.46.0 shows less information than in 1.45.0
I've put together a small repo that demonstrates the problem: https://github.com/ahl/span_regression
The first example is this macro:
Now consider this test program:
Under 1.45.0 one of the errors produced looks like this:
Under 1.46.0 that has changed to this:
Note that under 1.46.0 no code is underlined.
Now consider a slightly more complicated macro:
(Note I had been using
quote!
but wanted to make sure that wasn't causing the problem)With a similar example as above on 1.45.0:
With 1.46.0:
Rather than pointing to the offending code, the rustc error now points (unhelpfully) to the macro itself.
The general improvements made to error reporting in 1.46.0 (simpler naming, reduced duplicate errors) is greatly appreciated. This was one small regression I saw amongst the otherwise monotonic improvements.
The text was updated successfully, but these errors were encountered: