[red-knot] Infer boolean literal expression #12688

dhruvmanila · 2024-08-05T12:12:46Z

Summary

This PR implements type inference for boolean literal expressions.

Test Plan

Add test cases for True and False.

github-actions · 2024-08-05T12:26:11Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

AlexWaygood

Nice. These changes all look good to me.

I have a concern about union types, though. How do we want a union of boolean literals to be represented internally? For an object that could be the True constant or could be the False constant, should we infer Literal[True] | Literal[False] (a union of literals), or should we eagerly normalize it to <instance of bool> (since we know that the bool class is special: there will only be two possible instance of bool).

If the former, I think we will want to modify the Display implementation for DisplayUnionType so that Literal[True] | Literal[False] is simplified to Literal[True, False], the same as we do for numeric literals
If the latter, some larger changes will be required

Mypy appears never to normalize Literal[True, False] to bool; pyright appears to always do so. (Link, link.) I think I weakly favour pyright's approach here, but not sure.

A separate question (which I think can almost certainly be deferred for now) is that @carljm added some fancy logic for int literals in infer_binary_expression. We could also add some understanding of bool literals to that method, since True + True == True + 1 == 1 + 1 == 2

AlexWaygood · 2024-08-05T12:41:09Z

crates/red_knot_python_semantic/src/types.rs

@@ -175,6 +177,7 @@ impl<'db> Type<'db> {
                // TODO raise error
                Type::Unknown
            }
+            Type::BooleanLiteral(_) => Type::Unknown,


Here we'll want to fall back to looking up attributes and methods on the builtins.bool class in our vendored typeshed stubs. But I think we can defer that for this PR.

What's the meaning of "member" in this context? From what I understand, it's the type of an attribute on the given type. So, for instance, set.append would be a function type. Is this understanding correct?

Yes, your understanding is correct there

AlexWaygood

I actually think even the concern I have about union types can probably be deferred for now. So I'm happy with this landing as-is, as long as we keep track of these open questions somewhere.

dhruvmanila · 2024-08-05T14:23:40Z

Mypy appears never to normalize Literal[True, False] to bool; pyright appears to always do so. (Link, link.) I think I weakly favour pyright's approach here, but not sure.

What's the benefit of the one or the other? I think I'd prefer Pyright's approach as well because it can be resolved.

A separate question (which I think can almost certainly be deferred for now) is that @carljm added some fancy logic for int literals in infer_binary_expression. We could also add some understanding of bool literals to that method, since True + True == True + 1 == 1 + 1 == 2

Yes, I started playing around with binary expressions as well but thought to have it as it's own PR which would include all possible combinations.

AlexWaygood · 2024-08-05T14:35:52Z

What's the benefit of the one or the other? I think I'd prefer Pyright's approach as well because it can be resolved.

I think one "benefit" of mypy's approach is that you don't have to apply quite so much special casing to boolean literals specifically inside the type checker implementation when figuring out how unions of boolean literals resolve. Literal[True] | Literal[False] resolves to Literal[True, False] in exactly the same way that Literal[1] | Literal[2] resolves to Literal[1, 2] and Literal["foo"] | Literal[True] resolves to Literal["foo", True].

One case where normalizing bool back to Literal[True, False] will be necessary will be reachability analysis. In the following snippet, it's self evident that the third case branch is unreachable, and the type checker is not required to special-case boolean types in any way in order to understand this; it follows from its generalised understanding of Literal types:

from typing import Literal

def f(x: Literal[True, False]):
    match x:
        case True:
            ...
        case False:
            ...
        case unreachable:
            ...

However: we'll need to apply that special casing to bool anyway. We'll also need to be able to understand types annotated with bool as being functionally equivalent to Literal[True, False], or we won't understand the third branch in this function as being unreachable:

from typing import Literal

def f(x: bool):
    match x:
        case True:
            ...
        case False:
            ...
        case unreachable:
            ...

So leaving the union unnormalized as Literal[True, False] doesn't really get us out of having to special-case bool.

I also... wouldn't necessarily jump to the conclusion that this is a deliberate choice by mypy with an explicit motivation 😄 it's a very old type checker, and support for Literal types was added to mypy after mypy had already been around for many years. To me, pyright's approach also seems significantly better here.

carljm

Looks great, thank you!

Enums are a similar case where we'll need to understand sealed types (i.e. types with a finite number of inhabitants). I'm fine with delaying handling of the equivalence of bool and Literal[True, False] for now. I added #12694 to track this issue.

## Summary This PR implements type inference for boolean literal expressions. ## Test Plan Add test cases for `True` and `False`.

[red-knot] Infer boolean literal expression

a117086

dhruvmanila added the red-knot Multi-file analysis & type inference label Aug 5, 2024

dhruvmanila requested review from carljm, MichaReiser and AlexWaygood as code owners August 5, 2024 12:12

AlexWaygood reviewed Aug 5, 2024

View reviewed changes

AlexWaygood approved these changes Aug 5, 2024

View reviewed changes

carljm approved these changes Aug 5, 2024

View reviewed changes

carljm merged commit a8e2ba5 into main Aug 5, 2024
20 checks passed

carljm deleted the dhruv/infer-bool-literal branch August 5, 2024 18:30

dylwil3 pushed a commit to dylwil3/ruff that referenced this pull request Aug 7, 2024

[red-knot] Infer boolean literal expression (astral-sh#12688)

e309b25

## Summary This PR implements type inference for boolean literal expressions. ## Test Plan Add test cases for `True` and `False`.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[red-knot] Infer boolean literal expression #12688

[red-knot] Infer boolean literal expression #12688

dhruvmanila commented Aug 5, 2024

github-actions bot commented Aug 5, 2024

AlexWaygood left a comment •

edited

Loading

AlexWaygood Aug 5, 2024

dhruvmanila Aug 5, 2024

AlexWaygood Aug 5, 2024

AlexWaygood left a comment

dhruvmanila commented Aug 5, 2024 •

edited

Loading

AlexWaygood commented Aug 5, 2024

carljm left a comment •

edited

Loading

[red-knot] Infer boolean literal expression #12688

[red-knot] Infer boolean literal expression #12688

Conversation

dhruvmanila commented Aug 5, 2024

Summary

Test Plan

github-actions bot commented Aug 5, 2024

ruff-ecosystem results

Linter (stable)

Linter (preview)

AlexWaygood left a comment • edited Loading

Choose a reason for hiding this comment

AlexWaygood Aug 5, 2024

Choose a reason for hiding this comment

dhruvmanila Aug 5, 2024

Choose a reason for hiding this comment

AlexWaygood Aug 5, 2024

Choose a reason for hiding this comment

AlexWaygood left a comment

Choose a reason for hiding this comment

dhruvmanila commented Aug 5, 2024 • edited Loading

AlexWaygood commented Aug 5, 2024

carljm left a comment • edited Loading

Choose a reason for hiding this comment

`ruff-ecosystem` results

AlexWaygood left a comment •

edited

Loading

dhruvmanila commented Aug 5, 2024 •

edited

Loading

carljm left a comment •

edited

Loading