Inference discards bounds on abstract parameters #36454

timholy · 2020-06-27T12:58:40Z

Apologies, I'm sure this must have been reported before but a search didn't pick it up.

There appear to be cases where it would be "easy" to preserve bounds on abstract types. One that comes up a lot in my invalidation-squashing is Iterators.Stateful:

julia> code_typed(Iterators.Stateful, (AbstractString,))
1-element Array{Any,1}:
 CodeInfo(
1 ─ %1 = invoke Base.Iterators.approx_iter_type($(Expr(:static_parameter, 1))::Type{T} where T)::Type
│   %2 = Core.apply_type(Base.Iterators.Stateful, $(Expr(:static_parameter, 1)), %1)::Type{Base.Iterators.Stateful{_A,_B}} where _B where _A
│   %3 = Base.convert($(Expr(:static_parameter, 1)), itr)::Any
│   %4 = Core.fieldtype(%2, 2)::Type{var"#s428"} where var"#s428"<:(Union{Nothing, _B} where _B)
│   %5 = Base.Iterators.iterate(itr)::Any
│        Core.typeassert(%5, %1)::Any
│   %7 = Base.convert(%4, %5)::Any
│   %8 = %new(%2, %3, %7, 0)::Base.Iterators.Stateful{_A,_B} where _B where _A
└──      return %8
) => Base.Iterators.Stateful{_A,_B} where _B where _A

Given the definition

julia/base/iterators.jl

Lines 1243 to 1246 in d762e8c

    
           @inline function Stateful(itr::T) where {T} 
        
               VS = approx_iter_type(T) 
        
               return new{T, VS}(itr, iterate(itr)::VS, 0) 
        
           end

it would naively seem fairly straightforward to retain _A<:AbstractString.

The text was updated successfully, but these errors were encountered:

martinholters · 2020-06-29T08:06:51Z

The code to fix this is all there it seems, just disabled:

julia/base/compiler/tfuncs.jl

Lines 1100 to 1123 in 6185d24

    
           # These blocks improve type info but make compilation a bit slower. 
        
           # XXX 
        
           #unw = unwrap_unionall(ai) 
        
           #isT = isType(unw) 
        
           #if isT && isa(ai,UnionAll) && contains_is(outervars, ai.var) 
        
           #    ai = rename_unionall(ai) 
        
           #    unw = unwrap_unionall(ai) 
        
           #end 
        
           if istuple 
        
               if i == largs 
        
                   push!(tparams, Vararg) 
        
               # XXX 
        
               #elseif isT 
        
               #    push!(tparams, rewrap_unionall(unw.parameters[1], ai)) 
        
               else 
        
                   push!(tparams, Any) 
        
               end 
        
           # XXX 
        
           #elseif isT 
        
           #    push!(tparams, unw.parameters[1]) 
        
           #    while isa(ai, UnionAll) 
        
           #        push!(outervars, ai.var) 
        
           #        ai = ai.body 
        
           #    end

If this came up in your invalidation hunt, maybe it's time to reconsider this?

timholy · 2020-06-29T12:26:23Z

Nice find. It's a difficult design decision; on one hand, doing more accurate inference makes inference slower. On the other hand, doing more accurate inference reduces the "vulnerability" of code to invalidation because fewer MethodInstances end up having mt_backedges to Any-instances of their dependent methods. Thereby, investing in better inference at the outset might save inference in the long run.

It's quite difficult to come up with reasonable benchmarks for these things, since it depends entirely on what new methods you define. Therefore what works in practice is a bit of a cultural issue of which packages often get used in which combinations.

JuliaLang/julia#36280 introduced the ability to pre-allocate the container used to track values of `f.(itr)` in `unique(f, itr)`. Particularly for containers with `Union` elements, this circumvents significant inference problems. Related: JuliaLang/julia#36454

`cat` is often called with Varargs or heterogenous inputs, and inference almost always fails. Even when all the arrays are of the same type, if the number of varargs isn't known inference typically fails. The culprit is probably #36454. This reduces the number of failures considerably, by avoiding creation of vararg length tuples in the shape-inference pipeline.

`cat` is often called with Varargs or heterogenous inputs, and inference almost always fails. Even when all the arrays are of the same type, if the number of varargs isn't known inference typically fails. The culprit is probably #36454. This reduces the number of failures considerably, by avoiding creation of vararg length tuples in the shape-inference pipeline. (cherry picked from commit 815076b)

Inference loses track of `Tag` due to JuliaLang/julia#36454

`cat` is often called with Varargs or heterogenous inputs, and inference almost always fails. Even when all the arrays are of the same type, if the number of varargs isn't known inference typically fails. The culprit is probably JuliaLang#36454. This reduces the number of failures considerably, by avoiding creation of vararg length tuples in the shape-inference pipeline.

`cat` is often called with Varargs or heterogenous inputs, and inference almost always fails. Even when all the arrays are of the same type, if the number of varargs isn't known inference typically fails. The culprit is probably #36454. This reduces the number of failures considerably, by avoiding creation of vararg length tuples in the shape-inference pipeline. (cherry picked from commit 815076b)

ChrisRackauckas · 2022-12-28T16:11:30Z

Bump this up to triage to reconsider given the many changes to precompilation and the increased cost of invalidations?

timholy · 2022-12-28T17:20:41Z

I think that's a good idea, but probably first someone should collect some data to bring to the discussion.

gbaraldi · 2023-01-05T19:58:56Z

Triage thinks that, as Tim said, this probably needs more data for a more informed decision.

vtjnash · 2024-08-07T15:09:23Z

code is enabled now

julia> code_typed(Iterators.Stateful, (AbstractString,))

1-element Vector{Any}:
 CodeInfo(
    @ iterators.jl:1452 within `Stateful`
1 ─ %1  = $(Expr(:static_parameter, 1))::Type{T} where T<:AbstractString
│   %2  = invoke Base.Iterators.approx_iter_type(%1::Type)::Type
│   @ iterators.jl:1453 within `Stateful`
│   %3  = $(Expr(:static_parameter, 1))::Type{T} where T<:AbstractString
│   %4  = Core.apply_type(Base.Iterators.Stateful, %3, %2)::Type{Base.Iterators.Stateful{T, T1}} where {T<:AbstractString, T1}
│   %5  = $(Expr(:static_parameter, 1))::Type{T} where T<:AbstractString
│   %6  = (itr isa %5)::Bool
└──       goto #3 if not %6
2 ─       goto #4
3 ─ %9  = $(Expr(:static_parameter, 1))::Type{T} where T<:AbstractString
└── %10 = Base.convert(%9, itr)::Any
4 ┄ %11 = φ (#2 => itr, #3 => %10)::Any
│   %12 = Core.fieldtype(%4, 2)::Type{<:Union{Nothing, T} where T}
│   %13 = Base.Iterators.iterate(itr)::Any
│         Core.typeassert(%13, %2)::Any
│   %15 = (%13 isa %12)::Bool
└──       goto #6 if not %15
5 ─       goto #7
6 ─ %18 = Base.convert(%12, %13)::Any
7 ┄ %19 = φ (#5 => %13, #6 => %18)::Any
│   %20 = %new(%4, %11, %19)::Base.Iterators.Stateful{T} where T<:AbstractString
└──       return %20
) => Base.Iterators.Stateful{T} where T<:AbstractString

timholy added the compiler:inference Type inference label Jun 27, 2020

timholy mentioned this issue Jul 7, 2020

Fix invalidations from loading OrderedCollections JuliaLang/Pkg.jl#1897

Merged

This was referenced Aug 17, 2020

Use Stateful in String code #25736

Merged

Redesign the HashType handling for better inference JuliaLang/Pkg.jl#1969

Merged

timholy mentioned this issue Nov 30, 2020

improve inferrability of peek(::Stateful, sentinel) #38625

Merged

timholy mentioned this issue Jan 6, 2021

SnoopCompile tricks tlienart/Franklin.jl#752

Open

timholy mentioned this issue Jan 17, 2021

Improve inferability of shape::Dims for cat #39294

Merged

timholy added a commit to timholy/TiffImages.jl that referenced this issue Feb 25, 2021

Improve inference of tag iteration

b082900

Inference loses track of `Tag` due to JuliaLang/julia#36454

timholy mentioned this issue Aug 28, 2021

Eliminating weird interactions with Julia's compiler JuliaGraphics/ColorTypes.jl#266

Open

timholy mentioned this issue Aug 31, 2022

fix some invalidations when loading Static.jl #46553

Merged

oscardssmith added the status:triage This should be discussed on a triage call label Dec 28, 2022

ChrisRackauckas mentioned this issue Dec 28, 2022

Options for separate optimization and inference accuracy level during precompilation #48021

Closed

gbaraldi removed the status:triage This should be discussed on a triage call label Jan 5, 2023

vtjnash closed this as completed Aug 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inference discards bounds on abstract parameters #36454

Inference discards bounds on abstract parameters #36454

timholy commented Jun 27, 2020

martinholters commented Jun 29, 2020

timholy commented Jun 29, 2020

ChrisRackauckas commented Dec 28, 2022

timholy commented Dec 28, 2022

gbaraldi commented Jan 5, 2023

vtjnash commented Aug 7, 2024

Inference discards bounds on abstract parameters #36454

Inference discards bounds on abstract parameters #36454

Comments

timholy commented Jun 27, 2020

martinholters commented Jun 29, 2020

timholy commented Jun 29, 2020

ChrisRackauckas commented Dec 28, 2022

timholy commented Dec 28, 2022

gbaraldi commented Jan 5, 2023

vtjnash commented Aug 7, 2024