Preserve shape when collecting broadcasted objects #44061

BSnelling · 2022-02-07T12:49:44Z

I first implemented the fix proposed on the issue but as expected this was ambiguous. I'm not sure if my proposal is as general as the initial proposal but it is not ambiguous and results in desired behaviour in a test.

(Replacing #44039)

N5N3 · 2022-02-08T14:52:08Z

For me collect is not the best way to test Broadcasted's shape during iteration.
As copy(bc) should be the officical way to "collect" a Broadcasted.
Something like collect(Iterators.product(bc, bc)) make more sense.

BTW, not all AbstractArrayStyle track bc's dimensionality. (e.g. Broadcast.ArrayStyle)
I guess we will got a error for these kind style when testing collect(Iterators.product(bc, bc))

vtjnash

I can't think of any reason not to do this. @mbauman you have any objects?

base/broadcast.jl

DilumAluthge · 2022-02-17T03:58:15Z

Removing the merge me label until:

@N5N3 finishes their review
@mbauman weighs in

BSnelling · 2022-02-17T12:39:37Z

Thank you @N5N3, you suggestion seems to work great!

Something to note, this does broaden the definition of IteratorSize that was here originally:

Base.IteratorSize(::Type{<:Broadcasted{<:Any,<:NTuple{N,Base.OneTo}}}) where {N} = Base.HasShape{N}()

effectively becomes

Base.IteratorSize(::Type{<:Broadcasted{<:Any,<:NTuple{N,Any}}}) where {N} = Base.HasShape{N}()

I don't believe it's a problem but thought it was worth noting.

N5N3 · 2022-02-18T01:49:21Z

I tested locally with nest Broadcast{<:AbstractArrayStyle{Any}}, e.g.

bc = Base.broadcasted(+, AD1(randn(3)), AD1(randn(3)));
bc = Base.broadcasted(+, bc , bc);
bc = Base.broadcasted(+, bc , bc);
@inferred(Base.IteratorSize(bc)) # error on 9c82a3a

The easist solution is adding Base.@pure.

Base.@pure _maxndims(T) = mapfoldl(_ndims, max, fieldtypes(T)) # _fieldtypes is unneeded anymore

I'm not sure is @pure OK here. As we'd better avoid using it whenever possible.
But the recursiveness seems unavoidable.

Also for consistency, it would be good to add Base.ndims(bc::Broadcasted) = ndims(typeof(bc)) as ndims should be defined for all Type{<:Broadcast} after 9c82a3a.

vtjnash · 2022-02-22T17:46:06Z

base/broadcast.jl

+    N isa Integer && return N
+    _maxndims(fieldtype(BC, 2))
+end
+Base.@pure _maxndims(T) = mapfoldl(_ndims, max, fieldtypes(T))


See Iterators.zip_iteratorsize for and example of how this is normally implemented

looks like nested zip also has similar problem? on master:

julia> a = Iterators.zip(1:10,1:10) zip(1:10, 1:10) julia> b = zip(a, a); julia> c = zip(b, b); julia> @code_warntype Base.IteratorSize(c) MethodInstance for Base.IteratorSize(::Base.Iterators.Zip{Tuple{Base.Iterators.Zip{Tuple{Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}, Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}}}, Base.Iterators.Zip{Tuple{Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}, Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}}}}}) from Base.IteratorSize(x) in Base at generator.jl:92 Arguments #self#::Core.Const(Base.IteratorSize) x::Base.Iterators.Zip{Tuple{Base.Iterators.Zip{Tuple{Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}, Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}}}, Base.Iterators.Zip{Tuple{Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}, Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}}}}} Body::Any 1 ─ %1 = Base.typeof(x)::Core.Const(Base.Iterators.Zip{Tuple{Base.Iterators.Zip{Tuple{Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}, Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}}}, Base.Iterators.Zip{Tuple{Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}, Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}}}}}) │ %2 = Base.IteratorSize(%1)::Any └── return %2 julia> @code_warntype Base.IteratorSize(b) MethodInstance for Base.IteratorSize(::Base.Iterators.Zip{Tuple{Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}, Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}}}) from Base.IteratorSize(x) in Base at generator.jl:92 Arguments #self#::Core.Const(Base.IteratorSize) x::Base.Iterators.Zip{Tuple{Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}, Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}}} Body::Base.HasShape{1} 1 ─ %1 = Base.typeof(x)::Core.Const(Base.Iterators.Zip{Tuple{Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}, Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}}}) │ %2 = Base.IteratorSize(%1)::Core.Const(Base.HasShape{1}()) └── return %2 julia> @code_warntype Base.IteratorSize(a) MethodInstance for Base.IteratorSize(::Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}) from Base.IteratorSize(x) in Base at generator.jl:92 Arguments #self#::Core.Const(Base.IteratorSize) x::Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}} Body::Base.HasShape{1} 1 ─ %1 = Base.typeof(x)::Core.Const(Base.Iterators.Zip{Tuple{UnitRange{Int64}, UnitRange{Int64}}}) │ %2 = Base.IteratorSize(%1)::Core.Const(Base.HasShape{1}()) └── return %2

I've tried implementing this as in Iterators.zip_iteratorsize but can't get nested broadcasts to pass the @inferred test. Is there a reason @pure shouldn't be used here @vtjnash ? I'm not familiar with when it's safe or not.

@pure should not be used. It is never safe.

I've removed @pure and instead defined methods of _maxndims for small tuples which has helped the inference on nested broadcasts.

The inference through nested broadcasts won't work for more complex cases e.g. where the original broadcasted and the nested broadcast have >2 args. For example a test like this would fail:

bc = Base.broadcasted(+, AD1(randn(3)), AD1(randn(3)), AD1(randn(3))) bc_nest = Base.broadcasted(+, bc , bc, bc) @test @inferred(Base.IteratorSize(bc_nest)) === Base.HasShape{1}()

My thinking was that perhaps we could live with more complex cases like this being uninferrable, so long as the simpler cases can be inferred.

oxinabox · 2022-05-18T19:35:05Z

bumping this.

mbauman

Hey @BSnelling — I'm so sorry this languished. I think this is now a good workaround and can be merged exactly as it stands.

BUT: I think we can go one better by greedily doing a Broadcast.instantiate on user-constructed broadcasts. That'll take some thinking — it's not done for performance to prevent recursively spending effort constructing axes on inner (fused) broadcasts — but I think when a user constructs a broadcast manually they'll want the axes constructed. That'll take some more doing as it means splitting the internal API from the external one.

So in the meantime (and to ensure this works now), let's re-run CI here (it's been a few months) and get this in.

N5N3 · 2022-05-27T04:34:09Z

base/broadcast.jl

-Base.IteratorSize(::Type{<:Broadcasted{<:Any,<:NTuple{N,Base.OneTo}}}) where {N} = Base.HasShape{N}()
+Base.IteratorSize(::Type{T}) where {T<:Broadcasted} = Base.HasShape{ndims(T)}()
+Base.ndims(BC::Type{<:Broadcasted{<:Any,Nothing}}) = _maxndims(fieldtype(BC, 2))
+Base.ndims(::Type{<:Broadcasted{<:AbstractArrayStyle{N},Nothing}}) where {N<:Integer} = N


Looks like this line will never be hitted.
So even AbstractArrayStyle with dimension tracking now use the general fallback above.
Thus we have

julia> Base.broadcasted(randn,) |> collect ERROR: MethodError: reducing over an empty collection is not allowed; consider supplying `init` to the reducer

The `N<:Integer` constraint was nonsensical, given that `(N === Any) || (N isa Int)`. N5N3 noticed this back in 2022: JuliaLang#44061 (comment) Follow up on JuliaLang#44061. Also xref JuliaLang#45477.

The `N<:Integer` constraint was nonsensical, given that `(N === Any) || (N isa Int)`. N5N3 noticed this back in 2022: #44061 (comment) Follow up on #44061. Also xref #45477.

The `N<:Integer` constraint was nonsensical, given that `(N === Any) || (N isa Int)`. N5N3 noticed this back in 2022: #44061 (comment) Follow up on #44061. Also xref #45477. (cherry picked from commit d3964b6)

…aLang#56999) The `N<:Integer` constraint was nonsensical, given that `(N === Any) || (N isa Int)`. N5N3 noticed this back in 2022: JuliaLang#44061 (comment) Follow up on JuliaLang#44061. Also xref JuliaLang#45477. (cherry picked from commit d3964b6)

BSnelling mentioned this pull request Feb 7, 2022

Preserve shape when collecting broadcasted objects #44039

Closed

BSnelling force-pushed the bes/collect_broadcasted_2 branch from 95a5deb to faa8586 Compare February 8, 2022 14:20

vtjnash requested a review from mbauman February 10, 2022 21:27

vtjnash added the merge me PR is reviewed. Merge when all tests are passing label Feb 14, 2022

vtjnash approved these changes Feb 14, 2022

View reviewed changes

N5N3 reviewed Feb 15, 2022

View reviewed changes

base/broadcast.jl Outdated Show resolved Hide resolved

DilumAluthge removed the merge me PR is reviewed. Merge when all tests are passing label Feb 17, 2022

vtjnash reviewed Feb 22, 2022

View reviewed changes

vtjnash requested review from vtjnash and removed request for vtjnash February 22, 2022 17:46

mbauman approved these changes May 26, 2022

View reviewed changes

BSnelling added 6 commits May 26, 2022 13:14

define IteratorSize for array style broadcasted

a5575a0

test collected broadcasted objects retain their shape

e0511da

IteratorSize for ArrayStyle broadcasts that don't propagate dims

f0049ba

Generalise IteratorSize definition for broadcasted

81efab9

support itertor size for nested broadcasts using @pure

1b6ffda

define _maxndims methods for small tuples to help inference

70fc3cd

mbauman force-pushed the bes/collect_broadcasted_2 branch from 6a90f31 to 70fc3cd Compare May 26, 2022 17:14

mbauman added the merge me PR is reviewed. Merge when all tests are passing label May 26, 2022

vtjnash merged commit 938da26 into JuliaLang:master May 27, 2022

N5N3 reviewed May 27, 2022

View reviewed changes

N5N3 mentioned this pull request May 27, 2022

Fix ndims for Broadcasted with no args. #45477

Open

oxinabox mentioned this pull request Jun 6, 2022

collect doesn't preserve shape on Broadcased objects #43847

Closed

giordano removed the merge me PR is reviewed. Merge when all tests are passing label Jun 10, 2022

nsajko mentioned this pull request Jan 8, 2025

broadcast: align ndims implementation with intent behind code #56999

Merged

nsajko added the broadcast Applying a function over a collection label Jan 8, 2025

nsajko mentioned this pull request Feb 9, 2025

broadcast: align ndims implementation with intent behind code (#56999) #57322

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Preserve shape when collecting broadcasted objects #44061

Preserve shape when collecting broadcasted objects #44061

BSnelling commented Feb 7, 2022

N5N3 commented Feb 8, 2022

vtjnash left a comment

DilumAluthge commented Feb 17, 2022

BSnelling commented Feb 17, 2022 •

edited

Loading

N5N3 commented Feb 18, 2022 •

edited

Loading

vtjnash Feb 22, 2022

N5N3 Feb 23, 2022 •

edited

Loading

BSnelling Feb 25, 2022

vtjnash Feb 25, 2022

BSnelling Mar 4, 2022

oxinabox commented May 18, 2022

mbauman left a comment

N5N3 May 27, 2022 •

edited

Loading

Preserve shape when collecting broadcasted objects #44061

Preserve shape when collecting broadcasted objects #44061

Conversation

BSnelling commented Feb 7, 2022

N5N3 commented Feb 8, 2022

vtjnash left a comment

Choose a reason for hiding this comment

DilumAluthge commented Feb 17, 2022

BSnelling commented Feb 17, 2022 • edited Loading

N5N3 commented Feb 18, 2022 • edited Loading

vtjnash Feb 22, 2022

Choose a reason for hiding this comment

N5N3 Feb 23, 2022 • edited Loading

Choose a reason for hiding this comment

BSnelling Feb 25, 2022

Choose a reason for hiding this comment

vtjnash Feb 25, 2022

Choose a reason for hiding this comment

BSnelling Mar 4, 2022

Choose a reason for hiding this comment

oxinabox commented May 18, 2022

mbauman left a comment

Choose a reason for hiding this comment

N5N3 May 27, 2022 • edited Loading

Choose a reason for hiding this comment

BSnelling commented Feb 17, 2022 •

edited

Loading

N5N3 commented Feb 18, 2022 •

edited

Loading

N5N3 Feb 23, 2022 •

edited

Loading

N5N3 May 27, 2022 •

edited

Loading