Implement sum #446

max-vassili3v · 2024-07-08T22:56:49Z

I struggled to find an elegant solution involving selecting relevant elements from B.data in the case where B is created by brand() with certain parameters (e.g very non square matrices, more bands than those that fit in the matrix). I decided to go with this solution that involves populating a data matrix only using relevant information accessed by B[band(i)]. Please let me know any improvements.

codecov · 2024-07-08T23:07:18Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 89.75%. Comparing base (47c15ab) to head (5ec4b3c).

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #446      +/-   ##
==========================================
+ Coverage   89.61%   89.75%   +0.14%     
==========================================
  Files          25       25              
  Lines        3571     3622      +51     
==========================================
+ Hits         3200     3251      +51     
  Misses        371      371

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

dlfivefifty · 2024-07-09T08:04:28Z

Can you make sure your unit tests are being run? I think just add include("test_sum.jl") to runtests.jl

src/banded/BandedMatrix.jl

max-vassili3v · 2024-07-09T14:56:46Z

added changes

src/banded/BandedMatrix.jl

dlfivefifty · 2024-07-10T13:47:34Z

The test failure is very surprising. It's possible its calling sum and this PR has changed the order of operations.

We should make sure we are consistent with the order. Can you add some tests that sum(A) == sum(Matrix(A)) for A a banded matrix?

max-vassili3v · 2024-07-10T13:57:41Z

I noticed earlier with == that there is some floating point error and so the test fails.

max-vassili3v · 2024-07-10T13:59:09Z

I thought it was to be expected but this could be the problem

dlfivefifty · 2024-07-10T14:00:42Z

Right, this floating point error is because you are computing the sums in a different order. This is unnecessary so we can change the implementation to make sure we do things in the right order. Eg:

julia> A = randn(5,5)
5×5 Matrix{Float64}:
 -0.574303  -0.909723    0.589035   0.125461  -0.85839
  2.36645   -2.01842     0.305596   0.739664   0.281112
 -0.449434   1.65078     0.293241  -0.12409    0.535829
  0.388728   1.3232     -1.61161   -0.54598   -0.237829
 -0.570773  -0.0989053  -0.515742   0.116799  -2.14109

julia> sum(A) == sum(vec(A)) # sum should traverse column-by-coumn
true

julia> sum(A) ≠ sum(vec(A')) # it doesn't match row-by-row 
true

src/banded/BandedMatrix.jl

dlfivefifty · 2024-07-10T14:03:02Z

I thought it was to be expected but this could be the problem

It is expected when the order of the operations change. But all things being equal we should avoid it.

Also, traversing column-by-column will be much faster than row-by-row since it accesses memory in order.

src/banded/BandedMatrix.jl

max-vassili3v · 2024-07-10T14:44:13Z

I've changed the traversal order but I still get floating point error on the tests without dims and dims = 1. It's a different type, but I've also noticed that sum(vec(A)) == sum(Matrix(A)) for BandedMatrix A returns false

dlfivefifty · 2024-07-10T14:47:40Z

Can you push your changes? Note I made a suggestion that fixes the order for the no-dims case

DanielVandH · 2024-07-14T22:26:42Z

It's a different type, but I've also noticed that sum(vec(A)) == sum(Matrix(A)) for BandedMatrix A returns false

Wouldn't a better check for this probably be that sum(vec(A)) == foldl(+, Matrix(A))?

This is a similar problem in e.g. SparseArrays where A = sprand(1000, 1000, 0.001); sum(vec(A)) == sum(Matrix(A)) returns false but A = sprand(1000, 1000, 0.001); sum(vec(A)) == foldl(+, Matrix(A)) is true

dlfivefifty · 2024-07-14T22:46:00Z

Can you explain the difference?

I take it foldl forces a specific order. Do you know why sum might choose a different order?

DanielVandH · 2024-07-14T23:05:38Z

As far as I can tell, the difference is that IndexStyle(vec(A)) = IndexCartesian() (since, for sparse arrays and banded matrices, vec(A) becomes a reshape type unlike a normal matrix where it becomes a vector) which uses mapfoldl, while Matrix(A) is an IndexLinear() which uses some sort of block-based summation.

Another way to test would be to check sum(Vector(vec(a))) == sum(Matrix(A)). I think the implementation in this PR is equivalent to doing a foldl implementation, so the tests should probably look at sum(B) == foldl(+, Matrix(B)) if I've read it correctly

dlfivefifty · 2024-07-15T13:00:47Z

I think in this case just use ≈ since we don't care that much about preserving order

dlfivefifty · 2024-07-15T15:46:08Z

src/generic/AbstractBandedMatrix.jl

+
+function sum!(ret::AbstractArray, A::AbstractBandedMatrix)
+    #Behaves similarly to Base.sum!
+    ret .= 0


Suggested change

ret .= 0

fill!(ret, zero(eltype(ret)))

dlfivefifty · 2024-07-15T15:46:21Z

src/generic/AbstractBandedMatrix.jl

+    if s[1] == 1 && (l == 1 || s[2]==1)
+        for j = 1:m, i = colrange(A, j)
+            ret .+= A[i, j]
+        end


Add tests fro this special case

dlfivefifty · 2024-07-15T15:47:27Z

src/generic/AbstractBandedMatrix.jl

+            ret[1, j] += A[i, j]
+        end
+    elseif s[1] == n && s[2] == m
+        ret = A


This is not changing ret!

Suggested change

ret = A

copyto!(ret, A)

dlfivefifty · 2024-07-15T15:47:41Z

src/generic/AbstractBandedMatrix.jl

+    elseif s[1] == n && s[2] == m
+        ret = A
+    else
+        throw(DimensionMismatch("reduction on matrix of size ($n, $m) with output size $s"))


Add test using @test_throws

test/test_sum.jl

max-vassili3v · 2024-07-16T14:04:27Z

seems to be the same issue as before with the floating point error

dlfivefifty · 2024-07-16T14:58:36Z

That error is unrelated I believe, we have seen it other places.

max-vassili3v added 5 commits July 8, 2024 16:22

add sum without dims

87c311f

add sum; dims=1

b6530b9

support for dims = 2 and error handling

bec277e

fix for empty matrices and added unit tests

cd92d7f

style

0768d34

dlfivefifty requested changes Jul 9, 2024

View reviewed changes

src/banded/BandedMatrix.jl Outdated Show resolved Hide resolved

src/banded/BandedMatrix.jl Outdated Show resolved Hide resolved

src/banded/BandedMatrix.jl Outdated Show resolved Hide resolved

max-vassili3v added 3 commits July 9, 2024 14:17

make improvements

cb197f5

add test_sum.jl to runtests.jl

d0109be

fix method dispatch issue in a way that mimics Base.sum

8da60df

dlfivefifty requested changes Jul 9, 2024

View reviewed changes

update unit tests, reduce memory allocation, improve style

de15c53

dlfivefifty requested changes Jul 10, 2024

View reviewed changes

src/banded/BandedMatrix.jl Outdated Show resolved Hide resolved

src/banded/BandedMatrix.jl Outdated Show resolved Hide resolved

dlfivefifty reviewed Jul 10, 2024

View reviewed changes

src/banded/BandedMatrix.jl Outdated Show resolved Hide resolved

dlfivefifty reviewed Jul 10, 2024

View reviewed changes

src/banded/BandedMatrix.jl Outdated Show resolved Hide resolved

update unit tests, add sum!, move to AbstractBandedMatrix.jl

0e0bc12

dlfivefifty approved these changes Jul 11, 2024

View reviewed changes

dlfivefifty self-requested a review July 11, 2024 13:35

revert tests to \approx

cfc895e

dlfivefifty requested changes Jul 15, 2024

View reviewed changes

DanielVandH reviewed Jul 15, 2024

View reviewed changes

test/test_sum.jl Outdated Show resolved Hide resolved

max-vassili3v added 2 commits July 16, 2024 14:07

make improvements in AbstractBandedMatrix.jl

dc9a82d

test special cases of sum!

393bf62

add some tests and avoid CI failure

5ec4b3c

dlfivefifty approved these changes Jul 16, 2024

View reviewed changes

dlfivefifty merged commit ea616cc into JuliaLinearAlgebra:master Jul 16, 2024
16 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement sum #446

Implement sum #446

max-vassili3v commented Jul 8, 2024

codecov bot commented Jul 8, 2024 •

edited

Loading

dlfivefifty commented Jul 9, 2024

max-vassili3v commented Jul 9, 2024

dlfivefifty commented Jul 10, 2024

max-vassili3v commented Jul 10, 2024

max-vassili3v commented Jul 10, 2024

dlfivefifty commented Jul 10, 2024

dlfivefifty commented Jul 10, 2024

max-vassili3v commented Jul 10, 2024

dlfivefifty commented Jul 10, 2024

DanielVandH commented Jul 14, 2024

dlfivefifty commented Jul 14, 2024

DanielVandH commented Jul 14, 2024 •

edited

Loading

dlfivefifty commented Jul 15, 2024

dlfivefifty Jul 15, 2024

dlfivefifty Jul 15, 2024

dlfivefifty Jul 15, 2024

dlfivefifty Jul 15, 2024

max-vassili3v commented Jul 16, 2024

dlfivefifty commented Jul 16, 2024

Implement sum #446

Implement sum #446

Conversation

max-vassili3v commented Jul 8, 2024

codecov bot commented Jul 8, 2024 • edited Loading

Codecov Report

dlfivefifty commented Jul 9, 2024

max-vassili3v commented Jul 9, 2024

dlfivefifty commented Jul 10, 2024

max-vassili3v commented Jul 10, 2024

max-vassili3v commented Jul 10, 2024

dlfivefifty commented Jul 10, 2024

dlfivefifty commented Jul 10, 2024

max-vassili3v commented Jul 10, 2024

dlfivefifty commented Jul 10, 2024

DanielVandH commented Jul 14, 2024

dlfivefifty commented Jul 14, 2024

DanielVandH commented Jul 14, 2024 • edited Loading

dlfivefifty commented Jul 15, 2024

dlfivefifty Jul 15, 2024

Choose a reason for hiding this comment

dlfivefifty Jul 15, 2024

Choose a reason for hiding this comment

dlfivefifty Jul 15, 2024

Choose a reason for hiding this comment

dlfivefifty Jul 15, 2024

Choose a reason for hiding this comment

max-vassili3v commented Jul 16, 2024

dlfivefifty commented Jul 16, 2024

codecov bot commented Jul 8, 2024 •

edited

Loading

DanielVandH commented Jul 14, 2024 •

edited

Loading