nonzero beta + flipkernel bugfix #519

nikopj · 2023-07-04T23:06:37Z

Addresses #518.

CarloLucibello · 2023-07-05T05:15:04Z

can you add a test?

ToucheSir · 2023-07-05T13:54:34Z

It also feels like it should be possible to address the issue with a fix to how the function does indexing in the inner loop instead of wholesale flipping the kernel gradients inside the function (won't they be flipped back and forth for multiple calls?) but I may be missing something.

nikopj · 2023-07-06T05:52:10Z

> It also feels like it should be possible to address the issue with a fix to how the function does indexing in the inner loop instead of wholesale flipping the kernel gradients inside the function (won't they be flipped back and forth for multiple calls?) but I may be missing something.

You're right, it can be done much more cleanly with view.

On writing the tests for this, I've come across a much wider set of bugs across conv implementations with non-zero beta. For example,

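using NNlib

# Compare the direct and im2col data-gradient kernels for zero and non-zero beta.
for beta = (0f0, 1f0)
    @show beta
    x = fill(1f0, 2, 1, 1)       # existing contents of the output buffer
    w = [-1f0; 1f0;;;]           # 2×1×1 kernel
    y = fill(1f0, 1, 1, 1)       # incoming gradient
    cdims = NNlib.DenseConvDims(x, w)

    # Both calls should compute alpha * ∇x + beta * x into a copy of x.
    x_direct = NNlib.∇conv_data_direct!(copy(x), y, w, cdims; alpha=1f0, beta=beta)
    x_im2col = NNlib.∇conv_data_im2col!(copy(x), y, w, cdims; alpha=1f0, beta=beta)

    @show x_direct
    @show x_im2col
    @show x_direct ≈ x_im2col
end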

output:

beta = 0.0f0
x_direct = [1.0; -1.0;;;]
x_im2col = [1.0; -1.0;;;]
x_direct ≈ x_im2col = true
beta = 1.0f0
x_direct = [2.0; 0.0;;;]
x_im2col = [1.0; -1.0;;;]
x_direct ≈ x_im2col = false

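Here the direct version is correct and the im2col version is incorrect: with beta=1 the result should be alpha * ∇x + beta * x = [1, -1] + [1, 1] = [2, 0], but the im2col path ignores the existing contents of x. This can be seen in src/impl/conv_im2col.jl, lines 163 and 165. I think the cleanest solution would be to add alpha/beta args to col2im! and im2col!. Let me know what you think and I can work on fixing this too.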
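For readers unfamiliar with the alpha/beta convention, here is a minimal sketch of the update the kernels are expected to perform. The helper `scaled_update!` is hypothetical and is not part of NNlib; it only illustrates the semantics `dst = alpha * src + beta * dst` using the numbers from the example above.

```julia
# Hypothetical helper, not part of NNlib: applies the BLAS-style update
# dst = alpha * src + beta * dst in place.
function scaled_update!(dst::AbstractArray, src::AbstractArray; alpha=1f0, beta=0f0)
    @assert size(dst) == size(src)
    if iszero(beta)
        # beta == 0 means "overwrite": the old contents of dst are discarded.
        dst .= alpha .* src
    else
        # Non-zero beta: the old contents of dst are scaled and kept.
        dst .= alpha .* src .+ beta .* dst
    end
    return dst
end

grad = reshape(Float32[1, -1], 2, 1, 1)   # gradient values from the example above
x    = fill(1f0, 2, 1, 1)                 # existing contents of the output buffer

overwritten = scaled_update!(copy(x), grad; alpha=1f0, beta=0f0)
accumulated = scaled_update!(copy(x), grad; alpha=1f0, beta=1f0)
```

Under these assumptions, `overwritten` matches the beta=0 output and `accumulated` matches the direct kernel's beta=1 output, which is what the im2col path would also need to return.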

ToucheSir · 2023-07-06T14:17:59Z

I haven't worked through the math yet, but assuming it's correct I think it would be easiest to add the tests first and mark them as broken for the im2col path. Then we can merge those quickly and allow the fixes to be made at a more leisurely pace. It concerns me greatly that nobody bothered to write tests with a non-zero beta (or a non-one alpha!) when the conv kernels were first added.

nikopj · 2023-07-06T17:42:14Z

Tests are added. I kept alpha=2, beta=-1 in all tests. I generate some random data for the conv! tests. I chose to do rand(rng, -9e0:9e0) so that debugging may be easier by keeping answers as integers. I can change those if there is some standard protocol for these that I'm not aware of.
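As a rough illustration of the integer-valued test data idea, the sketch below draws whole-number Float64 values and combines them with alpha=2, beta=-1. The function `xcorr1d` is a hypothetical stand-in reference, not NNlib's implementation, and the array sizes are chosen arbitrarily.

```julia
using Random

# Naive 1-D cross-correlation (no padding, stride 1); a stand-in reference,
# not NNlib's implementation.
function xcorr1d(x::AbstractVector, w::AbstractVector)
    n = length(x) - length(w) + 1
    return [sum(x[i + k - 1] * w[k] for k in eachindex(w)) for i in 1:n]
end

rng = MersenneTwister(0)
x  = rand(rng, -9e0:9e0, 8)   # Float64 values that are all whole numbers
w  = rand(rng, -9e0:9e0, 3)
y0 = rand(rng, -9e0:9e0, 6)   # pre-existing contents of the output buffer

alpha, beta = 2e0, -1e0
y = alpha .* xcorr1d(x, w) .+ beta .* y0
```

Because every input is a whole number, every entry of `y` is too, so a mismatch between implementations shows up as an obvious integer difference rather than floating-point noise.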

In general, the filter and data im2col implementations are broken with non-zero beta. It seems that the direct implementations are also broken for some cases of spatial-rank, stride, and dilation.

The original flipkernel + non-zero beta bug with ∇conv_filter_direct! is fixed.
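The view-based approach discussed earlier in the thread can be sketched as follows. This is not the code from the PR: `flipped_view` is a hypothetical helper, and it assumes the kernel layout puts spatial dimensions first with the last two dimensions holding channels.

```julia
# Hypothetical helper: a reversed view over the spatial dimensions of `w`.
# Nothing is copied, and writes through the view land in `w` itself,
# so the underlying array is never flipped back and forth between calls.
function flipped_view(w::AbstractArray{T,N}) where {T,N}
    spatial = ntuple(d -> size(w, d):-1:1, N - 2)
    return view(w, spatial..., :, :)
end

dw = zeros(Float32, 3, 1, 1)   # kernel-gradient buffer with one spatial dim
dw_flipped = flipped_view(dw)
dw_flipped[1, 1, 1] = 5f0      # lands in dw[3, 1, 1]
```

Accumulating into `dw_flipped` with any alpha and beta then updates `dw` in flipped order directly, so no separate in-place reversal of the gradient is needed.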

ToucheSir

Like the change and the expanded test suite is great!

src/impl/conv_direct.jl

Co-authored-by: Brian Chen <ToucheSir@users.noreply.github.com>

nonzero beta + flipkernel bugfix

ef2505d

nikopj mentioned this pull request Jul 4, 2023

cudnn complex convolution via gauss trick #517

Merged

2 tasks

conv! alpha/beta tests added, conv_filter_direct flipkernel with view

aceb24d

Merge branch 'FluxML:master' into nikopj-directbeta

85fd6f2

ToucheSir approved these changes Jul 8, 2023


Update src/impl/conv_direct.jl

8f57824

Co-authored-by: Brian Chen <ToucheSir@users.noreply.github.com>

ToucheSir merged commit 629475a into FluxML:master Jul 8, 2023
9 of 12 checks passed
