
Where is JuMP's sparse AD code, and can DifferentiationInterface help? #3806

Closed
gdalle opened this issue Aug 14, 2024 · 5 comments

Comments

@gdalle

gdalle commented Aug 14, 2024

Hi there @odow and friends!

Looking at our work on sparse autodiff with @adrhill and @amontoison, I've been wondering how it could be useful to JuMP. We have developed a combination of three new packages:

From what I understand, JuMP's current sparse AD engine is Nonlinear.ReverseAD, but there are also experiments going on in MathOptSymbolicAD.jl. Did I miss anything?

I'm not suggesting that DifferentiationInterface and friends should replace your own default AD solution. But since you say in the docs that testing other AD backends would be nice, maybe there's an angle there?
In any case, perhaps DI could be a nice addition to the docs page on autodiff of user-defined operators?
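As a sketch of what such a docs example might look like (my illustration, not anything from the thread: the function `f`, the operator name `op_f`, and the choice of ForwardDiff via `AutoForwardDiff` are assumptions), a user-defined operator whose gradient callback delegates to DifferentiationInterface could be written as:

```julia
using JuMP
using ADTypes: AutoForwardDiff
import DifferentiationInterface as DI
import ForwardDiff  # the backend actually doing the differentiation

# Scalar function of two variables, written with splatted arguments as JuMP expects.
f(x...) = exp(x[1]) + x[2]^2

# JuMP's gradient callback must fill `g` in place with ∇f at the point `x...`.
function ∇f(g::AbstractVector{T}, x::T...) where {T}
    DI.gradient!(y -> f(y...), g, AutoForwardDiff(), collect(x))
    return
end

model = Model()
@variable(model, x[1:2])
@operator(model, op_f, 2, f, ∇f)
@objective(model, Min, op_f(x[1], x[2]))
```

Switching backends would then only mean swapping the `AutoForwardDiff()` object, which is the main selling point of DI here.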

@odow
Member

odow commented Aug 14, 2024

> Did I miss anything?

Nope. That is the situation. All nonlinear/AD stuff is in the MOI.Nonlinear module. There is nothing in JuMP.

> But since you say in the docs that testing other AD backends would be nice, maybe there's an angle there?

New AD backends need to be implemented as an oracle like this:

https://github.com/lanl-ansi/MathOptSymbolicAD.jl/blob/e5d3eaa1f56346d53a5ef8f895743ab3e89ff38c/src/nonlinear_oracle.jl#L278-L496
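For readers unfamiliar with that file, here is a hedged skeleton of such an oracle (the type `MyADEvaluator` and all method bodies are illustrative placeholders, not code from MathOptSymbolicAD.jl): a backend subtypes `MOI.AbstractNLPEvaluator` and implements the evaluation callbacks that solvers query.

```julia
import MathOptInterface as MOI

# Placeholder evaluator: a real backend would store tapes, sparsity patterns,
# colorings, etc. in this struct.
struct MyADEvaluator <: MOI.AbstractNLPEvaluator end

MOI.features_available(::MyADEvaluator) = [:Grad, :Jac, :Hess]

function MOI.initialize(::MyADEvaluator, requested_features::Vector{Symbol})
    # Build whatever data structures are needed for the requested features.
    return
end

# Objective and gradient for a stand-in objective f(x) = Σ xᵢ².
MOI.eval_objective(::MyADEvaluator, x) = sum(abs2, x)

function MOI.eval_objective_gradient(::MyADEvaluator, grad, x)
    grad .= 2 .* x
    return
end

# No constraints in this placeholder, so the Jacobian is empty.
MOI.jacobian_structure(::MyADEvaluator) = Tuple{Int,Int}[]
MOI.eval_constraint(::MyADEvaluator, g, x) = nothing
MOI.eval_constraint_jacobian(::MyADEvaluator, J, x) = nothing

# Hessian support (hessian_lagrangian_structure, eval_hessian_lagrangian)
# would be added the same way.
```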

> In any case, perhaps DI could be a nice addition to the docs page on autodiff of user-defined operators?

Feel free to open a PR 😄

@amontoison
Contributor

amontoison commented Aug 15, 2024

What could be useful for JuMP is star coloring for the Hessian of the Lagrangian.
It requires more colors than acyclic coloring (the coloring used by default in MOI.Nonlinear), but it reduces memory pressure because it does not need to store all columns of the compressed Hessian at once, which matters for very large problems.
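As an aside, a minimal sketch of that trade-off (mine, not amontoison's; it assumes SparseMatrixColorings.jl's `ColoringProblem` / `GreedyColoringAlgorithm` API, where `decompression = :direct` corresponds to star coloring and `:substitution` to acyclic coloring):

```julia
using SparseArrays
using SparseMatrixColorings

# Sparsity pattern of a small symmetric Hessian (arrowhead shape).
H = sparse([1 1 1 1; 1 1 0 0; 1 0 1 0; 1 0 0 1])

problem = ColoringProblem(; structure=:symmetric, partition=:column)

star    = coloring(H, problem, GreedyColoringAlgorithm(; decompression=:direct))
acyclic = coloring(H, problem, GreedyColoringAlgorithm(; decompression=:substitution))

# Star coloring typically uses more colors, but each Hessian entry can be
# recovered directly from the compressed columns, without a substitution pass.
@show maximum(column_colors(star)) maximum(column_colors(acyclic))
```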

The issue is that the nonlinear code in MOI is highly specialized for what JuMP needs, so it's probably not easy to create a bridge.

@odow
Member

odow commented Aug 18, 2024

I have no plans to work on a DI interface, but anyone interested is encouraged to give it a go. It can (and should) be done first in a separate repo like MathOptSymbolicAD.jl. One goal of refactoring the nonlinear stuff out of JuMP was so that we could experiment without needing to make source code changes to JuMP or MathOptInterface.

For the user-defined gradients with DI, I'm open to including this in the documentation, but it is lower priority, so perhaps we could close this issue in favor of adding a TODO item to #2348.

@amontoison
Contributor

I agree with that, Oscar.

@odow
Member

odow commented Aug 19, 2024

Closing in favor of #2348 (comment)

odow closed this as completed Aug 19, 2024