
Add support for Float32 #187

Merged: 14 commits from ss/float32 into master on Jul 5, 2022
Conversation

@sshin23 (Member) commented Jul 4, 2022

This PR adds support for Float32, or any other precision type.

To make this possible, we make AbstractLinearSolver a parametric type, where the precision is given as a type parameter. The Float32 version of each solver interface is added where the underlying solver supports it.
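As a rough illustration of the idea described above, here is a minimal sketch of a linear solver parametrized by its float type. The names (`DenseCholeskySolver`, `solve!`) are hypothetical and chosen for this example only; they are not MadNLP's actual definitions.

```julia
using LinearAlgebra

# Hypothetical sketch: the solver carries the precision T as a type
# parameter, so a Float32 problem stays in Float32 end to end.
abstract type AbstractLinearSolver{T} end

struct DenseCholeskySolver{T} <: AbstractLinearSolver{T}
    A::Matrix{T}
end

# Factorize and solve in place; the factorization inherits precision T.
function solve!(s::DenseCholeskySolver{T}, b::Vector{T}) where {T}
    F = cholesky(s.A)   # Cholesky factors are computed in precision T
    b .= F \ b
    return b
end
```

With this layout, constructing `DenseCholeskySolver{Float32}` is enough to route the whole factorization through single precision, which is the property the PR exploits on GPUs.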

@sshin23 sshin23 requested a review from frapac July 4, 2022 00:26
@frapac (Collaborator) left a comment

That's a significant improvement to MadNLP! I read this PR and have only a few minor comments so far. It would be interesting to find a good use case to advertise this new capability. Maybe we should run the GPU benchmark on a simple GPU that does not support double precision?

Files reviewed (all comments resolved):
- lib/MadNLPGPU/src/MadNLPGPU.jl (outdated)
- lib/MadNLPGPU/src/lapackgpu.jl
- lib/MadNLPPardiso/test/runtests.jl
- lib/MadNLPTests/src/Instances/dummy_qp.jl (outdated)
- src/IPM/IPM.jl (outdated, two threads)
- src/IPM/kernels.jl (outdated)
- src/LinearSolvers/linearsolvers.jl (two threads, one outdated)
@sshin23 (Member, Author) commented Jul 5, 2022

Thanks, @frapac, for the review! Indeed, creating a good use case would be important; the feature probably doesn't have a big advantage on CPU.

Just ran a simple experiment on my laptop:

julia> T=Float32; N=400; a = CUDA.randn(T,N,N); a = a*a'+I; @time cholesky(a);
  0.000969 seconds (163 allocations: 9.016 KiB)

julia> T=Float64; N=400; a = CUDA.randn(T,N,N); a = a*a'+I; @time cholesky(a);
  0.002487 seconds (163 allocations: 9.016 KiB)

So it would be interesting to test the performance on a very large-scale dense problem on GPU, for example with DynamicNLPModels.jl.

cc: @dlcole3

@sshin23 sshin23 merged commit 897acf1 into master Jul 5, 2022
@sshin23 sshin23 deleted the ss/float32 branch July 5, 2022 15:06
2 participants