performance improvement #64
Conversation
Codecov Report
@@            Coverage Diff             @@
##           master      #64      +/-   ##
==========================================
- Coverage   86.88%   86.58%   -0.31%
==========================================
  Files          30       30
  Lines        3150     3109      -41
==========================================
- Hits         2737     2692      -45
- Misses        413      417       +4
Overall it looks good to me. I am just not sure I understand why we get rid of the SparseUnreducedKKTSystem here. Was it really detrimental to performance?
About the @simd macro, I am wondering whether, in the long term, we could incorporate a proper package for SIMD vectorization in MadNLP. E.g.
src/interiorpointsolver.jl (outdated)
@@ -709,9 +690,8 @@ function regular!(ips::AbstractInteriorPointSolver)
      switching_condition = is_switching(varphi_d,ips.alpha,ips.opt.s_phi,ips.opt.delta,2.,ips.opt.s_theta)
      armijo_condition = false
      while true
-         ips.x_trial .= ips.x .+ ips.alpha.*ips.dx
+         apply_step!(ips.x_trial,ips.x,ips.dx,ips.alpha)
Instead of introducing a new function apply_step!, how about calling the BLAS routines directly?
copyto!(ips.x_trial, ips.x)
axpy!(ips.alpha, ips.dx, ips.x_trial)
Yes, using BLAS makes more sense 👍
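To make the suggestion concrete, here is a minimal stand-alone sketch of the copyto!/axpy! pattern, using hypothetical data in place of the ips fields (the variable names x, dx, alpha, x_trial mirror the diff, but the values are made up for illustration):

```julia
using LinearAlgebra  # provides axpy!, which dispatches to BLAS for dense Float64 vectors

# Stand-in data; in the solver these would be fields of ips.
x  = [1.0, 2.0, 3.0]
dx = [0.5, -1.0, 2.0]
alpha = 0.1
x_trial = similar(x)

# Equivalent to the broadcast x_trial .= x .+ alpha .* dx,
# expressed as two in-place BLAS-style calls with no temporary array:
copyto!(x_trial, x)        # x_trial <- x
axpy!(alpha, dx, x_trial)  # x_trial <- alpha*dx + x_trial
```

The broadcast form fuses into a single loop as well, so the two should perform similarly; the BLAS form mainly avoids defining a new helper function.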
src/interiorpointsolver.jl (outdated)
-     ips.l.+=ips.alpha.*ips.dl
      ips.zl_r.+=ips.alpha_z.*ips.dzl
      ips.zu_r.+=ips.alpha_z.*ips.dzu
+     apply_step!(ips.l,ips.l,ips.dl,ips.alpha)
Same here?
axpy!(ips.alpha, ips.dl, ips.l)
-     _set_aug_diagonal_unreduced!(kkt.pr_diag, kkt.du_diag, kkt.l_lower, kkt.u_lower, kkt.l_diag, kkt.u_diag,
-                                  ips.zl_r, ips.zu_r, ips.xl_r, ips.xu_r, ips.x_lr, ips.x_ur,
-     )
+     kkt.pr_diag .= ips.zl./(ips.x.-ips.xl) .+ ips.zu./(ips.xu.-ips.x)
Nice that we can get rid of the auxiliary function!
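The broadcast that replaces the auxiliary function is easy to sketch on its own. Below is a stand-alone version with plain (hypothetical) vectors in place of the kkt/ips fields; it computes the primal diagonal zl/(x - xl) + zu/(xu - x) for bound-constrained variables:

```julia
# Hypothetical data standing in for ips.x, ips.xl, ips.xu, ips.zl, ips.zu;
# x must lie strictly inside [xl, xu] for the divisions to be well defined.
x  = [0.5, 1.0]
xl = [0.0, 0.0]; xu = [2.0, 2.0]
zl = [1.0, 0.5]; zu = [0.2, 0.1]

# Same shape as the line in the diff: the dotted operators fuse the
# whole expression into one loop with no intermediate arrays.
pr_diag = zl ./ (x .- xl) .+ zu ./ (xu .- x)
```

In the solver the left-hand side is the preallocated kkt.pr_diag, so the fused broadcast assigns in place without allocating.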
      xll = x_lr[i]-xl_r[i]
      @inbounds xll < 0 && return Inf
      @inbounds varphi -= mu*log(xll)
+     @simd for i=1:length(x_lr)
Do we observe a difference when using @simd?
Not quite sure; I think that will need a separate benchmark.
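A quick way to get a first answer is a rough timing sketch like the one below (not a rigorous benchmark; BenchmarkTools.jl would be the proper tool). The two kernels are hypothetical reductions shaped like the barrier-term loop in the diff; note that the @simd version drops the early return, since @simd assumes iterations are safe to reorder:

```julia
# Plain loop, with the early exit from the diff.
function barrier_plain(x_lr, xl_r, mu)
    varphi = 0.0
    for i in 1:length(x_lr)
        xll = x_lr[i] - xl_r[i]
        xll < 0 && return Inf
        varphi -= mu * log(xll)
    end
    return varphi
end

# @simd variant: early return removed, bounds checks elided.
function barrier_simd(x_lr, xl_r, mu)
    varphi = 0.0
    @simd for i in 1:length(x_lr)
        @inbounds xll = x_lr[i] - xl_r[i]
        varphi -= mu * log(xll)
    end
    return varphi
end

# Hypothetical data chosen so that x_lr[i] - xl_r[i] > 0 everywhere.
x_lr = rand(10^6) .+ 1.0
xl_r = rand(10^6)
mu = 0.1

barrier_plain(x_lr, xl_r, mu)  # warm-up calls to exclude compilation time
barrier_simd(x_lr, xl_r, mu)
t_plain = @elapsed barrier_plain(x_lr, xl_r, mu)
t_simd  = @elapsed barrier_simd(x_lr, xl_r, mu)
```

Because @simd allows the reduction to be reassociated, the two results can differ by rounding; any speedup will also depend on whether the log call vectorizes on the target machine.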
This PR looks good to me now. Nice that we can remove the auxiliary functions we implemented in #58!
I will start working on implementing the KKT right-hand-side structure once this PR is merged into master.
In this PR, we make several changes to improve performance.