-
Notifications
You must be signed in to change notification settings - Fork 109
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Drastic increased startup lag when increasing polydeg #516
Comments
Ok, it also seems to be connected to the ODE solver used. The time for ODE goes down again, when using SSPRK33() with a timestep callback. But the major part still is the call of solve(), here it seems to not scale well with polydeg and it takes couple of minutes before the first output on screen is shown. |
I think the same goes for the analysis/IO...it seems to take substantially more time with N=15, before the first error norm is plottet on screen. |
I expect this latency overhead to be caused by our extensive use of
We use
The latter is already on my TODO list (and I'm working on debugging some performance issues). However, I would like to get some canonical performance benchmarks at first before really going into this kind of optimizations to reduce the amount of manual testing, cf. #503. |
I will have a look at this but I strongly suggest to setup some canonical performance (runtime and latency) first to make this task easier, cf. #503 |
Having merged #535, we get the following results for julia --check-bounds=no --threads=1 -e 'using Trixi; @time trixi_include("examples/2d/elixir_advection_extended.jl", tspan=(0.0, 1.0e-10), polydeg=POLYDEG)' | egrep "seconds"
Could you please check whether things work better for you now, @gregorgassner? If not, please re-open this issue and post some details. |
Fantastic work, @ranocha, given the surgical precision of the changes to the general code (virtually none) |
I am observing that the startup time (time for setting up ODE and time for setting up solve) scales very strongly with polydeg. Instead of a second for the KHI when using N=3, it takes a couple of seconds for N=7 and a couple of minutes for N=15 (while keeping the total number of DOF equal, i.e. reducing the grid level).
Any idea what is going on?
The text was updated successfully, but these errors were encountered: