Precompute FFTW plan #159

giordano · 2021-03-04T11:25:11Z

Very work-in-progress (still needs to adapt tests)

.github/workflows/benchmark.yml

giordano · 2021-03-05T15:16:18Z

Ready for review but not for merging, I'll need to revert the change to the benchmark before that

tkoskela · 2021-03-08T13:49:35Z

src/OptimalFilter.jl

-# Operates on the 2d data stored as a matrix.
-# From Dietrich & Newsam 96 in text following equation 12
-function normalized_inverse_2d_fft!(transformed_array::AbstractMatrix{T}, array::AbstractMatrix{S}, grid_ext) where {T,S}
+function normalized_2d_fft!(transformed_vector::AbstractVector{<:Complex}, vector::AbstractVector{S}, fft_plan::FFTW.FFTWPlan, fft_plan!::FFTW.FFTWPlan, grid_ext, f=identity) where {S}


fft_plan is not used?

That's correct, but all these methods need to have the same signature in terms of number of arguments.

tkoskela · 2021-03-08T13:49:58Z

src/OptimalFilter.jl

+        mul!(transformed_array, f(fft_plan!), array)
+    else
+        mul!(transformed_array, f(fft_plan), array)


What's the difference between fft_plan and fft_plan!?

The out-of-place fft_plan can be used only in

mul!(B, fft_plan, A)

with A and B different arrays, or more precisely pointer(A) != pointer(B): https://github.com/JuliaMath/FFTW.jl/blob/2c20abe7098a2380fa37fe021b3d21fc593059ec/src/fft.jl#L438. Instead, the in-place fft_plan! can only be used in

mul!(A, fft_plan!, A)

We do have cases where we overwrite the input array.

tkoskela · 2021-03-08T13:50:11Z

src/OptimalFilter.jl

-    normalization_factor = 1.0 / sqrt(grid_ext.nx * grid_ext.ny)
-    transformed_array .= fft(array) .* normalization_factor
+    normalization_factor = f(sqrt(grid_ext.nx * grid_ext.ny))
+    if pointer(transformed_array) == pointer(array)


What are the pointers for?

See the answer to #159 (comment)

src/OptimalFilter.jl

tkoskela · 2021-03-08T13:54:08Z

src/ParticleDA.jl

+    fft_plan, fft_plan! = FFTW.plan_fft(tmp_array), FFTW.plan_fft!(tmp_array)

-    offline_matrices = init_offline_matrices(grid, grid_ext, stations, model_noise_params, obs_noise_std, T)
+    offline_matrices = init_offline_matrices(grid, grid_ext, stations, model_noise_params, obs_noise_std, fft_plan, fft_plan!, T)


It looks like fft_plan, fft_plan! are always passed together so could just store them as a tuple?

This is actually what I did at the beginning, but I didn't like it too much because it was opaque what fft_plan[1] and fft_plan[2] were (which is the out-of-place, which is the in-place?). I guess you'd suggest a named tuple? 😄

😁 Not a big deal either way

* Add functions for Alex optimal filter * Rewrite functions with rho_bar a vector and R21,R12 matrices * sample_height_proposal passes preliminary tests * sample_height_proposal passes preliminary tests * Add validation against Alex code * Add performance benchmarks * Shift grid indices so that both station and grid indexing starts at 1. Shift indices when initialising StationVectors from st.st_ij of Sample_Optimal_Height_Proposal.jl * Add get_log_weights function * Add comments, fix bugs change i to iprt for correct behaviour in get_log_weights! interpolate variables in function calls that are timed with @Btime * Apply suggestions from code review Add dots and in-place operations Co-authored-by: Mosè Giordano <giordano@users.noreply.github.com> * matrices.K*KH is a real matrix product * Use mul! to do multiplication in-place * Remove duplicate call * Minimal allocations reduction * Add get_stations and set_particles! to the interface * Reorder data structures for compatibility. Remove unit tests (will move to runtests.jl) * Integrate Optimal Filter to ParticleDA - Added unit tests and validation tests - Made changes to model interface to expose fields needed by optimal filter - Added a OptimalFilter type for dispatching - No implementation of run_particle_filter yet * Apply suggestions from code review Co-authored-by: Mosè Giordano <giordano@users.noreply.github.com> * Draft implementation * Changes suggested by review Changing rand(rng,nd) to randn(rng) causes validation test to fail unless it's done on both sides. Unclear why. * Implementation according to comment on 5 Feb in #136 * Add compat entry for FFTW * Change to a more sensible test height field * Make grid size nx (not nx+1) on both ParticleDA and OptimalFilter. Something goes wrong. * Correct dx,dy in covariance calculations. tests pass. * Fix bugs * Apply suggestions from code review Co-authored-by: Mosè Giordano <giordano@users.noreply.github.com> * Apply suggestions from code review In two parts because I didn't realize github doesn't automatically load a large diff even when there are comments. Co-authored-by: Mosè Giordano <giordano@users.noreply.github.com> * Define model grid as storing values at the nodes (instead of cell faces) The only change required is defining dx as L_x/(nx-1) instead of L_x/nx In my opinion this is more consistent with the way we define the stations Makes it easier (or perhaps even possible) to integrate with the optimal filter Also fixed a bug in write_stations that created an Int dataset on the hdf5 file for the coordinate (float) value Reference data and some tests had to be updated * Pass grid parameters instead of using model_params * Fix bugs * Minor improvements - Parametric types in init_filter - Comments - Indentations * Fix arguments to init_*line_matrices * Use sigma and lambda from model params instead of redefining them in filter. * update notebook with animation and parameters * Point out bootstrap filter in example * add FFTW to benchmark environment * Pass filter type to run_particle_filter and use it to dispatch (#155) * Pass filter type to run_particle_filter and use it to dispatch appropriate functions * Update to simpler syntax * Update benchmarks and the benchmark environment Co-authored-by: Mosè Giordano <mose@gnu.org> * Split get_grid_size into three functions (#157) * Split get_grid_size into three functions that return NTuples * Remove extra parenthesis * Clarify function names * Changes suggested by review - Define Grid in OptimalFilter - Annotate argument types - Fix runtests.jl - Fix docstrings * Add integration test with optimal filter * Add input file for the test * Precompute FFTW plan (#159) * Precompute FFTW plan * Support in-place and out-of-place FFTs * Reduce duplication in `normalized_2d_fft!` methods * Force A to be hermitian to avoid roundoff errors Co-authored-by: Mosè Giordano <giordano@users.noreply.github.com> * Throw error if matrix is not approximately Hermitian * clean up * Pass RNG directly to the filter instead of the model Also, use the RNG also in the `resample!` function. * Reduce allocations in `sample_height_proposal!` (#162) Co-authored-by: Mosè Giordano <giordano@users.noreply.github.com> Co-authored-by: Mosè Giordano <mose@gnu.org>

giordano marked this pull request as draft March 4, 2021 11:25

giordano force-pushed the mg/fft-plan branch 6 times, most recently from 8f7cefe to c64d531 Compare March 5, 2021 00:47

giordano commented Mar 5, 2021

View reviewed changes

.github/workflows/benchmark.yml Outdated Show resolved Hide resolved

giordano force-pushed the mg/fft-plan branch 3 times, most recently from 5e8df32 to 96fbec3 Compare March 5, 2021 15:02

giordano requested a review from tkoskela March 5, 2021 15:15

giordano marked this pull request as ready for review March 5, 2021 15:15

tkoskela reviewed Mar 8, 2021

View reviewed changes

giordano force-pushed the mg/fft-plan branch from 96fbec3 to d2effbf Compare March 8, 2021 14:34

giordano added 3 commits March 8, 2021 16:09

Precompute FFTW plan

7bb56b0

Support in-place and out-of-place FFTs

69abf94

Reduce duplication in normalized_2d_fft! methods

cf90791

giordano force-pushed the mg/fft-plan branch from d2effbf to cf90791 Compare March 8, 2021 16:09

giordano merged commit 7fcf0aa into tk/optimal-filter Mar 8, 2021

giordano deleted the mg/fft-plan branch March 8, 2021 16:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Precompute FFTW plan #159

Precompute FFTW plan #159

giordano commented Mar 4, 2021 •

edited

Loading

giordano commented Mar 5, 2021

tkoskela Mar 8, 2021

giordano Mar 8, 2021

tkoskela Mar 8, 2021

giordano Mar 8, 2021

tkoskela Mar 8, 2021

giordano Mar 8, 2021

tkoskela Mar 8, 2021

giordano Mar 8, 2021

tkoskela Mar 8, 2021

Precompute FFTW plan #159

Precompute FFTW plan #159

Conversation

giordano commented Mar 4, 2021 • edited Loading

giordano commented Mar 5, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

giordano commented Mar 4, 2021 •

edited

Loading