GitHub

Highly Multivariate Large-scale Spatial Stochastic Processes -- A Cross-Markov Random Field Approach

This repo contains the code for the manuscript, entitled "Highly Multivariate Large-scale Spatial Stochastic Processes -- A Cross-Markov Random Field Approach", by Xiaoqing Chen, Peter Diggle, James.V.Zideck, Gavin Shaddick.

We propose a cross-MRF model class, consisting of a mixed spatial graphical model framework and a cross-MRF theory to address various challenges of highly multivariate large-scale spatial data collectively within one unified framework.

The core contribution of the cross-MRF theory is that it realises doubly conditional independence (CI) among both p variates and n spatial locations, see here.

We achieved:

utmost sparsity in the joint precision matrix
lowest generation order of the joint precision matrix
asymmetric cross-correlation in the joint covariance matrix
scientific interpretability

[Comparative study results] [Models comparison] [Asymmetry and sparsity]

Scripts Contents

GPU folder: auto-correlation matrix and cross-correlation matrix plots
Figure folder: Sigma, Sigma_inv plots, p = 10, CI among p only; CAMS denoising plot
032c: Tst9c, 1D SG and SG_inv construction, Matern, CI among p only (non-cross-MRF), SpN + Reg, thres = 1e-3, reg_num = 1e-9 (Test construction joint Sigma and Sigma_inv; p = 6, n = 40; p = 10, n = 400, 600, 800; exact zero percentage CI among p only)
032d: 1D simulation plots functions for non-cross-MRF, C.I. among p only; (Plot joint Sigma and Sigma_inv; p = 6, n = 40)
032e: 1D SG, SG_inv construction, CI among p only (p = 10, n = 40, 400, 600, 800; elapsed system wall time, CI among p only)
032f: SG, SG_inv plots (p = 10)
034b: SG_inv construction, sparse percentage comparison among cross-MRF and non-cross-MRF for Tri-Wave and Wendland; (Percentage of exact-zero entries; elapsed system wall time cross-MRF)
034c: Tst10c, 1D SG_inv construction, cross-MRF, with SpNorm + Reg for b function, b can be chosen; (Joint Sigma_inv plots, p = 6)
034e: CI among n only, Mardia 1988; p =10, n = 400, 600, 800, 1000 (Exact-zero percentage; fully-connected graph reach memory limit)
037: 100 randomly evaluated Sigma_inv generation microbenchmark; (Sigma and Sigma_inv generation time)
046b: generate 1D true processes and noisy data, Tri-Wave and Wendland
046c: generate 2D true processes and noisy data, Tri-Wave and Wendland - consistent relationship between sparsity in uni-SG_inv and joint SG_inv
047b: optimization, Tri-Wave, Tst10c (cross-MRF)
047c: optimization, Wendland, Tst10c (cross-MRF)
048b: co-krig, Tri-Wave, 1 fold C.V. results
048d: co-krig, Wendland, 1 fold C.V. results
049: neg_logL function of non-cross-MRF, TST9d
049b: optimization using 049, Tri-Wave, Wendland
055: 2D inference (neg_logL_2D, optim) for 6 fields in Fig12, Tri-Wave (converged), Wendland (converged)
056: 2D cokrig (pure denoising)
057: Data processing, generate df_Res_log_16_sorted, sorted by Lon (asc), then by Lat (desc); 4 Lon strips
059: TST12 GPU version
060: GPU parallel + optim on 1 CPU
061: GPU parallel + optim on 4 CPUs
062: pure optim parallel on 51 CPUs, no GPU parallelisation
063: CAMS data processing
064a: CAMS data with 060, GPU parallel + optim on 1 CPU, Lon_Strip_1
064b: CAMS data with 060, GPU parallel + optim on 1 CPU, Lon_Strip_4
065a: CAMS one complete construction time for SG, and SG_inv, with GPU off-loading, df_Lon_Strip_1_Sort_new.rds;(real-world data illustration)
065b: CAMS data denoising
065c: Plot of CAMS 5
066a: CAMS one complete construction time for SG, and SG_inv, solo CPU, df_Lon_Strip_1_Sort_new.rds;(real-world data illustration)

Acknowledgements

Iain Steison recommended using optimParallel() for parallel L-BFGS-B optimization on the CPU.
David Llewellyn-Jones helped set up the HPC resource and answered lots of elementary questions regarding Baskerville HPC.
Ryan Chan reminded XC that traditional R code will not automatically utilize GPU resources even when run on HPC.

Name		Name	Last commit message	Last commit date
Latest commit History 859 Commits
Assets		Assets
Data		Data
Figure		Figure
GPU		GPU
.gitignore		.gitignore
001_Recap_24May.R		001_Recap_24May.R
002_Map_Cmpts_Residuals.R		002_Map_Cmpts_Residuals.R
003_MvCon_Stationary_Modelling.R		003_MvCon_Stationary_Modelling.R
004_Lon_Lat_Global.R		004_Lon_Lat_Global.R
005_1-D_simu_OLD.R		005_1-D_simu_OLD.R
005_1-D_simulation.R		005_1-D_simulation.R
006_1-D_simulation_mix.R		006_1-D_simulation_mix.R
007_1-D_simulation_bdsparse_compare.R		007_1-D_simulation_bdsparse_compare.R
008_1D_simu_NEW_algo.R		008_1D_simu_NEW_algo.R
009_1D_simu_NEW_algo_SIGMA_inv.R		009_1D_simu_NEW_algo_SIGMA_inv.R
010_Visualise_Wave_v5_3D_1D.R		010_Visualise_Wave_v5_3D_1D.R
010_Visualise_Waves_3D_1D.R		010_Visualise_Waves_3D_1D.R
011_Investigate_pertub_on_wave_v4.R		011_Investigate_pertub_on_wave_v4.R
012_Investigate_pd_SGInv_TST2_Wave_v5_small_pertub.R		012_Investigate_pd_SGInv_TST2_Wave_v5_small_pertub.R
013_Compare_wave_v5_v6.R		013_Compare_wave_v5_v6.R
014_Investigate_p.d._condition_SIGMA_inv_Haville.R		014_Investigate_p.d._condition_SIGMA_inv_Haville.R
015_Test_Fn_Pert_Mat.R		015_Test_Fn_Pert_Mat.R
016_Ranges_4_waves.R		016_Ranges_4_waves.R
016_Test_col_rank_B.R		016_Test_col_rank_B.R
017_pre-pert_SG_inv_each_run.R		017_pre-pert_SG_inv_each_run.R
018_Compare_Cond_Numb_wave_v4_v5.R		018_Compare_Cond_Numb_wave_v4_v5.R
018_Compare_Cond_Numb_waves.R		018_Compare_Cond_Numb_waves.R
019_Tst_wave_v5_different_graphs.R		019_Tst_wave_v5_different_graphs.R
020_Understand_Wendland.R		020_Understand_Wendland.R
021_Test_p.d_SIGMA_inv_Wendland.R		021_Test_p.d_SIGMA_inv_Wendland.R
022_1D_simu_plots_SG_SGInv_7_fields.R		022_1D_simu_plots_SG_SGInv_7_fields.R
023_Understand_Ricker_Wave.R		023_Understand_Ricker_Wave.R
025_1D_simu_plt_6_fileds.R		025_1D_simu_plt_6_fileds.R
026_invest_strategy_lower_B_CN.R		026_invest_strategy_lower_B_CN.R
027_Tst_5&7fields_lower_CN_B.R		027_Tst_5&7fields_lower_CN_B.R
028_Tst_original_B_5_fileds.R		028_Tst_original_B_5_fileds.R
029_Tst_7fields_different_waves_wendland_B_spN.R		029_Tst_7fields_different_waves_wendland_B_spN.R
030_Tst_differentB_ds01_D[-2,2].R		030_Tst_differentB_ds01_D[-2,2].R
031_Tst_differentB_ds_D[-10,10].R		031_Tst_differentB_ds_D[-10,10].R
032_1D_simu_plt_SpN_6_fileds.R		032_1D_simu_plt_SpN_6_fileds.R
032b_1D_simu_SpN_6_filds_Threshold.R		032b_1D_simu_SpN_6_filds_Threshold.R
032c_1D_simu_SpN_6_field_Reg_Thres.R		032c_1D_simu_SpN_6_field_Reg_Thres.R
032d_1D_simu_plts_Orig_SpNReg_6_fields.R		032d_1D_simu_plts_Orig_SpNReg_6_fields.R
032e_SG_SG_inv_p=10.R		032e_SG_SG_inv_p=10.R
032f_SG_SG_inv_plots_p10.R		032f_SG_SG_inv_plots_p10.R
033_UniCAR.R		033_UniCAR.R
034_1D_simu_SG_SG_inv_UniCAR.R		034_1D_simu_SG_SG_inv_UniCAR.R
034b_1D_simu_CAR_SpNReg_Thres_sparse_percentage.R		034b_1D_simu_CAR_SpNReg_Thres_sparse_percentage.R
034c_1D_simu_CAR_SpNReg_Thres_b_choice.R		034c_1D_simu_CAR_SpNReg_Thres_b_choice.R
034d_investigate_dlt_upperbd.R		034d_investigate_dlt_upperbd.R
034e_1D_simu_CAR_SpNReg_Thres_CI_n_Mardia.R		034e_1D_simu_CAR_SpNReg_Thres_CI_n_Mardia.R
035_1D_simu_Matern_Chain_6F.R		035_1D_simu_Matern_Chain_6F.R
036_1D_simu_CAR_Chain_6F.R		036_1D_simu_CAR_Chain_6F.R
037_microbenchmark_MaternChain_UniCARChain.R		037_microbenchmark_MaternChain_UniCARChain.R
038_Uni_Wendland_Tapper.R		038_Uni_Wendland_Tapper.R
039_1D_simu_Taper_Matern.R		039_1D_simu_Taper_Matern.R
040_mat2vec_&_vec2mat_.R		040_mat2vec_&_vec2mat_.R
041_inference_functions_1D.R		041_inference_functions_1D.R
042_Tst_Fit_Obs_indx.R		042_Tst_Fit_Obs_indx.R
043_generation_true_process_noisy_data_3fields.R		043_generation_true_process_noisy_data_3fields.R
044_inference_1D_optim_neg-logL_cokrig.R		044_inference_1D_optim_neg-logL_cokrig.R
045_Matern_1D_3_fields_4folds_CV.R		045_Matern_1D_3_fields_4folds_CV.R
046_generation_true_process_noisy_data_6_fields.R		046_generation_true_process_noisy_data_6_fields.R
046b_generation_True_Y_noisy_data_Tst10c.R		046b_generation_True_Y_noisy_data_Tst10c.R
046c_generation_True_Y_noisy_data_TST12_2D.R		046c_generation_True_Y_noisy_data_TST12_2D.R
046d_generation_True_Y_noisy_data_CAMS_datastr.R		046d_generation_True_Y_noisy_data_CAMS_datastr.R
047_1D_inference_6_fields.R		047_1D_inference_6_fields.R
047b_1D_Inf_6_Fig12_TST10c_TriW.R		047b_1D_Inf_6_Fig12_TST10c_TriW.R
047c_1D_inf_Fig12_Tst10c_WL.R		047c_1D_inf_Fig12_Tst10c_WL.R
047d_invst_1D_inf_WL_pars_issue.R		047d_invst_1D_inf_WL_pars_issue.R
048_cokrig_Tst10c_WL.R		048_cokrig_Tst10c_WL.R
048_cokrig_formula_Tst10c_bchoice.R		048_cokrig_formula_Tst10c_bchoice.R
048b_cokrig_TW_1_fold_CV.R		048b_cokrig_TW_1_fold_CV.R
048c_4_folds_CV_TW.R		048c_4_folds_CV_TW.R
048c_cokrig_TW_4_folds_CV.R		048c_cokrig_TW_4_folds_CV.R
048d_cokrig_WL_1_fold_CV.R		048d_cokrig_WL_1_fold_CV.R
048d_cokrig_WL_4_folds_CV.R		048d_cokrig_WL_4_folds_CV.R
049_ID_Inf_Fig12_TST9d_neg_logL_Matern.R		049_ID_Inf_Fig12_TST9d_neg_logL_Matern.R
049b_ID_Inf_Fig12_TST9d_TW_&_WL.R		049b_ID_Inf_Fig12_TST9d_TW_&_WL.R
050_2D_coords_displacement.R		050_2D_coords_displacement.R
051_2D_shifted_distance_matrix.R		051_2D_shifted_distance_matrix.R
052_2D_Wendland_Tri-Wave_fns.R		052_2D_Wendland_Tri-Wave_fns.R
053_Euclidean_dist_vs_magnitude_dsp.R		053_Euclidean_dist_vs_magnitude_dsp.R
054_2D_simu_CAR_6_TST12.R		054_2D_simu_CAR_6_TST12.R
055_2D_Inf_neg_logL_CAR_2D.R		055_2D_Inf_neg_logL_CAR_2D.R
056_cokrig_2D.R		056_cokrig_2D.R
057_Data_process_df_Res_log_16_sorted.R		057_Data_process_df_Res_log_16_sorted.R
058_CAMS_TST12_construct.R		058_CAMS_TST12_construct.R
059_TST12_SG_SG_inv_GPU.R		059_TST12_SG_SG_inv_GPU.R
060_2D_Inf_neg_logL_CAR_GPU.R		060_2D_Inf_neg_logL_CAR_GPU.R
061_2D_Inf_neg_logL_CAR_GPU_CPU.R		061_2D_Inf_neg_logL_CAR_GPU_CPU.R
062_Pure_CPUs.R		062_Pure_CPUs.R
063_Data_process_df_Lon_strips.R		063_Data_process_df_Lon_strips.R
064a_Optm_GPU_Lon_Strip_1.R		064a_Optm_GPU_Lon_Strip_1.R
064b_Denoising_CAMS_5.R		064b_Denoising_CAMS_5.R
064c_plot_CAMS		064c_plot_CAMS
065a_One_complete_construct_SG_SGinv_GPU_Lon_strip1.R		065a_One_complete_construct_SG_SGinv_GPU_Lon_strip1.R
065b_One_complete_construct_SG_SGinv_GPU_Lon_strip4.R		065b_One_complete_construct_SG_SGinv_GPU_Lon_strip4.R
065c_One_complete_construct_SG_SGinv_GPU_Lon_strip32.R		065c_One_complete_construct_SG_SGinv_GPU_Lon_strip32.R
065d_One_complete_construct_SG_SGinv_GPU_Lon_strip2.R		065d_One_complete_construct_SG_SGinv_GPU_Lon_strip2.R

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Highly Multivariate Large-scale Spatial Stochastic Processes -- A Cross-Markov Random Field Approach

Scripts Contents

Acknowledgements

About

Releases

Packages

Contributors 2

Languages

License

xc308/XC_Work

Folders and files

Latest commit

History

Repository files navigation

Highly Multivariate Large-scale Spatial Stochastic Processes -- A Cross-Markov Random Field Approach

Scripts Contents

Acknowledgements

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages