
Add the block-based SDA workflow. #3197

Merged: 62 commits merged into PecanProject:develop on Nov 24, 2023

Conversation

@DongchenZ (Contributor) commented Jul 13, 2023

Description

This PR adds a highly modularized block-based SDA workflow with comprehensive try-catch error handling, which will ease user-end debugging. The workflow is also highly parallelized, which substantially improves speed.

Motivation and Context

The multisite SDA workflow has been problematic because of the high-dimensional Bayesian sampling and its speed. The block-based SDA workflow lets users keep the multi-site structure while also preserving the covariance structure. The Wishart case of the MCMC sampling had been problematic due to changes in observations; this is now solved by employing a global aqq defined by the number of state variables.
Beyond that, this PR includes the following updates:

  1. Allow pre-existing aqq & bqq to be imported from outside.
  2. Allow a pre-specified prior of Q to be imported from outside.
  3. Enabled MCMC sampling with NA observations, to help estimate the process variance for a free run.
  4. Updated the qsub_parallel function and made it more robust.
  5. Fixed a bug in the edge case of a single site with a single observation.
  6. Fixed a bug in the prepare_pools function, where nc_open calls had no paired nc_close and on.exit().

Review Time Estimate

  • Immediately
  • Within one week
  • When possible

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)

Checklist:

  • My change requires a change to the documentation.
  • My name is in the list of CITATION.cff
  • I have updated the CHANGELOG.md.
  • I have updated the documentation accordingly.
  • I have read the CONTRIBUTING document.
  • I have added tests to cover my changes.
  • All new and existing tests passed.

@DongchenZ DongchenZ requested a review from mdietze July 13, 2023 18:43
Collaborator:

Shall we add "aqq.Init" and "bqq.Init" under the tag "<state.data.assimilation>"?

Contributor Author:

Fixed. You can now add those two arguments inside this script.

Profiling = FALSE,
OutlierDetection=FALSE,
parallel_qsub = TRUE,
free_run = FALSE,
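For context, these flags appear to be entries of a control list; a hedged sketch of how they might be passed to the multisite SDA entry point (function and argument names taken from elsewhere in this PR, exact signature not verified):

```r
# Sketch only: control flags passed to sda.enkf.multisite, together with
# the optional pre-existing enkf.params and ensemble.samples arguments
# documented later in this PR (both default to NULL).
sda.enkf.multisite(settings, obs.mean, obs.cov,
                   control = list(Profiling = FALSE,
                                  OutlierDetection = FALSE,
                                  parallel_qsub = TRUE,
                                  free_run = FALSE),
                   pre_enkf_params = NULL,
                   ensemble.samples = NULL)
```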
Collaborator:

So we probably need to remove "free_run" here? And "SDA_runner.R" should be updated too.

Contributor Author:

Fixed.

base/remote/R/qsub_parallel.R (review thread resolved)

* pre_enkf_params - (optional) Used for carrying out SDA with pre-existed enkf.params, in which the Pf, aqq, and bqq can be used for the analysis step. Defualt is NULL.

* ensemble.samples - (optional) Pass ensemble.samples from outside to avoid GitHub check issues. Defualt is NULL.
Member:

  1. check spelling on "default" throughout.
  2. explain what ensemble.samples can be used for productively! This argument doesn't just exist to avoid a github check. Indeed, there's a legitimate question about whether pre_enkf_params and ensemble.samples should just be part of the restart list rather than their own arguments (e.g. are there scenarios where you'd use them outside of a restart run?)

Contributor Author:

  1. Fixed.
  2. Fixed.

#### **sda.enkf.multisite.R Arguments**
* settings - (required) [State Data Assimilation Tags Example] settings object

* obs.mean - (required) List of sites named by site ids, which contains dataframe for observation means, named with observation datetime.
Member:

I don't know if it's documented somewhere else, but this explanation of how obs.mean and obs.cov have to be structured is clearly insufficient. Also, lets make sure the documentation here matches the ROxygen documentation of the function itself.

Contributor Author:

Added documentation about why we need observations, how to generate them, and what their basic formats are.
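For reference, a minimal sketch of the assumed obs.mean / obs.cov structure, with the nesting order and numbers taken from the example printed in this PR's documentation diff; the state-variable names AbvGrndWood and LAI are placeholders:

```r
# Sketch only (variable names are placeholders): observation lists
# nested as datetime -> site id, matching the printed example.
obs.mean <- list(
  "2010/12/31" = list(
    "1000000651" = data.frame(AbvGrndWood = 15.28, LAI = 0.51)
  )
)
obs.cov <- list(
  "2010/12/31" = list(
    "1000000651" = matrix(c(15.2821691, 0.1213583,    # column 1
                            0.513584319, 0.001162113), # column 2
                          nrow = 2)
  )
)
```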


* control - (optional) List of flags controlling the behaviour of the SDA. Default is as follows:
Member:

If the list below isn't documented here, point the reader to where they can find an explanation of what all the flags below do (e.g. make sure they're documented in the function's Roxygen

Contributor Author:

Fixed.

@@ -324,34 +363,124 @@ $`2010/12/31`$`1000000651`
[1,] 15.2821691 0.513584319
[2,] 0.1213583 0.001162113
```
An example of multi-settings pecan xml file also may look like below:
#### Analysis SDA workflow
Before running the SDA analysis functions, the ensemble forecast results have to be generated, and arguments such as H matrix, MCMC arguments, and multi-site Y and R (by `Construct.R` function) have to be generated as well. If you want to proceed the block-based SDA workflow for a free run (estimate the process error to the free run), you need to first specify the `state.data.assimilation` as follows. Beyond that, if any of your sites has zero observation, you should flag the `by.site` as TRUE.
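The XML block that "as follows" refers to did not survive extraction; a hedged sketch using the free.run and by.site tags discussed in this thread (values illustrative, other tags omitted):

```xml
<!-- Sketch only: flags for a free run under state.data.assimilation -->
<state.data.assimilation>
  <free.run>TRUE</free.run>
  <by.site>TRUE</by.site>
</state.data.assimilation>
```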
Member:

The sentence "If you want to proceed the block-based SDA workflow for a free run (estimate the process error to the free run)" is confusing since you haven't said what the block-block based SDA is, what a free run is, or what each of the tags below actually does. e.g. is by.site what turns on the block-based version? And if so, why isn't it called by.block and why would you want it turned on if sites have zero observations? What does turning on the free.run tag cause to happen?

Contributor Author:

Fixed.

##' @param pft.path physical path to the pft.csv file.
##' @param by criteria, it supports by variable, site, pft, all, and single Q.
##'
##' @description This function is an upgrade to the Construct.H.multisite function which provides the index by different criteria.
Member:

If it's an upgrade, why not just fix the previous function? The whole point of a function is that if the inputs and outputs are defined then anything internal should be malleable without affecting the rest of the code.

Contributor Author:

I haven't cleaned up this function. Only partial features work now (but this is enough for the current SDA workflow). I will make another PR to build the H matrix under different situations.

#I think the blocked nimble has to be implemented and used instead of a long vector sampling.
#1) due to the convergence of X.mod.
#2) temporal efficiency. MVN sampling over a large cov matrix can be time consuming.
#4) this data structure design allows us to implement the MCMC sampling parallely.
Member:

Is this model actually different or is it just called by block instead of overall? I'm not sure I 100% understand why we need a new nimble model.

Contributor Author:

Fixed.

@@ -19,6 +19,21 @@ SDA_OBS_Assembler <- function(settings){
#extract Obs_Prep object from settings.
Obs_Prep <- settings$state.data.assimilation$Obs_Prep

#check if we want to proceed the free run without any observations.
if (as.logical(settings$state.data.assimilation$free.run)) {
#calcualte time points.
Member:

calculate

Contributor Author:

Fixed.

@@ -362,7 +355,7 @@ sda.enkf.multisite <- function(settings,
)
})
###------------------------------------------------------------------------------------------------###
### loop over time ###
### w over time ###
Member:

not sure I'm following the change in comment

Contributor Author:

Fixed.

@@ -20,6 +20,8 @@ pool_ic_netcdf2list <- function(nc.path){
for(varname in names(vals)){
vals[[varname]] <- ncdf4::ncvar_get(IC.nc,varname)
}
ncdf4::nc_close(IC.nc)
on.exit()
Member:

These two lines should be combined into one line and moved up to right after the nc_open. That ensures that the nc file is closed when the function exits, even if the function exits with an error. See for example

on.exit(ncdf4::nc_close(nc), add = FALSE)
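Applied to the function in this diff, the suggested pattern might look like this (a sketch; the initialization of vals is abridged relative to the actual function):

```r
# Sketch of the suggested fix: register nc_close via on.exit right
# after nc_open, so the file is closed when the function exits,
# even if a later read throws an error.
pool_ic_netcdf2list <- function(nc.path){
  IC.nc <- ncdf4::nc_open(nc.path)
  on.exit(ncdf4::nc_close(IC.nc), add = FALSE)
  vals <- list()
  for (varname in names(IC.nc$var)) {
    vals[[varname]] <- ncdf4::ncvar_get(IC.nc, varname)
  }
  vals
}
```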

Contributor Author:

Fixed.

@mdietze mdietze added this pull request to the merge queue Nov 24, 2023
Merged via the queue into PecanProject:develop with commit 824bcfa Nov 24, 2023
10 of 12 checks passed
3 participants