-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RC (Register Contents) for ARMBE #304
Comments
@gleckler1 hello Peter, I'm trying to fill in the issue, and asked Shaocheng and Cheng @EmmyChengTao for a review. For items 6 and 7 that we don't have a clear idea, since point in-situ measurements were not described in the obs4MIPs docs. Please advice.. |
@chengzhuzhang @EmmyChengTao dod_version = "armbeatm-c1-1.8... we could use this, to create something like: ARB-BE-c1-1-8 ('.' replaced with '-' because '.' can't be used in Source_id). |
@gleckler1 Hi Peter, thanks for your future guidance. Sorry for the delayed response. I did some research on the "dod_version", ARM has a whole documentation about convention as well. For finalizing the version number in our case, since the source_name already as "ARMBE", we can perhaps just use "atm-c1-1_8" as 'source_version_number'. One complication I want to bring up is that, there are subsets of ARMBE datasets: ATM (atmosphere), CLDRAD (cloud and radiation), LND (land) groups. In this case can we use "atm-c1-1_8", "cldrad-c1-1_8", etc. to distinguish them? Or should we register with more datasets? In the mean time, @EmmyChengTao helped to prepare the variable list, Variables_ARMBE_CMIP_CT_ZY.xlsx, which can be refereed to later. |
@chengzhuzhang thanks for pushing on this. The point source data is indeed a new category to consider, we have time point (temporal index) that has been considered before, but the single location is not something that has been captured before. The region is also not well accounted for, as obs4MIPs-cmor-tables/obs4MIPs_region.json has dealt with regions defined by CF, which doesn't allow for great plains ARM data - this will require some thinking |
Perhaps a single location can be treated similarly to the CFMIP "site" data (which was ~100 locations). The site dimension would be 1 if only a single dimension. |
@chengzhuzhang @taylor13 @durack1. Since the goal of obs4MIPs it to be technically aligned with CMIP, I still think following the CFMIP site example is defensible. However, if there is now (retrospectively) a much better way to describe site data (e.g., CFMIP will modify it for CMIP7) the path forward is less clear. But that (defining changes) could take years so again a defensible way forward (for now) may be to latch on to what was done in CMIP6. Regarding the version number, which we do need to resolve to get started, obs4MIPs does not strive to enforce a uniform template for version numbers, i.e., how the versions are defined can be dataset dependent. So atm-c1-1_8" and "cldrad-c1-1_8" are acceptable choices. But the aspiration is that for a given version we will be able to point the files from which they were constructed. |
@chengzhuzhang. When you have a chance, can you check to see if you have permissions to create a branch of this repo? |
thank you @gleckler1 , I cloned the repo and created a branch locally, but it doesn't seem that I have permission to push the branch to this repo. I assume that I may need to be added to write to this repo. |
@chengzhuzhang I thought that might be the case - thanks for trying. We looking for the best path to relax pushing branches. I'll get back to you soon on this. |
I would suggest against using the |
@durack1 agreed with swap for "_" I see ARMBE variable list now. |
Thanks! I will use "_" instead, and will extract the variables from the spreadsheet to include in my branch. |
Sorry to be clear, I was suggesting using "-"/hyphen rather than "_"/underscore in the source_id |
Oh, i misspoken. I will use hypen instead. Thank you for confirming. |
Updates equivalent CMIP variable names here: It seems |
It hasn't yet been decided on how to include in a CMIP7 filename the sampling-Interval and data-Region, but I've proposed a template in section 6 (pg. 14) of this document . For CMIP7, "region" is almost always "global" (glb), so it seems less than ideal that this should be indicated in every file, but for easy construction and parsing of file names, I think we need to include it. Note in the referenced document the differences in filenames between CMIP6 and in the CMIP7 proposal. A major change is that "outname" + "table_id" get replaced by the branded-variable name (or possibly by "outname" + "branding suffix"; again no final decision has been made). Note also that CORDEX must also identify "region" in its file names. |
Another option to consider: the grid_label needs rethinking, so it might be possible to specify the region of a single site by a special text string put there (e.g., instead of gn, gr1, gr2, etc., you might have s-sgp (site: Southern Great Plains)) I vaguely recall there was some other grid format being considered for obs4MIPs, so we need to think about this. |
@chengzhuzhang I'm trying to get over some technical setbacks and will get back to you soon regarding next steps for prototyping processing of ARBE data. I think it will be helpful for us to have the processing set up so that we can try out different options and think about them, including perhaps getting Sasha's help to test ESGF publication at some point. |
@taylor13 Thank you. To use grid_label for including site information makes good sense to me... |
@chengzhuzhang Sorry for the delay. Your permission to write is now pending, so soon you should be able to upload a test branch. I'll available to talk about what to do with the branch once its in the repo. ARMBE metadata now in a PR. Soon we'll be able to move onto thinking about how to incorporate insitu data. This will involve the , <source_type>, the possibility of a spatial coordinate, etc. |
Hi @gleckler1 thank you for working on this PR! The initial entry for ARMBE looks great. Yes! I think next we will need to sort out how to incorporate region related specs.. Let me know if there are anything I can do for testing.. P.S. I'm officially a collaborator now on this repo! |
hello Peter @gleckler1 I made some minor updates to the json file that describes ARMBE data, in the Pull Request here: #321 |
Hi Jill,
I spoke with Karl and Paul Durack about this at length yesterday. I have a meeting at 11AM, but do you want to have a quick chat now? If not, I have some openings later today.
P
From: Jill Chengzhu Zhang ***@***.***>
Date: Thursday, February 8, 2024 at 10:30 AM
To: PCMDI/obs4MIPs-cmor-tables ***@***.***>
Cc: Gleckler, Peter John ***@***.***>, Mention ***@***.***>
Subject: Re: [PCMDI/obs4MIPs-cmor-tables] RC (Register Contents) for ARMBE (Issue #304)
hello Peter @gleckler1<https://urldefense.us/v3/__https:/github.com/gleckler1__;!!G2kpM7uM-TzIFchu!wI0aTgUY5E4jN33-13lN4YAJVsnmCrJ3UjoidvpC3wp475TY6Ni2V95wCi8B67-aXIt7pjczJ6X3MPDPqkU2jk-Uu68$> I made some minor updates to the json file that describes ARMBE data, in the Pull Request here: #321<https://urldefense.us/v3/__https:/github.com/PCMDI/obs4MIPs-cmor-tables/pull/321__;!!G2kpM7uM-TzIFchu!wI0aTgUY5E4jN33-13lN4YAJVsnmCrJ3UjoidvpC3wp475TY6Ni2V95wCi8B67-aXIt7pjczJ6X3MPDPqkU2SlMZofc$>
Though there still some parameters that are specific to in-situ data we need to iron out. For instance, where to list the ARM site name, should we include in source_id or other field. At this point, I'm sure I won't be able to test my run script and json file with CMOR. Not sure what is the next step to take here..
—
Reply to this email directly, view it on GitHub<https://urldefense.us/v3/__https:/github.com/PCMDI/obs4MIPs-cmor-tables/issues/304*issuecomment-1934710748__;Iw!!G2kpM7uM-TzIFchu!wI0aTgUY5E4jN33-13lN4YAJVsnmCrJ3UjoidvpC3wp475TY6Ni2V95wCi8B67-aXIt7pjczJ6X3MPDPqkU2qoAGJAE$>, or unsubscribe<https://urldefense.us/v3/__https:/github.com/notifications/unsubscribe-auth/ABCXVLLEQDWAWTZ6PUXEG5DYSUKVRAVCNFSM6AAAAAA6H6AHJSVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTSMZUG4YTANZUHA__;!!G2kpM7uM-TzIFchu!wI0aTgUY5E4jN33-13lN4YAJVsnmCrJ3UjoidvpC3wp475TY6Ni2V95wCi8B67-aXIt7pjczJ6X3MPDPqkU2J6Uvx4M$>.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
@gleckler1 thanks for the meeting! As discussed, I'm providing the initial list of Cfsites information that we also provide data in ARMBE.
Attached also a copy of the cmip6 cfsites locations information. Among the complete list, there are additional sites that ARMBE provide data, but we can identify at a later time. Below is the ARM site acronyms, which were embedded in the ARMBE data stream name, e.g. sgparmbeatmC1.c1.20200101.003000.nc.
|
@chengzhuzhang ok that last bit should help us prototype! Thank you. I did not mention, we'll try using "gn" in the filename... I hope to try that out in the days ahead... See "grid label" in Table1 and filename template near the end of this document: https://pcmdi.github.io/obs4MIPs/docs/ODSv2.5-DRAFT.pdf |
@gleckler1 thanks! Please don't hesitate to let me know if anything I can help to work out this prototype! |
@gleckler1 following up our discussion yesterday, I made some code change (which is based on instruction from @taylor13 at PCMDI/cmor#728). The python script is now working with single lat/lon included in the file (code change in d6c7b95). The metadata is shown as follows:
|
The next step is to work on "region". If we chose to include the lat/lon value, I think region is less important. Yesterday, we also realized, it is also not a search facet currently supported by ESGF MetaGrid, and it seems to just serve as a global attribute. My proposal is to have the site name as part of the source_id of ARM data, the rational is that for these site data, their available time periods, and the update frequencies are site specific. With each site has one data stream/source_id, it also makes easier to maintain the datasets. |
@chengzhuzhang Great progress! We definitely want to consider inclusion of coordinates in source_id as an option but there are lots of moving parts that need to be considered. "region" will be an important option for obs4MIPs moving foward. As a next step, lets consider a few test-case site specific source_ids. Here is a first guess: 'ARMBE-SGP-atm-c1-1-8' Maybe you can improve on these names or do you think these are ok? It's just a test run so we don't have to get it perfect yet. I think its reasonable consider two options, that 1) identifies the location only by the acronym SGP and 2) explicitly includes coordinates. Once we chosen a few test-case source_ids I'll run the script to add them so that CMOR recognizes them. |
@chengzhuzhang btw I've confirmed your option #2 runs. |
@gleckler1 Thank you for testing option 2! |
The following are required registered content (with example content for each item in bold). Please replace the example text below with your information to the right of the equal sign (DO NOT MAKE ANY CHANGES TO THE LEFT HAND SIDE OF THE EQUAL SIGN):
See note 14 and Appendix II of the obs4MIPs data specifications (https://goo.gl/jVZsQl) for more information regarding registered content, and feel free to ask questions!
The text was updated successfully, but these errors were encountered: