Python cmorizers for CDR1 and CDR2 ESACCI H2O (TCWV=prw) data. #2152
The documentation update is not ready yet. I'm not sure how to include it in the documentation, because the data is not public now but should become public at some point.
@remi-kazeroni I don't really know why GitHub added you here automatically: either because it is a cmorizer or because I left your name somewhere in the code :)
This is because I'm codeowner for everything in |
axel-lauer
left a comment
I just tested the new CMORizers and have a few comments:
- Both CMORizers complain about "The reference file .../esmvaltool/references/.bibtex does not exist.". I guess this is because "reference" in the config files is set to ""?
- I would strongly prefer that the dataset name used is "ESACCI-WATERVAPOUR" or maybe "ESACCI-WV". I find "ESACCI-TCWV" and "ESACCI-TCWV-CDR1" confusing and not consistent with the naming used for the other ESACCI datasets.
- For the same reasons, I would prefer a single CMORizer for ESACCI-WATERVAPOUR instead of separate scripts for each of the 2 (eventually 4) CDRs.
- I only tested with monthly data. For those, the output time_bounds seem to be wrong, e.g. giving 15-Jan-2014 00:00:00 --> 16-Jan-2014 00:00:00 instead of 1-Jan-2014 00:00:00 --> 1-Feb-2014 00:00:00. This needs to be fixed.
- The documentation on how to download and prepare the data (e.g. putting all files into the same folder, i.e. no yearly folders) is still missing.
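The `.bibtex` message in the first point can be reproduced with a minimal sketch. It assumes, hypothetically, that the reference file name is built by appending `.bibtex` to the `reference` tag from the config file; `reference_path` and the tag `esacci-watervapour` are illustrative names, not ESMValTool's actual code.

```python
from pathlib import PurePosixPath


def reference_path(references_dir, tag):
    """Hypothetical helper: path of the BibTeX file for a reference tag."""
    return PurePosixPath(references_dir) / f"{tag}.bibtex"


# An empty tag yields a file literally named ".bibtex", which does not exist.
empty = reference_path("esmvaltool/references", "")

# A non-empty (here: made-up) tag yields a regular file name.
named = reference_path("esmvaltool/references", "esacci-watervapour")

print(empty)
print(named)
```

If this assumption holds, setting a non-empty reference tag and adding the matching BibTeX file should make the message disappear.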
Time bounds and naming should be fixed now. I also added some documentation on how to download and prepare the data. I'm not sure about the reference: I put in the Water Vapour Climate Change Initiative (WV_cci) - CCI+ Phase 1 Climate Research Data Package (CRDP) document (https://climate.esa.int/en/projects/water-vapour/key-documents/), but it has no DOI, only an ESA-specific identification number.
Further changes would be necessary for daily data. Should I make them first, or should we merge the version for monthly data first? The public data are not available yet and are expected in August or September, but maybe there are other reasons to merge it soon?
axel-lauer
left a comment
Thanks for applying all the changes. The output fields look good, but the time bounds still seem a bit odd. For example, the time bounds of the first time step in the year 2003 file look like:
30-Dec-2002 12:00:00 -> 30-Jan-2003 12:00:00
but should probably rather be something like
1-Jan-2003 00:00:00 -> 31-Jan-2003 00:00:00
Could you please take another look at the time bounds?
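For illustration, monthly bounds of the kind requested here can be derived from a time point as in the following minimal sketch. It assumes that each time point lies somewhere inside its calendar month and that the bounds should run from the first of that month to the first of the next month (the convention from the earlier review, 1-Jan-2014 to 1-Feb-2014). `monthly_bounds` is a hypothetical helper using only the standard library, not the code used in the cmorizer.

```python
from datetime import datetime


def monthly_bounds(point):
    """Return (start, end) of the calendar month that contains `point`."""
    start = datetime(point.year, point.month, 1)
    if point.month == 12:
        # December rolls over into January of the following year.
        end = datetime(point.year + 1, 1, 1)
    else:
        end = datetime(point.year, point.month + 1, 1)
    return start, end


# A mid-month time point in January 2003 (assumed value for illustration).
lower, upper = monthly_bounds(datetime(2003, 1, 15, 12))
print(lower, "->", upper)
```

Bounds built this way are contiguous from one month to the next, independent of where inside the month the time point sits.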
@axel-lauer Thanks for spotting this: Therefore I'm wondering:
I have now fixed the bounds for this cmorizer using _get_time_bounds from esmvalcore.cmor.check (without changing esmvaltool/cmorizers/obs/utilities.py, which is probably used by other cmorizers). There are many Python cmorizers using fix_bounds; should I open an issue about it? I also don't know what the commit check wants to tell me.
axel-lauer
left a comment
The output including the time bounds looks good now. Not sure about the import error reported by CircleCI. @ESMValGroup/tech-reviewers any opinion on this?
maybe merge |
valeriupredoi
left a comment
looks good technically, cheers @katjaweigel -> @remi-kazeroni pls test when you gots time, mate 🍺
raw_info = {'name': vals['raw'], 'file': vals['file']}
inpfile = os.path.join(in_dir, cfg['filename'])
logger.info("CMORizing var %s from file type %s", var, inpfile)
# years = range(vals['start_year'], vals['end_year'] + 1)
please remove commented-out code if it is not needed anymore
also please make sure all boxes are ticked and the ones that are irrelevant pls remove or strike them |
I wasn't sure whether I have to tick them if I think they are ok, or whether the reviewers have to do that after their check.
afraid it is your responsibility to tick/delete boxes, but if you are unsure you can always ask the technical reviewer 👍
@valeriupredoi ok, thanks! I did that for PRs before the "Technical review" and "Scientific review" symbols were added; those symbols seemed to indicate to me that it should be done by the reviewers.
remi-kazeroni
left a comment
Thanks @katjaweigel! The cmorizer runs fine for me with the monthly data available and the recipe_check_obs.yml as well. Please let me know when the data can be put into the Mistral and Jasmin Obs directories.
@remi-kazeroni the public release of the data is planned for September, so I guess it is better not to put them into the Obs folders yet?
For this reason, we put the data into tier 3 until they become publicly available. Then we would move the data to tier 2 just like all other ESACCI datasets. So in short, I think it is fine to put the data into the OBS folder. |
Fine, so the data are now in the Tier 3 OBS folder. We will need to update the cmorizer once the data become publicly available.
Description
Cmorizer for the new ESACCI H2O (TCWV=prw) CDR-1 and CDR-2 data sets.
It has been tested for 0.5 and 0.05deg data.
So far, it only works without ancillary data; support for these will be added later.
Checklist
It is the responsibility of the author to make sure the pull request is ready to review. The icons indicate whether the item will be subject to the 🛠 Technical or 🧪 Scientific review.
- Any changed dependencies have been added or removed correctly
New or updated recipe/diagnostic
- Recipe runs successfully
- Recipe is well documented
- Figure(s) and data look as expected from literature
- Provenance information has been added
New or updated data reformatting script