Enh/manage groups #66

bzah · 2024-01-16T13:10:40Z

Include documentation when adding new features.
Include new tests or update existing tests when applicable.

This PR fixes #64

This adds an optional group argument to open_ncml.
When present, only the given group of the ncml is read.
When absent, the root group '/' is read.
This mirrors the behavior of xarray's open_dataset.

In addition, group='*' flattens every group into a single dataset (somewhat preserving the current behavior).
Name conflicts between groups are resolved by appending an incrementing __n to the variable names, where n is a number.
An attribute group_path is also added to each variable, so that its original path can be retrieved once it has been flattened.
This can be useful to recreate the original structure.
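
To make the naming scheme concrete, here is a minimal pure-Python sketch of the flattening and of the reverse operation. It is not the code in xncml/parser.py: the nested dict layout, flatten_groups and regroup are hypothetical, and I assume the first occurrence of a name is kept as-is while later ones get __1, __2, and so on.

```python
import re
from collections import Counter


def flatten_groups(group, path="/", flat=None, seen=None):
    """Flatten nested groups into one mapping (hypothetical dict layout)."""
    flat = {} if flat is None else flat
    seen = Counter() if seen is None else seen
    for name, attrs in group.get("variables", {}).items():
        count = seen[name]  # number of earlier variables with the same name
        seen[name] += 1
        flat_name = name if count == 0 else f"{name}__{count}"
        # Record where the variable came from, as the PR does with group_path.
        flat[flat_name] = {**attrs, "group_path": path}
    for child_name, child in group.get("groups", {}).items():
        child_path = path.rstrip("/") + "/" + child_name
        flatten_groups(child, child_path, flat, seen)
    return flat


def regroup(flat):
    """Rebuild a {group_path: {variable: attrs}} mapping from the flat one."""
    groups = {}
    for flat_name, attrs in flat.items():
        # Assumption: a trailing __<digits> is always a conflict suffix.
        original = re.sub(r"__\d+$", "", flat_name)
        rest = {k: v for k, v in attrs.items() if k != "group_path"}
        groups.setdefault(attrs["group_path"], {})[original] = rest
    return groups


root = {
    "variables": {"temp": {"units": "K"}},
    "groups": {
        "a": {
            "variables": {"temp": {"units": "degC"}},
            "groups": {"b": {"variables": {"temp": {"units": "degF"}}}},
        },
    },
}

flat = flatten_groups(root)
print(sorted(flat))
print(regroup(flat))
```

Note that the sketch cannot tell a conflict suffix from a variable whose real name ends in __ followed by digits; group_path alone does not remove that ambiguity.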

When open_ncml is called with group="*", every group is read and flattened into the resulting dataset. If names conflict, __n is appended to the dimension and variable names, where n is the number of existing similar names.

bzah · 2024-01-16T15:23:02Z

Converting the PR to a draft, as I discovered that flattening is not working properly at the moment.

bzah · 2024-01-16T16:54:36Z

xncml/parser.py

@@ -579,7 +675,8 @@ def build_scalar_variable(var_name: str, values_tag: Values, var_type: str) -> x
            ' <values> is empty. Provide a single values within <values></values>'
            ' to preserve the type.'
        )
-        return xr.Variable(data=None, dims=())
+        default_value = nctype(var_type)()


I changed (again) the behavior of empty scalar parsing here.
For context, the ncml here describes a scalar of a certain type without providing any value. Ideally, we would create a placeholder for a scalar variable with a numpy type.
But numpy doesn't allow this for scalars; a typed empty array of a given shape can only be created as an ndarray.

I first thought it would be better to lose the type and create an empty scalar with the value None, but not having the type can break subsequent processing.
Now I think it's better to preserve the dtype and fill the scalar with the default value of this dtype.
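
To illustrate the trade-off outside the parser (a minimal sketch, not the actual code): calling a numpy scalar type with no argument returns that type's default value, so the dtype survives, whereas wrapping None falls back to the object dtype. I assume here that nctype maps an ncml type name to such a numpy scalar type; default_scalar is a hypothetical stand-in for it.

```python
import numpy as np


def default_scalar(dtype_name: str) -> np.generic:
    """Return the default value of a numpy scalar type (hypothetical helper)."""
    scalar_type = np.dtype(dtype_name).type  # e.g. np.float32
    return scalar_type()  # no argument: the type's zero-like default


# Wrapping None loses the declared type: the result has dtype object.
untyped = np.asarray(None)
print(untyped.dtype)

# Using the type's default keeps the declared dtype on a 0-d value.
for name in ("float32", "int16", "bool"):
    value = default_scalar(name)
    print(name, value, value.dtype, np.asarray(value).shape)
```

The cost is that the default is indistinguishable from a value that was really written in the ncml, which is why the warning just above the changed line still matters.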

I would appreciate comments/suggestions here.

Abel Aoun added 5 commits January 11, 2024 20:01

FIX: pre-commit hook for black

d5df915

ENH: Add group filtering to open_ncml

be6acbc

Add test for error

87aa891

DOC: document changes

3243a01

bzah marked this pull request as draft January 16, 2024 15:22

Abel Aoun added 2 commits January 16, 2024 17:15

FIX: scalar parsing and nested groups

5daba4a

remove dead code

1959e92

bzah commented Jan 16, 2024

Abel Aoun added 3 commits February 2, 2024 10:23

Improve generator

c32d437

ENH: Improve group parsing

ff807ef

- improved performance by parsing the ncml representation only once
- fixed issues of content being rewritten when read multiple times

Merge remote-tracking branch 'origin/main' into enh/manage_groups

00e9ed1

bzah marked this pull request as ready for review February 5, 2024 12:37

Clean

18a7c8b

bzah requested a review from huard February 5, 2024 12:45

huard approved these changes Feb 5, 2024

bzah and others added 2 commits February 5, 2024 18:10

Apply suggestions from code review

ae842fe

Co-authored-by: David Huard <[email protected]>

ENH: Improve parser following code review

dac5870

bzah merged commit dc7ab52 into main Feb 6, 2024
6 checks passed

bzah deleted the enh/manage_groups branch February 6, 2024 08:53
