Best-practice for zarr format data collections on linux filesystems #2

Thomas-Moore-Creative · 2024-06-29T02:42:09Z

RE: the conversation about how to best use zarr on Gadi I have my own issue here that I've not yet had the chance to make progress on. write code to convert zarr collections to zarr-zipstore

While I don't yet have any direct experience with using zarr-ZipStore my understanding is it doesn't appear to effect performance much ( this needs to be tested ) and it solves the inode problem that cloud optimised formats like zarr have on linux filesystems. Normally zarr looks like "one file per chunk" to a linux FS which is typically a much larger inode footprint than netcdf.

See: https://zarr.readthedocs.io/en/stable/api/storage.html#zarr.storage.ZipStore

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Best-practice for zarr format data collections on linux filesystems #2

Best-practice for zarr format data collections on linux filesystems #2

Thomas-Moore-Creative commented Jun 29, 2024

Best-practice for zarr format data collections on linux filesystems #2

Best-practice for zarr format data collections on linux filesystems #2

Comments

Thomas-Moore-Creative commented Jun 29, 2024