You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I don't know if the spec here has a place for this, but I wonder if there are any opinions on the storage encoding for geo data.
I would expect the following to perform the best, but I have not done testing. It would be interesting if this repo could make some interesting measures of what decisions make the biggest difference.
FLOAT64 data is probably needed to preserve accuracy on lat/lon data
BYTE_STREAM_SPLIT will make this much more compressible, especially for the case that the data in a page has similar values
v2 pages, to separate the poorly compressible but compact repetition values from the compressed data
Zstd is probably the best compressor in terms of space/CPU tradeoff
For the case that somewhat lossy compression is allowable:
FLOAT32 (of course)
scale/offset (to be interpreted in the application layer)
Correct. I don't think there will be too many options for WKT/WKB columns, and any compressor should pick up interior repetitions in that case. I suppose DELTA_LEN encoding could be tried, but my hunch is, that it won't matter.
I don't know if the spec here has a place for this, but I wonder if there are any opinions on the storage encoding for geo data.
I would expect the following to perform the best, but I have not done testing. It would be interesting if this repo could make some interesting measures of what decisions make the biggest difference.
For the case that somewhat lossy compression is allowable:
( @kylebarron suggested this might be a good discussion to have)
The text was updated successfully, but these errors were encountered: