Predicate-size uid partitioning #65

EnricoMi · 2020-11-16T18:15:21Z

Given a predicate partitioning, we can further partition each partition orthogonally by uids. Some partitions may contain more rows than others. By splitting large partitions into more parts than smaller ones, we can achieve a more even final partitioning.

The zero service provides predicate size statistics. With these, we can compute the size of each partition. This does not reflect the number of rows, but the size of the predicate values, where string, geo, password and default refer to variable size predicates. This makes estimating the number of rows difficult. Compression might also make estimation more complex. However, we can see this size as a transfer-cost estimate and make the final partitioning even-sized regarding that metric.

The text was updated successfully, but these errors were encountered:

EnricoMi added the enhancement New feature or request label Nov 16, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Predicate-size uid partitioning #65

Predicate-size uid partitioning #65

EnricoMi commented Nov 16, 2020

Predicate-size uid partitioning #65

Predicate-size uid partitioning #65

Comments

EnricoMi commented Nov 16, 2020