feat: Support Bucket and Truncate transforms on write #1345

sungwy · 2024-11-20T02:36:30Z

Getting the PR ready for when pyiceberg_core is released from iceberg-rust

PR to introduce python binding release: apache/iceberg-rust#705

Consideration: we could replace the existing pyarrow dependency on order_preserving transforms (Month,Year,Date) with pyiceberg_core for consistency

kevinjqliu

LGTM! Great to have writes for all the different transformations!

kevinjqliu · 2024-12-24T23:07:48Z

tests/integration/test_writes/test_partitioned_writes.py

+@pytest.mark.parametrize(
+    "spec, expected_rows",
+    [
+        # none of non-identity is supported


Suggested change

# none of non-identity is supported

kevinjqliu · 2024-12-24T23:18:55Z

tests/test_transforms.py

+    source_type: PrimitiveType,
+    input_arr: Union[pa.Array, pa.ChunkedArray],
+    expected: Union[pa.Array, pa.ChunkedArray],
+    num_buckets: int,


nit: wydt of reordering these for readability? num_buckets, source_type and input_arr are configs of the BucketTransform; expected is the output

Hmm I think I feel indifferent here - there’s something nice about having the input and expected arrays side by side

introduce bucket transform

dd888ec

kevinjqliu self-requested a review December 19, 2024 17:15

include pyiceberg-core

bd80f39

sungwy force-pushed the bucket-transforms branch from 560ba20 to bd80f39 Compare December 24, 2024 18:27

sungwy added 4 commits December 24, 2024 18:30

introduce bucket transform

27ade9a

include pyiceberg-core

fcd654c

resolve poetry conflict

a0a9c58

Merge branch 'bucket-transform' into bucket-transforms

a4137e0

sungwy marked this pull request as ready for review December 24, 2024 18:35

support truncate transforms

05c440f

sungwy changed the title ~~Introduce bucket transform~~ feat: Support bucket and Truncate transforms on write Dec 24, 2024

sungwy changed the title ~~feat: Support bucket and Truncate transforms on write~~ feat: Support Bucket and Truncate transforms on write Dec 24, 2024

sungwy requested a review from Fokko December 24, 2024 20:45

kevinjqliu approved these changes Dec 24, 2024

View reviewed changes

Remove stale comment

7079265

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Support Bucket and Truncate transforms on write #1345

feat: Support Bucket and Truncate transforms on write #1345

sungwy commented Nov 20, 2024 •

edited

Loading

kevinjqliu left a comment

kevinjqliu Dec 24, 2024

kevinjqliu Dec 24, 2024

sungwy Dec 25, 2024

feat: Support Bucket and Truncate transforms on write #1345

Are you sure you want to change the base?

feat: Support Bucket and Truncate transforms on write #1345

Conversation

sungwy commented Nov 20, 2024 • edited Loading

kevinjqliu left a comment

Choose a reason for hiding this comment

kevinjqliu Dec 24, 2024

Choose a reason for hiding this comment

kevinjqliu Dec 24, 2024

Choose a reason for hiding this comment

sungwy Dec 25, 2024

Choose a reason for hiding this comment

sungwy commented Nov 20, 2024 •

edited

Loading