Skip to content

Commit

Permalink
Allow setting write.parquet.page-row-limit (#1017)
Browse files Browse the repository at this point in the history
* Allow setting `write.parquet.page-row-limit`

It is being passed down to PyArrow here:

https://github.com/apache/iceberg-python/blob/8aeab4951080fa196c0d29c72cba1cbba824ffc4/pyiceberg/io/pyarrow.py#L2374-L2377

* Update test
  • Loading branch information
Fokko authored and sungwy committed Aug 9, 2024
1 parent 09997c4 commit 8a8e813
Show file tree
Hide file tree
Showing 2 changed files with 0 additions and 2 deletions.
1 change: 0 additions & 1 deletion pyiceberg/io/pyarrow.py
Original file line number Diff line number Diff line change
Expand Up @@ -2285,7 +2285,6 @@ def _get_parquet_writer_kwargs(table_properties: Properties) -> Dict[str, Any]:

for key_pattern in [
TableProperties.PARQUET_ROW_GROUP_SIZE_BYTES,
TableProperties.PARQUET_PAGE_ROW_LIMIT,
TableProperties.PARQUET_BLOOM_FILTER_MAX_BYTES,
f"{TableProperties.PARQUET_BLOOM_FILTER_COLUMN_ENABLED_PREFIX}.*",
]:
Expand Down
1 change: 0 additions & 1 deletion tests/integration/test_writes/test_writes.py
Original file line number Diff line number Diff line change
Expand Up @@ -528,7 +528,6 @@ def test_write_parquet_other_properties(
"properties",
[
{"write.parquet.row-group-size-bytes": "42"},
{"write.parquet.page-row-limit": "42"},
{"write.parquet.bloom-filter-enabled.column.bool": "42"},
{"write.parquet.bloom-filter-max-bytes": "42"},
],
Expand Down

0 comments on commit 8a8e813

Please sign in to comment.