Update changelog for 3.9.0
fjetter authored Jun 3, 2020
1 parent b73e0d8 commit e62b943
Showing 1 changed file with 14 additions and 3 deletions.
17 changes: 14 additions & 3 deletions CHANGES.rst
@@ -2,12 +2,23 @@
 Changelog
 =========

-Version 3.9.0 (UNRELEASED)
+Version 3.9.0 (2020-06-03)
 ==========================

-* Significant performance improvements for shuffle operations in :func:`~kartothek.io.dask.dataframe.update_dataset_from_ddf`
+Improvements
+^^^^^^^^^^^^
+* Arrow 0.17.X support
+* Significant performance improvements for shuffle operations in
+:func:`~kartothek.io.dask.dataframe.update_dataset_from_ddf`
+for large dask.DataFrames with many payload columns by using in-memory
+compression during the shuffle operation.
 * Allow calling :func:`~kartothek.io.dask.dataframe.update_dataset_from_ddf`
-without `partition_on` when `shuffle=True`
+without `partition_on` when `shuffle=True`.
+* :func:`~kartothek.io.dask.dataframe.read_dataset_as_ddf` supports kwarg ``dispatch_by``
+to control the internal partitioning structure when creating a dataframe.
+* :func:`~kartothek.io.dask.dataframe.read_dataset_as_ddf` and :func:`~kartothek.io.dask.dataframe.update_dataset_from_ddf`
+now allow the keyword ``table`` to be optional, using the default SINGLE_TABLE identifier.
+(recommended since the multi table dataset support is in sunset).


 Version 3.8.2 (2020-04-09)