Questions regarding zero-copy #187

rschu1ze · 2024-01-15T23:44:54Z

Hello,

I browsed the source code of chDB today (having read DuckDB's papers first) and have two questions:

As of today, chDB takes over the final query result (which is presumably refcounted) for further processing (see LocalServer.cpp).

Is it planned (or even possible) to offer a finer-grained mechanism for handing over results, based on the data chunks used internally for query processing? In DuckDB, the application can fetch individual result chunks by triggering pull() on the execution plan.
Likewise, I did not find a zero-copy mechanism for source data. As it stands, the host process must first write the data to be processed to a local file and then let the embedded ClickHouse load it via the file engine. Did I miss something?

Thanks!

auxten · 2024-01-16T12:58:11Z

Thanks for your great questions!

The current implementation can only return the entire result at once; it does not yet support scanning results incrementally with a cursor. I will try to address this in the next release.
Another point worth noting: in Python, zero-copy applies only when the returned data is handled through a memoryview.
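To illustrate what "zero-copy with memoryview" means on the Python side, here is a minimal standard-library sketch. It does not use chDB: the `result_buffer` bytearray is a hypothetical stand-in for a result buffer owned by the engine, and the names are made up for illustration.

```python
# Hypothetical stand-in for a result buffer owned by the query engine.
result_buffer = bytearray(b"id,name\n1,alice\n2,bob\n")

# A memoryview (and any slice of it) refers to the same memory.
view = memoryview(result_buffer)
header_view = view[:7]

# Converting to bytes allocates new memory and copies the data.
header_copy = bytes(result_buffer[:7])

# Mutate the underlying buffer in place (same length, so no resize).
result_buffer[0] = ord("I")

# The view observes the change, which shows it shares memory.
# The copy keeps the old contents.
view_sees_change = header_view.tobytes() == b"Id,name"
copy_is_stale = header_copy == b"id,name"
```

Any call that turns the view into `bytes` (or `str`) along the way reintroduces a copy, so a zero-copy path only holds as long as consumers keep working on the view.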

When querying Parquet, CSV, or any other files using something like SELECT * FROM file('path', Parquet), there is no unnecessary data copy.
However, when querying in-memory objects such as a DataFrame, an Arrow table, or data returned by chDB, the current implementation merely works: it writes the data to a temporary file or memfd and then queries it as a file. This approach is inelegant, and I will try to improve it.
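For readers unfamiliar with the temporary-file or memfd workaround, the following is a rough standard-library sketch of the general idea, not chDB's actual code. The function names are hypothetical, and the fallback to a regular temporary file on platforms without `os.memfd_create` is an assumption of this sketch.

```python
import os
import tempfile


def stage_as_file(payload: bytes):
    """Copy payload into a memfd (Linux) or a temp file; return (fd, path, is_temp)."""
    try:
        # Anonymous, RAM-backed file; only available on Linux.
        fd = os.memfd_create("staged_input")
        path, is_temp = f"/proc/self/fd/{fd}", False
    except (AttributeError, OSError):
        # Fallback: a regular temporary file on disk.
        fd, path = tempfile.mkstemp(suffix=".csv")
        is_temp = True
    remaining = memoryview(payload)
    while remaining:
        # os.write may write fewer bytes than requested, so loop.
        written = os.write(fd, remaining)
        remaining = remaining[written:]
    return fd, path, is_temp


def release(fd, path, is_temp):
    """Close the descriptor and delete the temp file if one was created."""
    os.close(fd)
    if is_temp:
        os.unlink(path)


csv_bytes = b"id,name\n1,alice\n2,bob\n"
fd, path, is_temp = stage_as_file(csv_bytes)
# A file-based reader can now open the data by path.
with open(path, "rb") as reader:
    round_trip = reader.read()
release(fd, path, is_temp)
```

Even with memfd, where nothing touches the disk, the payload is copied once into the file and read again by the consumer, which is why this path is not zero-copy.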

Regarding the last question, I would like to share some thoughts as well. The main idea is to use Arrow as the in-memory data type. Currently, I am researching two ways to achieve this:

Develop new file-related APIs that enable ClickHouse engine to read memory like a file.
Create a new storage type that allows ClickHouse to read Arrow buffer from memory.

I haven't decided which approach is better. You are welcome to discuss it with us.

rschu1ze · 2024-01-17T15:22:01Z

Thanks, that is very helpful. Great to hear that the points are known and being improved.

I don't have much insight into the internals of chDB. About importing data efficiently: ClickHouse supports many input/output formats, e.g. cat filename.orc | clickhouse-client --query="INSERT INTO some_table FORMAT ORC". Perhaps one option is to add a special memory-view-like I/O format that is passed a pointer and a size and that internally assumes the data is encoded in Arrow format.
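The "pointer + size" handoff can be sketched with Python's ctypes module. This is only an illustration of the idea within a single process: `export_buffer` and `import_buffer` are hypothetical names, no such ClickHouse format exists in this thread, and the raw bytes here are not real Arrow data.

```python
import ctypes


def export_buffer(buf: bytearray):
    """Producer side: return (address, size) of the buffer without copying it."""
    c_array = (ctypes.c_ubyte * len(buf)).from_buffer(buf)
    return ctypes.addressof(c_array), len(buf)


def import_buffer(address: int, size: int):
    """Consumer side: wrap the memory at `address` as a ctypes array (no copy).

    The caller must guarantee that the producer keeps the buffer alive and
    unchanged in size for as long as the returned array is in use.
    """
    return (ctypes.c_ubyte * size).from_address(address)


source = bytearray(b"arrow-bytes")  # placeholder payload, not real Arrow data
address, size = export_buffer(source)
shared = import_buffer(address, size)

# Writing through the consumer's array changes the producer's buffer,
# which shows that both sides refer to the same memory.
shared[0] = ord("A")
```

The lifetime requirement in the docstring is the main design cost of such an interface: a bare pointer carries no ownership information, which is one reason structured interfaces attach a release callback to shared buffers.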

SChakravorti21 · 2024-03-19T22:11:21Z

Just throwing in my $0.02 in case it's of any use. Regarding this part of the discussion:

Currently, I am researching two ways to achieve this:

Develop new file-related APIs that enable ClickHouse engine to read memory like a file.

Create a new storage type that allows ClickHouse to read Arrow buffer from memory.

Perhaps one option is to use Arrow's C data interface or C stream interface. These allow Arrow buffers to be shared across language boundaries in a zero-copy manner within a single process. If I understand correctly, this is how some engines like Polars and DuckDB already handle querying in-memory Arrow tables today.

I don't know anything about the internals of ClickHouse, but maybe this approach could make it easier/cleaner to implement a custom "storage type" as you mention. And vice versa, the C stream interface might help with exposing results as a stream of Arrow record batches.
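As a rough picture of what "exposing results as a stream of record batches" means for a consumer, here is a toy pull-based stream in plain Python. It only mirrors the general shape of a stream with get_schema, get_next, and release operations. It is not Arrow's C ABI, it does not share memory across languages, and the class name and row layout are hypothetical.

```python
from typing import List, Optional, Sequence, Tuple

Row = Tuple[int, str]
Batch = List[Row]  # hypothetical stand-in for an Arrow record batch


class BatchStream:
    """Toy pull-based stream: the consumer asks for one batch at a time."""

    def __init__(self, schema: Sequence[str], rows: List[Row], batch_size: int):
        self._schema = tuple(schema)
        self._rows = rows
        self._batch_size = batch_size
        self._offset = 0
        self._released = False

    def get_schema(self) -> Tuple[str, ...]:
        return self._schema

    def get_next(self) -> Optional[Batch]:
        """Return the next batch, or None once the stream is exhausted."""
        if self._released:
            raise RuntimeError("stream already released")
        if self._offset >= len(self._rows):
            return None
        batch = self._rows[self._offset:self._offset + self._batch_size]
        self._offset += len(batch)
        return batch

    def release(self) -> None:
        """Drop the producer's data; the stream must not be used afterwards."""
        self._rows = []
        self._released = True


stream = BatchStream(("id", "name"), [(1, "a"), (2, "b"), (3, "c")], batch_size=2)
batches = []
while (batch := stream.get_next()) is not None:
    batches.append(batch)
stream.release()
```

With this shape the consumer controls the pace and never needs the whole result in memory at once, which is the property the original question asks about.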

auxten · 2024-03-20T00:30:39Z

Thanks for your great advice. I'm looking into it.

rschu1ze added the question Further information is requested label Jan 15, 2024

rschu1ze closed this as completed Jan 17, 2024

auxten reopened this Feb 6, 2024

auxten mentioned this issue Feb 6, 2024

CHDB is significantly slower on Arrow tables (in-memory) than with CSV / Parquet #195

Closed

auxten added the Arrow Apache Arrow support label Mar 11, 2024
