Support for Apache Arrow #51

MedAnd · 2019-02-09T21:21:34Z

Support for Apache Arrow which is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication.

Apache Arrow is backed by key developers of 13 major open source projects, including Calcite, Cassandra, Drill, Hadoop, HBase, Ibis, Impala, Kudu, Pandas, Parquet, Phoenix, Spark, and Storm making it the de-facto standard for columnar in-memory analytics.

Related issues:

apscomp · 2019-08-26T18:30:08Z

I would like to second this... as apache arrow is slowly becoming mainstream.

AlgorithmsAreCool · 2020-01-06T04:37:43Z

So Arrow is definitely gaining steam, but how would an arrow integration look for Trill?

et say they convert the internal columnar format to Arrow. Trill is still going to be mutating those structures constantly since it is an incremental platform, how would an integration work with those internal structures safely and what would it do with them?

AlgorithmsAreCool · 2020-08-07T23:31:47Z

A few months on, I can answer my own questions here.

The most obvious point is to open the door to very high performance interop with other applications or runtimes. Someone could write a database plugin that uses Trill operations to compute data.

Furthermore, bulk Arrow structures could be stored directly on disk and accessed via memory mapping. Or systems can make use of standardized readers to import CSVs or Parquet files into arrow structures.

Overall, using a standardized memory layout allows Trill to be integrated more easily and more efficiently with a wider array of projects. It is a very enticing benefit.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for Apache Arrow #51

Support for Apache Arrow #51

MedAnd commented Feb 9, 2019 •

edited

Loading

apscomp commented Aug 26, 2019

AlgorithmsAreCool commented Jan 6, 2020

AlgorithmsAreCool commented Aug 7, 2020

Support for Apache Arrow #51

Support for Apache Arrow #51

Comments

MedAnd commented Feb 9, 2019 • edited Loading

apscomp commented Aug 26, 2019

AlgorithmsAreCool commented Jan 6, 2020

AlgorithmsAreCool commented Aug 7, 2020

MedAnd commented Feb 9, 2019 •

edited

Loading