
[spark] Support report scan ordering #4026

Merged (1 commit, Aug 28, 2024)
Conversation

@ulysses-you (Contributor) commented Aug 21, 2024

Purpose

This PR makes PaimonScan implement SupportsReportOrdering. For a primary key table, we do a sorted run during write, so the primary keys within one file are always ordered. We skip reporting the ordering if the scan contains a rawConvertible split, since no merge-sort read is performed in that case.

This can help eliminate the local sort for sort-merge join, sort-based aggregation, etc.
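The decision described above can be sketched without Spark or Paimon dependencies. This is a minimal model, not the actual Paimon classes: the `Split` record and the `reportOrdering` helper are hypothetical stand-ins for `DataSplit`'s `rawConvertible` flag and file count, and for the scan-level check.

```java
import java.util.List;

public class ScanOrderingSketch {
    // Hypothetical stand-in for Paimon's DataSplit, keeping only the two
    // fields the ordering decision depends on.
    record Split(boolean rawConvertible, int dataFileCount) {
        // Mirrors the predicate from this PR: a split keeps the primary-key
        // ordering if it goes through the merge-sort read path
        // (!rawConvertible), or if it reads at most one file raw.
        boolean keepOrdering() {
            return !rawConvertible || dataFileCount < 2;
        }
    }

    // The scan reports its ordering to Spark only when every split keeps it;
    // otherwise Spark must keep its own local sort before a sort-merge join
    // or a sort-based aggregation.
    static boolean reportOrdering(List<Split> splits) {
        return splits.stream().allMatch(Split::keepOrdering);
    }

    public static void main(String[] args) {
        // Merge-sort read: ordering survives regardless of file count.
        System.out.println(reportOrdering(List.of(new Split(false, 5)))); // true
        // Raw read of two files: rows from different files may interleave.
        System.out.println(reportOrdering(List.of(new Split(true, 2))));  // false
    }
}
```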

Tests

Added tests.

API and Format

no

Documentation

@JingsongLi (Contributor)

Thanks @ulysses-you for the contribution.

This optimization looks good to me!

But there are some potential issues.
Perhaps we need to add an interface to ReadBuilder, because even for primary key tables the reads may not be ordered, for example when deletion vectors mode or read-optimized mode is enabled.

Comment on lines 111 to 120
/**
* To indicate whether this `DataSplit` keeps the raw table ordering. For example, for a primary
* key table, we do a sorted run during write and a merge read during read, so the data is
* sorted by the primary keys. Returning `false` means the ordering is broken. If a `DataSplit`
* is `rawConvertible` then there is no sort merge read, so the data ordering is only correct
* when there are fewer than 2 data files.
*/
public boolean keepOrdering() {
return !rawConvertible || dataFiles.size() < 2;
}
@ulysses-you (Contributor, Author)

cc @JingsongLi, if I understand correctly, this should address your concern?

return Array.empty
}

val allSplitsKeepOrdering = lazyInputPartitions.toSeq
@JingsongLi (Contributor)

Maybe we should make sure there is only one split in a bucket?

@ulysses-you (Contributor, Author)

Thank you @JingsongLi, please correct me if I am wrong. Per my understanding, if a partition contains multiple splits that use merge file read, then all splits should be ordered and should never overlap?

@JingsongLi (Contributor)

Merging only occurs within a single split.
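The concern in this thread can be illustrated with a toy example (no Paimon code involved, the class and data are hypothetical): each split may be internally sorted by key, but because merging only happens within one split, two splits in the same bucket can have overlapping key ranges, so reading them one after another does not yield a globally sorted stream.

```java
import java.util.ArrayList;
import java.util.List;

public class OverlappingSplits {
    // Returns true if the keys are in non-decreasing order.
    static boolean isSorted(List<Integer> keys) {
        for (int i = 1; i < keys.size(); i++) {
            if (keys.get(i - 1) > keys.get(i)) return false;
        }
        return true;
    }

    public static void main(String[] args) {
        // Two splits, each internally sorted by primary key,
        // but with overlapping key ranges ([1,7] and [3,9]).
        List<Integer> splitA = List.of(1, 4, 7);
        List<Integer> splitB = List.of(3, 6, 9);

        // Reading the bucket as split A then split B concatenates them
        // without a merge, so the combined stream is not globally sorted.
        List<Integer> bucketOutput = new ArrayList<>(splitA);
        bucketOutput.addAll(splitB);

        System.out.println(isSorted(splitA));       // true
        System.out.println(isSorted(splitB));       // true
        System.out.println(isSorted(bucketOutput)); // false: 7 is followed by 3
    }
}
```

This is why ensuring at most one split per bucket (or verifying that split key ranges never overlap) matters before reporting a bucket-level ordering.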


import scala.collection.JavaConverters._

case class PaimonScan(
Contributor

Is there a possibility to extract a base class PaimonBucketedScan to reuse code?

* 2, we can get the correct data ordering.
*/
public boolean keepOrdering() {
return !rawConvertible || dataFiles.size() < 2;
Contributor

Maybe we don't need to introduce a method here; just use `!rawConvertible || dataFiles.size() < 2` in Spark.

@ulysses-you (Contributor, Author)

Thank you @JingsongLi, addressed the comments.

@JingsongLi (Contributor) left a comment:

Looks good to me!

@JingsongLi JingsongLi merged commit 3efd2f3 into apache:master Aug 28, 2024
10 checks passed
@ulysses-you ulysses-you deleted the ordering branch August 28, 2024 08:51