[fix](catalog) opt the count pushdown rule for iceberg/paimon/hive scan node #44038

Merged · 7 commits · Dec 10, 2024

Conversation

@morningman (Contributor) commented Nov 15, 2024

What problem does this PR solve?

  1. Optimize parallelism for the count push-down optimization

    The count push-down optimization speeds up queries such as `select count(*) from table`.
    In this scenario, the row count can be obtained directly from the external table's
    row count statistics, or from the metadata of Parquet/ORC files,
    without reading the actual file content.

    Currently, count push-down is supported for Hive, Iceberg, and Paimon tables.
    There are two ways to obtain the row count:

    1. Directly from statistics

      For Iceberg tables, the row count can be obtained directly from table statistics.
      However, due to historical issues in Iceberg, this method cannot be used when the table
      contains position/equality deletes, because it would return an incorrect row count.
      In that case, the optimization falls back to reading the row count from file metadata.

    2. From file metadata

      For Hive, Paimon, and some Iceberg tables, the row count can be obtained directly
      from the metadata of Parquet/ORC files.
      For Text format tables, efficiency can still be improved by performing only row splitting, without column splitting.

    In the task-splitting logic, the number of split tasks for count push-down should take into account
    the file format, the number of files, the parallelism, the number of BE nodes, and local shuffle:

    1. Count push-down should avoid local shuffle, so the number of split tasks should be greater than or equal to `parallelism * number of BE nodes`.
  2. Fix incorrect count push-down logic

    Previously, for Iceberg and Paimon tables, count push-down did not take effect because the
    CountPushDown information was not passed to the FileFormatReader inside TableFormatReader. This PR fixes that.

  3. Store the SessionVariable reference in FileQueryScanNode

    SessionVariable is held in ConnectContext, which is a ThreadLocal variable.
    In some cases, FileQueryScanNode is accessed from other threads, where that ThreadLocal value is unavailable.
    Storing the SessionVariable reference directly in FileQueryScanNode prevents such illegal access.

  4. Extract an independent FileSplitter class

    FileSplitter is a utility class that splits files into `Split`s according to different strategies.
    This PR does not change the splitting strategy; it only extracts the logic into a separate class
    so that it can be optimized later.
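
The split-task rule described above can be sketched as follows. This is a minimal illustration under stated assumptions, not Doris's actual implementation; the class and method names are hypothetical:

```java
// Hypothetical sketch of the count push-down split-count rule.
// Names (CountPushdownSplitPlanner, computeSplitCount) are illustrative only.
public class CountPushdownSplitPlanner {
    /**
     * Choose the number of split tasks for a count-pushdown scan.
     * To avoid local shuffle, the task count should be at least
     * parallelism * number of BE nodes, but it can never exceed the
     * number of files, since each file yields at least one metadata read.
     */
    public static int computeSplitCount(int fileCount, int parallelism, int beNodes) {
        int minTasks = Math.max(parallelism * beNodes, 1);
        // Cannot create more tasks than there are files to read.
        return Math.min(fileCount, minTasks);
    }

    public static void main(String[] args) {
        // 100 files, parallelism 8, 3 BE nodes -> at least 24 tasks.
        System.out.println(computeSplitCount(100, 8, 3)); // prints 24
        // Only 10 files available: capped at 10.
        System.out.println(computeSplitCount(10, 8, 3));  // prints 10
    }
}
```

With 100 files, parallelism 8, and 3 BE nodes, at least 24 tasks are produced, so each pipeline instance on each BE can receive its own split and no local shuffle is needed.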

Release note

None

Check List (For Author)

  • Test

    • Regression test
    • Unit Test
    • Manual test (add detailed scripts or steps below)
    • No need to test or manual test. Explain why:
      • This is a refactor/code format and no logic has been changed.
      • Previous test can cover this change.
      • No code files have been changed.
      • Other reason
  • Behavior changed:

    • No.
    • Yes.
  • Does this need documentation?

    • No.
    • Yes.

Check List (For Reviewer who merge this PR)

  • Confirm the release note
  • Confirm test cases
  • Confirm document
  • Add branch pick label

@doris-robot

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR.

Please clearly describe your PR:

  1. What problem was fixed (it's best to include specific error reporting information). How it was fixed.
  2. Which behaviors were modified. What was the previous behavior, what is it now, why was it modified, and what possible impacts might there be.
  3. What features were added. Why was this function added?
  4. Which code was refactored and why was this part of the code refactored?
  5. Which functions were optimized and what is the difference before and after the optimization?

Contributor

clang-tidy review says "All clean, LGTM! 👍"

2 similar comments

@morningman
Contributor Author

run buildall

@doris-robot

TeamCity be ut coverage result:
Function Coverage: 38.48% (10006/26002)
Line Coverage: 29.51% (83906/284377)
Region Coverage: 28.60% (43113/150735)
Branch Coverage: 25.20% (21914/86974)
Coverage Report: http://coverage.selectdb-in.cc/coverage/b955b479e1cfe99d0e43b45f618aba83f82af0ca_b955b479e1cfe99d0e43b45f618aba83f82af0ca/report/index.html

@morningman
Contributor Author

run buildall

@morningman morningman changed the title [opt](catalog) opt the count pushdown rule for iceberg/paimon/hive scan node [fix](catalog) opt the count pushdown rule for iceberg/paimon/hive scan node Dec 8, 2024
@morningman morningman marked this pull request as ready for review December 8, 2024 21:53
@doris-robot

TeamCity be ut coverage result:
Function Coverage: 38.49% (10007/26002)
Line Coverage: 29.51% (83914/284381)
Region Coverage: 28.61% (43128/150744)
Branch Coverage: 25.20% (21921/86976)
Coverage Report: http://coverage.selectdb-in.cc/coverage/0d6cc74a96542717523af72bb7fec831e48b5274_0d6cc74a96542717523af72bb7fec831e48b5274/report/index.html

@morningman
Contributor Author

run buildall

@doris-robot

TeamCity be ut coverage result:
Function Coverage: 38.68% (10061/26013)
Line Coverage: 29.59% (84211/284574)
Region Coverage: 28.68% (43268/150843)
Branch Coverage: 25.24% (21973/87054)
Coverage Report: http://coverage.selectdb-in.cc/coverage/f4a3f49968151b46094b27c2d0045aae284e7ae4_f4a3f49968151b46094b27c2d0045aae284e7ae4/report/index.html

@morningman
Contributor Author

run buildall

github-actions bot (Contributor) commented Dec 9, 2024

clang-tidy review says "All clean, LGTM! 👍"

github-actions bot (Contributor) commented Dec 9, 2024

PR approved by anyone and no changes requested.

fix

fix count

fix number backends

wait file filter
@morningman
Contributor Author

run buildall

github-actions bot (Contributor) commented Dec 9, 2024

clang-tidy review says "All clean, LGTM! 👍"

@doris-robot

TeamCity be ut coverage result:
Function Coverage: 38.80% (10103/26037)
Line Coverage: 29.70% (84716/285199)
Region Coverage: 28.77% (43480/151147)
Branch Coverage: 25.32% (22088/87228)
Coverage Report: http://coverage.selectdb-in.cc/coverage/4d3920f7e0ba13b006060d612a352fc1fbdde81d_4d3920f7e0ba13b006060d612a352fc1fbdde81d/report/index.html

@kaka11chen (Contributor) left a comment:

LGTM

@github-actions github-actions bot added the approved Indicates a PR has been approved by one committer. label Dec 10, 2024
Contributor

PR approved by at least one committer and no changes requested.

@morningman morningman merged commit 18dc92a into apache:master Dec 10, 2024
24 of 26 checks passed
morningman added a commit to morningman/doris that referenced this pull request Dec 10, 2024
…an node (apache#44038)

morningman added a commit to morningman/doris that referenced this pull request Dec 15, 2024
…an node (apache#44038)

morningman added a commit to morningman/doris that referenced this pull request Dec 17, 2024
…an node (apache#44038)

Labels
approved Indicates a PR has been approved by one committer. dev/2.1.8-merged dev/3.0.4-merged reviewed

4 participants