-
Notifications
You must be signed in to change notification settings - Fork 442
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CH] Shuffle writer connects to CH pipeline #6723
Conversation
Thanks for opening a pull request! Could you open an issue for this pull request on Github Issues? https://github.com/apache/incubator-gluten/issues Then could you also rename commit message and pull request title in the following format?
See also: |
Run Gluten Clickhouse CI |
86b3227
to
17fd827
Compare
Run Gluten Clickhouse CI |
17fd827
to
78431ef
Compare
Run Gluten Clickhouse CI |
78431ef
to
bf78670
Compare
Run Gluten Clickhouse CI |
bf78670
to
02c3893
Compare
Run Gluten Clickhouse CI |
1 similar comment
Run Gluten Clickhouse CI |
91df9a5
to
3eb182b
Compare
Run Gluten Clickhouse CI |
3eb182b
to
7f50cbd
Compare
Run Gluten Clickhouse CI |
7f50cbd
to
48781a3
Compare
Run Gluten Clickhouse CI |
48781a3
to
749fb10
Compare
Run Gluten Clickhouse CI |
749fb10
to
ee4bf11
Compare
Run Gluten Clickhouse CI |
ee4bf11
to
c7cdffe
Compare
Run Gluten Clickhouse CI |
c7cdffe
to
0e9c9c7
Compare
Run Gluten Clickhouse CI |
0e9c9c7
to
18435c7
Compare
Run Gluten Clickhouse CI |
88c5b47
to
0f7982b
Compare
Run Gluten Clickhouse CI |
Run Gluten Clickhouse CI |
2217b22
to
aa46eea
Compare
Run Gluten Clickhouse CI |
aa46eea
to
86d92e2
Compare
Run Gluten Clickhouse CI |
auto * current_executor = local_engine::LocalExecutor::getCurrentExecutor(); | ||
chassert(current_executor); | ||
local_engine::SplitterHolder * splitter = nullptr; | ||
// handle fallback, whole stage fallback or partial fallback |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In which cases, current_executor will be nullptr or not? It's better to add a comment here
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done
86d92e2
to
e00a93c
Compare
Run Gluten Clickhouse CI |
e00a93c
to
da12381
Compare
Run Gluten Clickhouse CI |
da12381
to
86087bf
Compare
Run Gluten Clickhouse CI |
LGTM |
Fix UT build due to apache#7193
Fix UT build due to apache#7193
* [GLUTEN-1632][CH]Daily Update Clickhouse Version (20240918) * Fix build due to ENABLE_ROCKSDB=OFF caused by #7239 * Fix UT build due to #6723 * Fix UT build due to #7193 * Fix Build due to ClickHouse/ClickHouse#69298 --------- Co-authored-by: kyligence-git <[email protected]> Co-authored-by: Chang Chen <[email protected]>
What changes were proposed in this pull request? shuffle writer现在可以作为Processor接入ClickHouse pipeline当中。 image 在fallback模式下,会在jni中以循环的形式完成stage的执行,主要原因是,某些情况下的fallback会有spark的whole code gen, 其中code gen生成的代码会使用TaskContext,需要保证执行线程为task线程 image 移除了CachedShuffleWriter,新的SparkExchangeSink行为与原有shuffleWriter保持一致 同时还做了一下改动: 支持native的inputFileName,InputBlockStart, InputBlockLength shuffle Wall time统计,在Processor层面统计完整的shuffle耗时 LocalExecutor移出SerilizedPlanParser DefaultHashAggregateResultStep与DefaultHashAggregateResultTransform的output header不匹配问题 How was this patch tested? unit tests (If this patch involves UI changes, please attach a screenshot; otherwise, remove this)
) * [GLUTEN-1632][CH]Daily Update Clickhouse Version (20240918) * Fix build due to ENABLE_ROCKSDB=OFF caused by apache#7239 * Fix UT build due to apache#6723 * Fix UT build due to apache#7193 * Fix Build due to ClickHouse/ClickHouse#69298 --------- Co-authored-by: kyligence-git <[email protected]> Co-authored-by: Chang Chen <[email protected]>
What changes were proposed in this pull request?
shuffle writer现在可以作为Processor接入ClickHouse pipeline当中。
在fallback模式下,会在jni中以循环的形式完成stage的执行,主要原因是,某些情况下的fallback会有spark的whole code gen,
其中code gen生成的代码会使用TaskContext,需要保证执行线程为task线程
移除了CachedShuffleWriter,新的SparkExchangeSink行为与原有shuffleWriter保持一致
同时还做了一下改动:
How was this patch tested?
unit tests
(If this patch involves UI changes, please attach a screenshot; otherwise, remove this)