[core] fix skew bug of stream read unaware bucket table #3955
            return Tuple2.of(dataSplit.partition(), dataSplit.bucket());
        })
        .transform(name + "-Reader", typeInfo, new ReadOperator(readBuilder));
if (shuffleByPartition) {
Do you really want this? streaming-read.shuffle-by-partition is true by default. What does that imply here?
In my business scenario, I set streaming-read.shuffle-by-partition to false.
I'm not sure whether any users need to shuffle by partition when reading an unaware-bucket table, so I added this `if` for compatibility.
This is a bug; you should fix it.
done
...n-flink/paimon-flink-common/src/test/java/org/apache/paimon/flink/AppendOnlyTableITCase.java
Thanks @wwj6591812, looks good to me!
Currently, when stream-reading an unaware-bucket table with consumer-id set and consumer.mode = exactly-once, data skew occurs. This is a bug, and this PR fixes it.
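To illustrate why a shuffle keyed on (partition, bucket) can skew here, the following is a minimal standalone sketch, not the code from this PR. It assumes that every split of an unaware-bucket table reports the same bucket value (0 in this sketch), so all splits of one partition share a single key. The class `ShuffleSkewSketch`, the `Split` type, the partition name, and the routing helpers are hypothetical stand-ins and are not Paimon or Flink APIs.

```java
import java.util.Arrays;
import java.util.Objects;

public class ShuffleSkewSketch {

    /** Hypothetical stand-in for a data split: only the fields used for routing. */
    static final class Split {
        final String partition;
        final int bucket;

        Split(String partition, int bucket) {
            this.partition = partition;
            this.bucket = bucket;
        }
    }

    /** Builds n splits for one partition. ASSUMPTION: every split reports bucket 0. */
    static Split[] singlePartitionSplits(String partition, int n) {
        Split[] splits = new Split[n];
        for (int i = 0; i < n; i++) {
            splits[i] = new Split(partition, 0);
        }
        return splits;
    }

    /** Routes each split by the hash of (partition, bucket), like a keyed shuffle. */
    static int[] countsByKey(Split[] splits, int parallelism) {
        int[] counts = new int[parallelism];
        for (Split s : splits) {
            int hash = Objects.hash(s.partition, s.bucket);
            counts[Math.floorMod(hash, parallelism)]++;
        }
        return counts;
    }

    /** Routes splits to each reader in turn, like a round-robin rebalance. */
    static int[] countsRoundRobin(Split[] splits, int parallelism) {
        int[] counts = new int[parallelism];
        for (int i = 0; i < splits.length; i++) {
            counts[i % parallelism]++;
        }
        return counts;
    }

    public static void main(String[] args) {
        // Hypothetical partition name; 8 splits read by 4 parallel readers.
        Split[] splits = singlePartitionSplits("dt=2024-08-01", 8);
        System.out.println("keyed:       " + Arrays.toString(countsByKey(splits, 4)));
        System.out.println("round-robin: " + Arrays.toString(countsRoundRobin(splits, 4)));
    }
}
```

Under that assumption, the keyed routing sends all eight splits to one reader while the other three stay idle, whereas the round-robin routing gives each reader two splits. With many partitions the keyed variant spreads out more, but a hot partition would still land on a single reader.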
Purpose
Linked issue: close #xxx
Tests
API and Format
Documentation