Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[core] fix skew bug of stream read unware bucket table #3955

Conversation

wwj6591812
Copy link
Contributor

@wwj6591812 wwj6591812 commented Aug 13, 2024

Now when stream read a unware bucket table with comsume_id and consumer.mode = exactly-once. SKEW will happy, this is a bug, now fix it.
image

Purpose

Linked issue: close #xxx

Tests

API and Format

Documentation

@wwj6591812 wwj6591812 force-pushed the fix_skew_bug_of_stream_read_unware_bucket_table_0813 branch from 4b1efe3 to aced230 Compare August 14, 2024 01:49
return Tuple2.of(dataSplit.partition(), dataSplit.bucket());
})
.transform(name + "-Reader", typeInfo, new ReadOperator(readBuilder));
if (shuffleByPartition) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You really want this? The streaming-read.shuffle-by-partition is true by default. It means?

Copy link
Contributor Author

@wwj6591812 wwj6591812 Aug 14, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In my business scenario, I will set streaming-read.shuffle-by-partition to false.
I'm not sure if any users need shuffle by partition when read the unware bucket table. So for compatibility reasons, I add this if.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is bug, you should fix it.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done

@wwj6591812 wwj6591812 force-pushed the fix_skew_bug_of_stream_read_unware_bucket_table_0813 branch from aced230 to ef0ab81 Compare August 14, 2024 06:21
@wwj6591812 wwj6591812 force-pushed the fix_skew_bug_of_stream_read_unware_bucket_table_0813 branch from ef0ab81 to b0c1360 Compare August 14, 2024 09:03
@wwj6591812 wwj6591812 requested a review from JingsongLi August 14, 2024 09:28
Copy link
Contributor

@JingsongLi JingsongLi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @wwj6591812 , looks good to me!

@JingsongLi JingsongLi merged commit fff9a31 into apache:master Aug 14, 2024
11 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants