[spark] CompactUnAwareBucketTable uses union to replace submit job by self #3997

Merged (1 commit) on Aug 19, 2024

Conversation

ulysses-you
Contributor

Purpose

This PR uses Union(datasets) instead of submitting each dataset as its own job, which reduces driver limitations and overhead. It also removes the max_concurrent_jobs option, which is unnecessary because hash_fixed and hash_dynamic have no such config.
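
A toy model of the idea, not the actual change: it uses neither Spark nor Paimon APIs, and every name in it (`ToyDriver`, `compact_each`, `compact_union`) is hypothetical. It assumes a "job" is one submission to the driver and a "dataset" is a list of partitions, and only shows that unioning the datasets first yields the same per-partition results with a single submission.

```python
from dataclasses import dataclass
from typing import Callable, List

Partition = List[int]
Dataset = List[Partition]  # assumption: a dataset is modeled as a list of partitions


@dataclass
class ToyDriver:
    """Hypothetical stand-in for a driver; it only counts submitted jobs."""
    jobs_submitted: int = 0

    def run_job(self, dataset: Dataset, task: Callable[[Partition], int]) -> List[int]:
        # One call equals one submitted job; the task runs once per partition.
        self.jobs_submitted += 1
        return [task(partition) for partition in dataset]


def compact_each(driver: ToyDriver, datasets: List[Dataset],
                 task: Callable[[Partition], int]) -> List[int]:
    """Old shape: submit every dataset as its own job."""
    results: List[int] = []
    for dataset in datasets:
        results.extend(driver.run_job(dataset, task))
    return results


def compact_union(driver: ToyDriver, datasets: List[Dataset],
                  task: Callable[[Partition], int]) -> List[int]:
    """New shape: union all partitions first, then submit a single job."""
    unioned = [partition for dataset in datasets for partition in dataset]
    return driver.run_job(unioned, task)


datasets = [[[1, 2], [3]], [[4, 5, 6]], [[7], [8, 9]]]
per_dataset_driver = ToyDriver()
union_driver = ToyDriver()

# `sum` stands in for the per-partition compaction work.
separate = compact_each(per_dataset_driver, datasets, sum)
unioned = compact_union(union_driver, datasets, sum)

print(per_dataset_driver.jobs_submitted, union_driver.jobs_submitted)
```

In this model the per-dataset path submits one job per dataset while the union path submits exactly one, which is why a separate concurrency limit on job submission has nothing left to control.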

Tests

Pass CI

API and Format

no

Documentation

@ulysses-you
Contributor Author

cc @JingsongLi thank you

@JingsongLi
Contributor

CC @Zouxxyy to take a look.

Contributor

@Zouxxyy Zouxxyy left a comment

Looks good to me!

@JingsongLi
Contributor

+1

@JingsongLi JingsongLi merged commit 14d82f9 into apache:master Aug 19, 2024
10 of 11 checks passed
@ulysses-you ulysses-you deleted the compact-unaware branch August 20, 2024 01:08
3 participants