-
Notifications
You must be signed in to change notification settings - Fork 990
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[flink] PartitionMarkDone enables the use of various partition trigge… #4386
Conversation
WDYT? @JingsongLi |
Hi @Aitozi , can you explain this pr from API level? Or you are modifying current behavior? |
@JingsongLi The behavior is not change. In this PR, I extract the In |
|
||
public PartitionMarkDoneTrigger( | ||
State state, | ||
PartitionTimeExtractor timeExtractor, | ||
@Nullable Duration timeInterval, | ||
@Nullable Duration idleTime, | ||
boolean markDoneWhenEndInput) | ||
boolean markDoneWhenEndInput, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you keep PartitionMarkDone
as it is?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I reset the commit, and updated in #4398
import java.util.List; | ||
|
||
/** Partition collector. */ | ||
public class PartitionCollector implements Closeable { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
PartitionListeners
?
import java.util.List; | ||
|
||
/** The partition trigger. */ | ||
public interface PartitionTrigger extends Closeable { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
PartitionListener
?
…r strategies.
Purpose
Linked issue: close #xxx
In our company, we have encountered an issue with the HMS partition statistic being incorrect. This is because during the writing process, we only update the metastore partition when it is first written to.
Therefore, we would like to implement the PartitionMarkDone strategy to update the statistics in HMS after a short idle period for each partition.
We need a separate configuration for PartitionMarkDone due to differing requirements:
To achieve this, we plan on extending
PartitionMarkDone
functionality to support different trigger strategies for partitions. Following this extension, we will be able to introduce custom triggers such asPartitionHmsReporterTrigger
.Tests
API and Format
Documentation