-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug] first_value semantic conflict #3393
Comments
Thanks @luowanghaoyun for report this. IMO, the aggregate function for I try to introduce an |
@Aitozi Why not keep the earliest sequence under the mentioned scenarios? I think sequence.field should also be constrainted with the aggregation function. |
@liyubin117 If we have serval aggregation function fields, a single sequence.field is not enough. The |
@Aitozi Thanks for claring that! it is actually more complex than I thought. |
@Aitozi hi, when is this issue expected to be fixed? |
Search before asking
Paimon version
0.8
Compute Engine
Flink
Minimal reproduce step
I have read this issues: #3020
I discovered this phenomenon:
The result is
[1, 4, 2, 2, 2]
, as expectedIf I manually trigger a compaction before the third insert, like:
The result is
[1, 4, 3, 3, 3]
, and there is a semantic problem.For local debugging, in this case, there are only two values merged ([1, 3, 3, 3, 3], [1, 4, 2, 2, 2]) when querying.
It seems that after compaction, the previous sequence information was lost.
What doesn't meet your expectations?
The Doc. description is vague:
I'm not sure if there's something wrong with my understanding. Should the oldest sequence be maintained? Otherwise the compaction will lead to inconsistent first_value semantics
Anything else?
No response
Are you willing to submit a PR?
The text was updated successfully, but these errors were encountered: