Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug] query values with the same primary key #3157

Closed
2 tasks done
zhourui999 opened this issue Apr 5, 2024 · 1 comment
Closed
2 tasks done

[Bug] query values with the same primary key #3157

zhourui999 opened this issue Apr 5, 2024 · 1 comment
Labels
bug Something isn't working

Comments

@zhourui999
Copy link
Contributor

zhourui999 commented Apr 5, 2024

Search before asking

  • I searched in the issues and found nothing similar.

Paimon version

0.8-snapshot

Compute Engine

spark 3.1
flink-1.17.2

Minimal reproduce step

A large online table with frequent data writing and updates, with a data volume of approximately 600 million. CDC synchronizes to the payment table, and after running for a period of time, it is found that the data in the table has duplicate primary keys (using SparkSQL and FlinkSQL queries will result in duplicate data). After executing compact, the same primary key will be merged out

image

What doesn't meet your expectations?

The query result does not have duplicate primary keys

Anything else?

No response

Are you willing to submit a PR?

  • I'm willing to submit a PR!
@JingsongLi
Copy link
Contributor

Thanks @zhourui999

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants