Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug] streaming read by from-timestamp may be occur exception(snapshot file not fund) sometimes #3479

Open
1 of 2 tasks
Mr-j-yangyu opened this issue Jun 6, 2024 · 4 comments
Labels
bug Something isn't working

Comments

@Mr-j-yangyu
Copy link

Search before asking

  • I searched in the issues and found nothing similar.

Paimon version

0.8

Compute Engine

Flink

Minimal reproduce step

Analyze source code in SnapshotManager.class

when read earliest snapshot file at step2, the snapshot maybe expired and deleted

20240606165856

What doesn't meet your expectations?

query job will failed by throw FileNotFoundException

Anything else?

No response

Are you willing to submit a PR?

  • I'm willing to submit a PR!
@Mr-j-yangyu Mr-j-yangyu added the bug Something isn't working label Jun 6, 2024
@Aitozi
Copy link
Contributor

Aitozi commented Jun 12, 2024

@Mr-j-yangyu Have you try to point to a safer timestamp to read (keep a margin from the oldest snapshot/changelog)?

@Mr-j-yangyu
Copy link
Author

Mr-j-yangyu commented Jun 13, 2024

@Mr-j-yangyu Have you try to point to a safer timestamp to read (keep a margin from the oldest snapshot/changelog)?

@Aitozi It is necessary to read earliest snapshot or changelog in some usage scenarios.
Can add a logic to verify file exist when read earliest snapshot or changelog ?,just read next if not exist to reduce the probability of exception.

@discivigour
Copy link
Contributor

Hi, @Mr-j-yangyu Could you give the detailed minimal reproduce step? The code in this picture will get earliest+1 snapshot in the end.

@eemario
Copy link

eemario commented Dec 27, 2024

Hi, @Mr-j-yangyu @discivigour I have found similar issue with SnapshotManager#earlierOrEqualTimeMills. An exception will be thrown complaining about the snapshot file does not exist. It is a bit difficult to reproduce because it only happens when the earliest snapshot expires between step1 and step 2 in the code picture.
Is there anybody working on fixing this?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

4 participants