Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Flink]Optimize flink listPartitions speed #4495

Closed
wants to merge 1 commit into from

Conversation

herefree
Copy link
Contributor

Purpose

Flink listPartitions from catalog.listPartitions, caching catalog can increase acquisition speed of partition.

Linked issue: close #xxx

Tests

API and Format

Documentation

@JingsongLi
Copy link
Contributor

Hi @herefree , can you explain what is the difference?

@herefree
Copy link
Contributor Author

Hi @herefree , can you explain what is the difference?

Before, in flink, every time we get partition, we need to get it from readBuilder.newScan().listPartitionEntries(). After modification, we get partitions from Catalog.listPartitions. If Catalog is CachingCatalog, we can get partitions in cache.

@JingsongLi
Copy link
Contributor

Hi @herefree , can you explain what is the difference?

Before, in flink, every time we get partition, we need to get it from readBuilder.newScan().listPartitionEntries(). After modification, we get partitions from Catalog.listPartitions. If Catalog is CachingCatalog, we can get partitions in cache.

Really? How to get partitions from cache? It seems that you do not modify the CachingCatalog.

@herefree
Copy link
Contributor Author

Hi @herefree , can you explain what is the difference?

Before, in flink, every time we get partition, we need to get it from readBuilder.newScan().listPartitionEntries(). After modification, we get partitions from Catalog.listPartitions. If Catalog is CachingCatalog, we can get partitions in cache.

Really? How to get partitions from cache? It seems that you do not modify the CachingCatalog.

image
I found that we have implemented partitionCache in CacheCatalog.
I found a problem. There is no place to refresh the partition in Flink. Sorry,maybe I should turn off this feature.

@herefree herefree closed this Nov 11, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants