[Refactoring] Abstract AsyncShardFetch cache to allow restructuring for batch mode #12440

Closed
amkhar opened this issue Feb 23, 2024 · 6 comments
Labels: bug (Something isn't working), Cluster Manager

Comments

amkhar (Contributor) commented Feb 23, 2024

Describe the bug

Overall project issue : #8098

The AsyncShardFetch class cache is declared as: private final Map<String, NodeEntry<T>> cache = new HashMap<>();
This map always stores the full response (T) returned from the data nodes when we fetch shard metadata after a node leaves or joins. The new transport actions return the data for all shards in a map of the form Map<ShardId, ShardResponse>.
So when we fill the cache, it effectively stores the data as

Map<String, Map<ShardId, ShardResponse>>

This map stores the data per nodeId, so every ShardId key is repeated for every node. Instead of repeating ShardId for each node, we should store just the data in an array and keep the ShardId-to-index mapping separately.

The current map, private final Map<String, NodeEntry<T>> cache, cannot support this use case.
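
For illustration, a minimal sketch of the two layouts. The type names here (ShardId, NodeShardResponse, CacheLayoutSketch) are simplified placeholders, not the actual OpenSearch classes:

```java
import java.util.HashMap;
import java.util.Map;

// Placeholder types for illustration only; in OpenSearch these would be the
// real ShardId and per-shard response classes.
record ShardId(String indexName, int shardNum) {}
record NodeShardResponse(String allocationId) {}

class CacheLayoutSketch {
    // Current layout: one full Map<ShardId, NodeShardResponse> per nodeId,
    // so every ShardId key is duplicated for every node.
    private final Map<String, Map<ShardId, NodeShardResponse>> currentLayout = new HashMap<>();

    // Proposed layout: a single ShardId -> array-index mapping shared across nodes,
    // plus a compact per-node array holding only the response data.
    private final Map<ShardId, Integer> shardIdToIndex = new HashMap<>();
    private final Map<String, NodeShardResponse[]> proposedLayout = new HashMap<>();
}
```

In the proposed layout, the ShardId keys are stored once, and each node only contributes an array of responses indexed by the shared mapping.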

Related issue for adding a new caching strategy for a batch of shards #12248

Related component

Cluster Manager

To Reproduce

N/A

Expected behavior

The current cache cannot support other strategies, so abstracting this implementation into a separate class will make it easier to implement other caching strategies.
To put that restructuring in place, I'm suggesting three simple methods:

initData - initialize the cache entries
getData - read data from the cache
putData - store data in the cache

Each caching strategy should implement these methods and decide how the data is handled internally. The driver class should only care about the node-level response and executing the end-to-end flow (common functionality goes into a base class to avoid duplication).

This way, the new caching strategy for a batch of shards can store the response data in a suitable format (such as an array) and read it back accordingly, while the existing implementation stays the same. A rough sketch of the abstraction follows below.
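
A minimal sketch of what such an abstraction could look like. Only the method names initData/getData/putData come from the description above; the class name, type parameter, and signatures are illustrative assumptions, not the final API:

```java
import java.util.List;
import java.util.Map;

// Hypothetical sketch of the proposed cache abstraction.
abstract class ShardCache<V> {

    // Initialize cache entries for the given data nodes.
    abstract void initData(List<String> nodeIds);

    // Store a node-level response in whatever layout the strategy chooses.
    abstract void putData(String nodeId, V nodeResponse);

    // Read cached data back for the driver class, keyed by nodeId.
    abstract Map<String, V> getData(List<String> nodeIds);
}
```

With this split, the existing per-shard implementation can keep its current map-based layout behind these methods, while a batch implementation stores per-shard data in arrays with a separate ShardId-to-index mapping.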

Additional Details

N/A

peternied (Member) commented:

[Triage - attendees 1 2 3 4 5]
@amkhar Thanks for creating this issue; however, it isn't being accepted because it doesn't have enough context on its own. Please feel free to open a new issue with more details.

github-project-automation bot moved this from 🆕 New to ✅ Done in Cluster Manager Project Board Feb 28, 2024

amkhar (Contributor, Author) commented Feb 28, 2024

Sure @peternied, I'll add more details in this issue itself (if it's okay to re-open it with more details?).

Just curious whether you went through #12248 to understand how we want to implement the cache for the new transport actions being written for a batch of shards.

For that implementation to be clean, we first need to turn the current cache (a map) into a class, so that the existing implementation and the new implementation can live in separate child classes.

peternied (Member) commented:

Just curious if you went through...

This issue was reviewed by the triage team during the core triage meeting; we read the issue as it stands. While additional context and links are useful, the general guidance is that the problem and the expected end state should be clearly described in the issue itself.

if it's okay to re-open with more details

If you have the ability to do so, please feel free to reopen.

amkhar (Contributor, Author) commented Feb 28, 2024

Thanks for the confirmation.
I've added more details :)

amkhar reopened this Feb 28, 2024
github-project-automation bot moved this from ✅ Done to 🏗 In progress in Cluster Manager Project Board Feb 28, 2024
peternied (Member) commented:

[Triage - attendees 1 2 3 4 5]
@amkhar Thanks for the additional context - this issue looks good.

amkhar (Contributor, Author) commented Mar 14, 2024

Completed by #12441

amkhar closed this as completed Mar 14, 2024
github-project-automation bot moved this from 🏗 In progress to ✅ Done in Cluster Manager Project Board Mar 14, 2024