Describe the bug

The AsyncShardFetch revamp strategy (explained in #5098 (comment)) uses the ShardId object directly as the key when storing the metadata received from every node for every shard. Overall memory usage therefore grows as

ShardId object size × shard_count × node_count

which runs into GBs, because the ShardId object carries a lot of data. An optimization issue is already open to use a smaller key: #12010. Even after reducing the key from 208 bytes to 72 bytes (using a string of the form indexUUID_shardNumber), the overall heap impact is still about 16 GB for a 500-node, 500K-shard setup.
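For concreteness, the arithmetic behind those figures for the 500-node, 500K-shard setup:

```text
original key: 208 B × 500,000 shards × 500 nodes = 5.2 × 10^10 B ≈ 48 GiB
reduced key:   72 B × 500,000 shards × 500 nodes = 1.8 × 10^10 B ≈ 16.8 GiB
```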
Because we store the full response (T) from the data nodes (see OpenSearch/server/src/main/java/org/opensearch/gateway/AsyncShardFetch.java, line 89 at a0b5198), the cache keeps a structure like:

Map<NodeId, Map<ShardId, ShardMetaData>>

Note: ShardMetaData is not an actual class; it is used here only to denote the per-shard metadata (primary/replica). As the structure shows, the ShardId key (72 bytes) is repeated once for every node.
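A minimal sketch of that duplication, using plain String keys and a hypothetical ShardMetaData record rather than the real OpenSearch types:

```java
import java.util.HashMap;
import java.util.Map;

public class CurrentCacheShape {
    // Illustrative stand-in; ShardMetaData is not a real OpenSearch class.
    record ShardMetaData(boolean primary) {}

    public static void main(String[] args) {
        // Current shape: Map<NodeId, Map<ShardId, ShardMetaData>>.
        Map<String, Map<String, ShardMetaData>> cache = new HashMap<>();

        for (String nodeId : new String[] { "node1", "node2", "node3" }) {
            // Each node's response is deserialized independently, so every
            // inner map ends up with its own copy of the same logical key.
            String shardKey = new String("indexUUID_0");
            cache.computeIfAbsent(nodeId, n -> new HashMap<>())
                 .put(shardKey, new ShardMetaData(true));
        }
        // 3 nodes -> 3 heap copies of one key; at 500 nodes x 500K shards the
        // repeated 72 B keys alone account for the ~16 GB described above.
        System.out.println("nodes holding a copy of the key: " + cache.size());
    }
}
```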
Approach

We should use an array/ArrayList instead of Map<ShardId, ShardMetaData>, so that each node's entry stores only the per-shard data, not the shard key itself. The ShardId should then be stored only once in the cache, for example in a separate map that records each shard's array index:

Map<ShardId, array_index>
Map<NodeId, ShardMetadata[]>

With this layout the heap holds each ShardId exactly once, i.e. about 34 MB (72 B × 500K shards), and we save the ~16 GB of repeated keys. A sketch of this layout follows.
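Here is a minimal sketch of the proposed layout, again with illustrative names rather than the actual AsyncShardFetch types:

```java
import java.util.HashMap;
import java.util.Map;

public class DedupedCacheShape {
    // Illustrative stand-in; ShardMetaData is not a real OpenSearch class.
    record ShardMetaData(boolean primary) {}

    // Each shard key is stored exactly once, mapped to a stable slot index.
    private final Map<String, Integer> shardSlots = new HashMap<>();
    // Per node, a plain array of metadata indexed by slot; keys never repeat.
    private final Map<String, ShardMetaData[]> perNode = new HashMap<>();
    private final int maxShards;

    DedupedCacheShape(int maxShards) {
        this.maxShards = maxShards;
    }

    void put(String nodeId, String shardKey, ShardMetaData meta) {
        Integer slot = shardSlots.get(shardKey);
        if (slot == null) {
            slot = shardSlots.size();        // next free slot
            shardSlots.put(shardKey, slot);  // key stored once, for all nodes
        }
        perNode.computeIfAbsent(nodeId, n -> new ShardMetaData[maxShards])[slot] = meta;
    }

    ShardMetaData get(String nodeId, String shardKey) {
        Integer slot = shardSlots.get(shardKey);
        ShardMetaData[] row = perNode.get(nodeId);
        return (slot == null || row == null) ? null : row[slot];
    }

    public static void main(String[] args) {
        DedupedCacheShape cache = new DedupedCacheShape(500_000);
        cache.put("node1", "indexUUID_0", new ShardMetaData(true));
        cache.put("node2", "indexUUID_0", new ShardMetaData(false));
        // "indexUUID_0" lives once in shardSlots, not once per node.
        System.out.println(cache.get("node2", "indexUUID_0"));
    }
}
```

The slot map is written once per shard while each node contributes only a metadata array, so key storage no longer scales with node_count.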
Related component
Cluster Manager
To Reproduce
1. Use the latest code of this project.
2. Spin up a new cluster with 500 nodes and 500K shards.
3. Restart all cluster manager nodes at once.
4. Watch the heap fill up; usage can reach 50 GB.
Expected behavior
Ideally, the ShardId key should not be repeated per node, even after its size is reduced.
Additional Details
This change depends on the already-open PRs under the overall project #8098 to revamp the reroute flow.