feat(cache): add support for surrogate cache key #6234

bnjjj · 2024-11-06T09:50:46Z

Context

Existing caching systems often support a concept of surrogate keys, where a key can be linked to a specific piece of cached data, independently of the actual cache key.

As an example, a news website might want to invalidate all cached articles linked to a specific company or person following an event. To that end, when returning the article, the service can add a surrogate key to the article response, and the cache would keep a map from surrogate keys to cache keys.

Surrogate keys and the router’s entity cache

To support a surrogate key system with the entity caching in the router, we make the following assumptions:

The subgraph returns surrogate keys with the response. The router will not manipulate those surrogate keys directly. Instead, it leaves that task to a coprocessor
The coprocessor tasked with managing surrogate keys will store the mapping from surrogate keys to cache keys. It will be useful to invalidate all cache keys related to a surrogate cache key in Redis.
The router will expose a way to gather the cache keys used in a subgraph request

Router side support

The router has two features to support surrogate cache key:

An id field for subgraph requests and responses. This is a random, unique id per subgraph call that can be used to keep state between the request and response side, and keep data from the various subgraph calls separately for the entire client request. You have to enable it in configuration (subgraph_request_id):

coprocessor:
  url: http://127.0.0.1:3000 # mandatory URL which is the address of the coprocessor
  supergraph:
    response: 
      context: true
  subgraph:
    all:
      response: 
        subgraph_request_id: true
        context: true

The entity cache has an option to store in the request context, at the key apollo::entity_cache::cached_keys_status, a map subgraph request id => cache keys only when it's enabled in the configuration (expose_keys_in_context)):

preview_entity_cache:
  enabled: true
  expose_keys_in_context: true
  metrics:
    enabled: true
  invalidation:
    listen: 0.0.0.0:4000
    path: /invalidation
  # Configure entity caching per subgraph
  subgraph:
    all:
      enabled: true
      # Configure Redis
      redis:
        urls: ["redis://localhost:6379"]
        ttl: 24h # Optional, by default no expiration

The coprocessor will then work at two stages:

Subgraph response:
- Extract the subgraph request id
- Extract the list of surrogate keys from the response
Supergraph stage:
- Extract the map subgraph request id => cache keys
- Match it with the surrogate cache keys obtained at the subgraph response stage

The coprocessor then has a map of surrogate keys => cache keys that it can use to invalidate cached data directly from Redis.

Example workflow

The router receives a client request
The router starts a subgraph request:
- The entity cache plugin checks if the request has a corresponding cached entry:
  - If the entire response can be obtained from cache, we return a response here
  - If it cannot be obtained, or only partially (_entities query), a request is transmitted to the subgraph
- The subgraph responds to the request. The response can contain a list of surrogate keys in a header: Surrogate-Keys: homepage, feed
- The subgraph response stage coprocessor extracts the surrogate keys from headers, and stores it in the request context, associated with the subgraph request id 0e67db40-e98d-4ad7-bb60-2012fb5db504:

{
  "0ee3bf47-5e8d-47e3-8e7e-b05ae877d9c7": ["homepage", "feed"]
}

The entity cache processes the subgraph response:
- It generates a new subgraph response by interspersing data it got from cache with data from the original response
- It stores the list of keys in the context. new indicates newly cached data coming from the subgraph, linked to the surrogate keys, while cached is data obtained from the cache. These are the keys directly used in Redis:

{
  "apollo::entity_cache::cached_keys_status": {
    "0ee3bf47-5e8d-47e3-8e7e-b05ae877d9c7": [
      {
        "key": "version:1.0:subgraph:products:type:Query:hash:af9febfacdc8244afc233a857e3c4b85a749355707763dc523a6d9e8964e9c8d:data:d9d84a3c7ffc27b0190a671212f3740e5b8478e84e23825830e97822e25cf05c",
        "status": "new",
        "cache_control": "max-age=60,public"
      }
    ]
  }
}

The supergraph response stage loads data from the context and creates the mapping:

{
  "homepage": [
    {
      "key": "version:1.0:subgraph:products:type:Query:hash:af9febfacdc8244afc233a857e3c4b85a749355707763dc523a6d9e8964e9c8d:data:d9d84a3c7ffc27b0190a671212f3740e5b8478e84e23825830e97822e25cf05c",
      "status": "new",
      "cache_control": "max-age=60,public"
    }
  ],
  "feed": [
    {
      "key": "version:1.0:subgraph:products:type:Query:hash:af9febfacdc8244afc233a857e3c4b85a749355707763dc523a6d9e8964e9c8d:data:d9d84a3c7ffc27b0190a671212f3740e5b8478e84e23825830e97822e25cf05c",
      "status": "new",
      "cache_control": "max-age=60,public"
    }
  ]
}

When a surrogate key must be used to invalidate data, that mapping is used to obtained the related cache keys

Checklist

Complete the checklist (and note appropriate exceptions) before the PR is marked ready-for-review.

Exceptions

Note any exceptions here

Notes

It may be appropriate to bring upcoming changes to the attention of other (impacted) groups. Please endeavour to do this before seeking PR approval. The mechanism for doing this will vary considerably, so use your judgement as to how and when to do this. ↩
Configuration is an important part of many changes. Where applicable please try to document configuration examples. ↩
Tick whichever testing boxes are applicable. If you are adding Manual Tests, please document the manual testing (extensively) in the Exceptions. ↩

Signed-off-by: Benjamin <[email protected]>

svc-apollo-docs · 2024-11-06T09:50:49Z

✅ Docs Preview Ready

No new or changed pages found.

github-actions · 2024-11-06T09:51:00Z

@bnjjj, please consider creating a changeset entry in /.changesets/. These instructions describe the process and tooling.

router-perf · 2024-11-06T09:51:18Z

CI performance tests

apollo-router/src/plugins/cache/entity.rs

Geal · 2024-11-06T10:05:11Z

at line 714 let (new_entities, new_errors) = assemble_response_from_errors(, when we got an error from the subgraph response, we still return a partial response with some data from cache, so we need to store the cache keys for those entities

Signed-off-by: Benjamin <[email protected]>

…_659

examples/coprocessor-surrogate-cache-key/README.md

docs/source/routing/performance/caching/entity.mdx

apollo-router/src/plugins/cache/entity.rs

Signed-off-by: Benjamin <[email protected]>

…_659

Signed-off-by: Benjamin <[email protected]>

…ample Applies to #6234

…ample (#6284)

feat(cache): add support for surrogate cache key

ba3779c

Signed-off-by: Benjamin <[email protected]>

Geal reviewed Nov 6, 2024

View reviewed changes

apollo-router/src/plugins/cache/entity.rs Outdated Show resolved Hide resolved

Geal reviewed Nov 6, 2024

View reviewed changes

apollo-router/src/plugins/cache/entity.rs Show resolved Hide resolved

bnjjj added 3 commits November 6, 2024 11:54

update snapshot

0219859

Signed-off-by: Benjamin <[email protected]>

add tests with snapshots

898f156

Signed-off-by: Benjamin <[email protected]>

fix lint

9065a18

Signed-off-by: Benjamin <[email protected]>

bnjjj requested review from Geal, garypen and BrynCooke November 6, 2024 16:01

bnjjj marked this pull request as ready for review November 6, 2024 16:02

bnjjj requested review from a team as code owners November 6, 2024 16:02

bnjjj added 2 commits November 7, 2024 16:00

add example using node js

71f77e3

Signed-off-by: Benjamin <[email protected]>

add configuration in the docs

63c750c

Signed-off-by: Benjamin <[email protected]>

bnjjj requested a review from a team as a code owner November 7, 2024 15:31

bnjjj added 2 commits November 7, 2024 16:50

fix test

c2b3033

Signed-off-by: Benjamin <[email protected]>

Merge branch 'dev' of github.com:apollographql/router into bnjjj/feat…

a7f33c9

…_659

Geal approved these changes Nov 18, 2024

View reviewed changes

BrynCooke requested changes Nov 18, 2024

View reviewed changes

bnjjj added 6 commits November 18, 2024 16:45

remove expect

dc13826

Signed-off-by: Benjamin <[email protected]>

update snapshot cache keys

bda744f

Signed-off-by: Benjamin <[email protected]>

Merge branch 'dev' of github.com:apollographql/router into bnjjj/feat…

2ea7c97

…_659

update snapshot cache keys

8a1457f

Signed-off-by: Benjamin <[email protected]>

fix some lints

fe0b676

Signed-off-by: Benjamin <[email protected]>

update readme

54d6199

Signed-off-by: Benjamin <[email protected]>

BrynCooke approved these changes Nov 19, 2024

View reviewed changes

abernix added a commit that referenced this pull request Nov 19, 2024

Use a Map rather than object for coprocessor surrogate cache key ex…

0f0b60a

…ample Applies to #6234

abernix mentioned this pull request Nov 19, 2024

Use a Map rather than object for coprocessor surrogate cache key example #6284

Merged

Use a Map rather than object for coprocessor surrogate cache key ex…

bcc6780

…ample (#6284)

bnjjj merged commit 83e6291 into dev Nov 20, 2024
14 checks passed

bnjjj deleted the bnjjj/feat_659 branch November 20, 2024 09:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(cache): add support for surrogate cache key #6234

feat(cache): add support for surrogate cache key #6234

bnjjj commented Nov 6, 2024 •

edited by jira bot

Loading

svc-apollo-docs commented Nov 6, 2024 •

edited

Loading

github-actions bot commented Nov 6, 2024

router-perf bot commented Nov 6, 2024

Geal commented Nov 6, 2024

feat(cache): add support for surrogate cache key #6234

feat(cache): add support for surrogate cache key #6234

Conversation

bnjjj commented Nov 6, 2024 • edited by jira bot Loading

Context

Surrogate keys and the router’s entity cache

Router side support

Example workflow

Footnotes

svc-apollo-docs commented Nov 6, 2024 • edited Loading

✅ Docs Preview Ready

github-actions bot commented Nov 6, 2024

router-perf bot commented Nov 6, 2024

Geal commented Nov 6, 2024

bnjjj commented Nov 6, 2024 •

edited by jira bot

Loading

svc-apollo-docs commented Nov 6, 2024 •

edited

Loading