Migrate all `VaultsSecrets` handlers to `RawHandler` #822

aryanjassal · 2024-10-08T05:20:04Z

Specification

Most, if not all, VaultsSecrets handlers transfer binary data. Plain JSON is an inefficient carrier for this: the required bandwidth can be as high as 200% of the actual data size.
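A rough way to see where that overhead comes from is sketched below. It assumes, purely for illustration, that the binary payload is carried as a JSON string with one character per byte; `jsonStringSize` is a hypothetical helper and not part of Polykey or its RPC library.

```typescript
// Hypothetical helper: map each byte to one character, JSON-encode the string,
// and count the UTF-8 bytes that would actually go over the wire.
function jsonStringSize(data: Uint8Array): number {
  let binary = '';
  for (let i = 0; i < data.length; i++) {
    binary += String.fromCharCode(data[i]);
  }
  return new TextEncoder().encode(JSON.stringify(binary)).length;
}

// 1024 printable ASCII bytes: one wire byte each, plus the two quotes.
const asciiBytes = new Uint8Array(1024).fill(0x41);
// 1024 bytes of 0xff: each becomes a two-byte UTF-8 sequence.
const highBytes = new Uint8Array(1024).fill(0xff);

console.log('ascii ratio:', jsonStringSize(asciiBytes) / asciiBytes.length);
console.log('high-byte ratio:', jsonStringSize(highBytes) / highBytes.length);
```

Under this assumption, every byte at or above 0x80 costs two bytes on the wire, which is one way to reach roughly double the original size. Other encodings (base64, number arrays) have different ratios.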

This is solved by the RawHandler, which streams raw bytes directly and bypasses that overhead. However, it has issues of its own, such as not supporting error serialisation by default. As a result, any error thrown within the handler context hits the transport layer and causes a read 0 error.

To solve this, each handler needs to implement its own error serialisation and deserialisation. The handler side serialises errors so they pass through as binary data, and the client side deserialises that data back into the original error. Serialising and deserialising every possible error is neither feasible nor desirable. Only create serialisation and deserialisation handlers for the relevant, expected errors, and raise a generic error for all other, unexpected errors.
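A minimal sketch of that allow-list approach follows. All names here (`ErrorSecretNotFound`, `ErrorSecretIsDirectory`, `ErrorUnexpected`, `serialiseError`, `deserialiseError`) are hypothetical and chosen for illustration; the real handlers would use Polykey's own error classes and whatever wire format is agreed on.

```typescript
// Hypothetical expected errors; matched by `name` so the lookup stays explicit.
class ErrorSecretNotFound extends Error {
  name = 'ErrorSecretNotFound';
}
class ErrorSecretIsDirectory extends Error {
  name = 'ErrorSecretIsDirectory';
}
// Generic fallback raised for anything that is not on the allow-list.
class ErrorUnexpected extends Error {
  name = 'ErrorUnexpected';
}

const expectedErrors: Record<string, new (message?: string) => Error> = {
  ErrorSecretNotFound,
  ErrorSecretIsDirectory,
};

function isExpected(name: string): boolean {
  return Object.prototype.hasOwnProperty.call(expectedErrors, name);
}

// Handler side: turn an error into bytes that can travel on the raw stream.
function serialiseError(error: unknown): Uint8Array {
  const payload =
    error instanceof Error && isExpected(error.name)
      ? { name: error.name, message: error.message }
      : { name: 'ErrorUnexpected', message: 'Unexpected error in handler' };
  return new TextEncoder().encode(JSON.stringify(payload));
}

// Client side: rebuild the original error, or the generic one if unknown.
function deserialiseError(bytes: Uint8Array): Error {
  const parsed = JSON.parse(new TextDecoder().decode(bytes)) as {
    name: string;
    message: string;
  };
  const ctor = isExpected(parsed.name)
    ? expectedErrors[parsed.name]
    : ErrorUnexpected;
  return new ctor(parsed.message);
}
```

For example, `deserialiseError(serialiseError(new ErrorSecretNotFound('no such secret')))` yields an error named `ErrorSecretNotFound` with the same message, while a stray `TypeError` comes back as `ErrorUnexpected`. The sketch does not address how the client tells error bytes apart from ordinary data on the stream.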

Additional context

Discussion on this topic: Polykey#799

Tasks

Migrate the following to use RawHandlers
- VaultsSecretsCat
- VaultsSecretsWrite
- VaultsSecretsCopy
- VaultsSecretsMove


linear · 2024-10-08T05:20:09Z

ENG-424 Migrate all `VaultsSecrets` handlers to `RawHandler`

CMCDragonkai · 2024-10-09T19:49:47Z

This needs to be weighed up against making the RPC too specific to our use cases and making it more difficult to use in other clients.

aryanjassal · 2024-12-19T05:27:54Z

Actually, not all VaultsSecrets handlers need to be converted, only the ones which deal with actual binary data, like VaultsSecretsCat. I don't think a raw stream is required for file names, as file names are far less likely than binary file contents to contain characters that need escaping.

This would be crucial, however, for VaultsSecretsCopy and VaultsSecretsMove, as both handlers will send a tar file storing the entire file tree. This will contain a lot of unprintable binary data, and escaping it all for server streams will massively blow up the size.

CMCDragonkai · 2024-12-31T17:39:12Z

A raw stream just means an initial JSON message followed by raw binary data. We don't have a trailing message protocol, so it's easy. This also makes protocol matching possible, such as piping raw binary data from the fs straight into the stream after the initial JSON message.
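The shape described here, one JSON message and then untouched bytes, can be sketched as below. The newline delimiter, the header fields, and the function names `encodeRawMessage` and `decodeRawMessage` are assumptions made for this example; the thread does not say how the RPC library actually finds the end of the JSON message.

```typescript
const NEWLINE = 0x0a;

// Write the JSON header, a delimiter, then the body bytes exactly as they are.
function encodeRawMessage(header: object, body: Uint8Array): Uint8Array {
  const head = new TextEncoder().encode(JSON.stringify(header));
  const out = new Uint8Array(head.length + 1 + body.length);
  out.set(head, 0);
  out[head.length] = NEWLINE;
  out.set(body, head.length + 1);
  return out;
}

// JSON.stringify never emits a raw newline, so the first 0x0a ends the header.
function decodeRawMessage(bytes: Uint8Array): {
  header: unknown;
  body: Uint8Array;
} {
  const split = bytes.indexOf(NEWLINE);
  if (split === -1) throw new Error('Missing header delimiter');
  const header = JSON.parse(new TextDecoder().decode(bytes.subarray(0, split)));
  return { header, body: bytes.subarray(split + 1) };
}

// Hypothetical header; the body may contain any byte, including 0x0a.
const wire = encodeRawMessage(
  { method: 'VaultsSecretsCat', secretName: 'image.bin' },
  new Uint8Array([0x00, 0xff, 0x0a, 0x07]),
);
const decoded = decodeRawMessage(wire);
console.log(decoded.header, decoded.body.length);
```

Because nothing follows the body, the receiver can treat everything after the header as payload until the stream ends, which is what makes piping file contents straight through possible.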

CMCDragonkai · 2024-12-31T17:39:32Z

This makes it far more efficient too. It was the original intention!

aryanjassal · 2025-01-13T07:32:21Z

Another point to note is that error serialisation would be restricted. With regular streams, thrown errors are caught by the RPC internals, serialised, and sent over the RPC as JSON; the other side then converts them back. For raw streams, however, if an error is thrown from inside the RPC, it hits the transport layer and brings down the entire connection with a read 0 message.

To deal with this, we would need to do manual error serialisation/deserialisation, which is fine, but it would restrict which errors we can send over. If we restrict the errors we can send over, then we are restricting the visibility of bugs. If an error happens on the handler, then instead of getting the error, we would get an UnknownError or something similar. If we implement robust error handling, then that is almost what we already have in Polykey, and it would be a duplication of effort. Perhaps the existing implementation of codeToError and errorToCode could be reused as a drop-in, which would simplify the whole process. If it can't, that would be a major issue for usability and for the boilerplate needed in the specific RPC handlers.

tegefaulkes · 2025-01-14T23:52:48Z

If you want to send errors over the binary stream then we need to work out a protocol that can multiplex these errors over the stream. It would have to be implemented by a re-usable transform stream that handles it for us.
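One possible shape for such a protocol is a simple tagged frame, sketched below. The layout (one type byte, a four-byte big-endian length, then the payload) and the names `Frame`, `encodeFrame` and `decodeFrames` are assumptions for illustration, not an agreed design. A reusable transform stream would wrap this logic and buffer partial frames across chunks, which this sketch leaves out.

```typescript
type Frame = { kind: 'data' | 'error'; payload: Uint8Array };

const HEADER_SIZE = 5; // 1 type byte + 4 length bytes

// Frame layout (assumed): [type][length, uint32 big-endian][payload]
function encodeFrame(frame: Frame): Uint8Array {
  const out = new Uint8Array(HEADER_SIZE + frame.payload.length);
  out[0] = frame.kind === 'data' ? 0 : 1;
  new DataView(out.buffer).setUint32(1, frame.payload.length);
  out.set(frame.payload, HEADER_SIZE);
  return out;
}

// Decode a buffer that holds zero or more complete frames.
function decodeFrames(bytes: Uint8Array): Frame[] {
  const frames: Frame[] = [];
  const view = new DataView(bytes.buffer, bytes.byteOffset, bytes.byteLength);
  let offset = 0;
  while (offset < bytes.length) {
    if (offset + HEADER_SIZE > bytes.length) {
      throw new Error('Truncated frame header');
    }
    const kind = bytes[offset] === 0 ? 'data' : 'error';
    const length = view.getUint32(offset + 1);
    const end = offset + HEADER_SIZE + length;
    if (end > bytes.length) throw new Error('Truncated frame payload');
    frames.push({ kind, payload: bytes.subarray(offset + HEADER_SIZE, end) });
    offset = end;
  }
  return frames;
}
```

With framing like this, a handler could emit `data` frames while streaming and a single `error` frame if something fails midway. The cost is that the stream is no longer purely raw, so both sides have to run the same framing code.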

aryanjassal added the development Standard development label Oct 8, 2024

aryanjassal self-assigned this Oct 8, 2024

aryanjassal mentioned this issue Oct 8, 2024

Implement secrets write command MatrixAI/Polykey-CLI#242

Closed

aryanjassal mentioned this issue Oct 18, 2024

Allow parsing just the vault name without requiring the secret path MatrixAI/Polykey-CLI#305

Merged

25 tasks

aryanjassal mentioned this issue Dec 13, 2024

Add cancellation to RPC handlers for the vaults domain #846

Draft
