
Adding functionality to satisfy v0.itworks #5

Merged
merged 15 commits into from
Sep 4, 2024

Conversation

uncleDecart
Owner

@uncleDecart uncleDecart commented Aug 28, 2024

This commit removes NatsClient and introduces a proper subscribe mechanism, which you can actually use to receive new values.

There is still a bunch of things to do before version v0.itworks; we can keep moving slowly with multiple PRs, or I'll just create a commit for each point in the TODO list.

TODO:

  • Gracefully handle unsubscribe when we delete a value
  • For the subscriber, handle the server response when a subscription is not found
  • Remove NATS from GHA
  • Make nkv persistent, i.e. try loading from checkpoints when creating values
  • Add ID tracking for requests; right now all requests have ID 0, which is not useful, and we need this for any kind of debugging
  • Introduce some graphs
  • Write basic benchmarking
  • Introduce proper errors and their returns
  • Introduce graceful shutdown for the server and notifier via channels (probably oneshot ones)
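The last TODO item (graceful shutdown via channels) could look roughly like the following std-only sketch. It uses `std::sync::mpsc` as a stand-in for tokio's `oneshot` channel, and all names here (`run_server`, the loop shape) are hypothetical, not the actual nkv code:

```rust
use std::sync::mpsc;
use std::thread;

// Hypothetical server loop: it does work until it notices a signal on a
// dedicated shutdown channel (the std analogue of tokio::sync::oneshot).
fn run_server(shutdown_rx: mpsc::Receiver<()>) -> u32 {
    let mut processed = 0;
    loop {
        // In a real server this would poll connections; here we just
        // simulate one unit of work per iteration.
        processed += 1;
        // try_recv() lets the loop notice the signal without blocking.
        if shutdown_rx.try_recv().is_ok() {
            // Flush state / close sockets here before returning.
            return processed;
        }
        if processed >= 1000 {
            return processed; // safety bound for the sketch
        }
    }
}

fn main() {
    let (shutdown_tx, shutdown_rx) = mpsc::channel();
    // For determinism in this sketch, request shutdown before starting.
    shutdown_tx.send(()).unwrap();
    let server = thread::spawn(move || run_server(shutdown_rx));
    let processed = server.join().unwrap();
    println!("server processed {} unit(s) before shutdown", processed);
    assert_eq!(processed, 1);
}
```

With tokio, the same shape would use `oneshot::channel()` and `tokio::select!` between the connection future and the shutdown receiver.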

No more NATS

Signed-off-by: Pavel Abramov <[email protected]>
@uncleDecart uncleDecart added this to the Release v0.itworks milestone Aug 28, 2024
Because that is the correct name to use

Signed-off-by: Pavel Abramov <[email protected]>
That also introduces Notifier Error and NkvClient now has
map instead of single channel, need to think about cleaning
it up

Signed-off-by: Pavel Abramov <[email protected]>
Notify Key Value will load previously stored values and
delete will remove values from filesystem

Signed-off-by: Pavel Abramov <[email protected]>
@uncleDecart uncleDecart changed the title [WIP] Add proper Subscribe mechanism [WIP] Adding functionality to satisfy v0.itworks Sep 2, 2024
This way it is easier to trace and debug things

Signed-off-by: Pavel Abramov <[email protected]>
@uncleDecart
Owner Author

uncleDecart commented Sep 2, 2024

@deitch the PR is ready for review; I'll do the rest of the TODO in separate PR(s).

Edit: I tried to make it easier to review commit-by-commit, i.e. each commit serves one purpose

@uncleDecart uncleDecart changed the title [WIP] Adding functionality to satisfy v0.itworks Adding functionality to satisfy v0.itworks Sep 2, 2024
src/nkv.rs
    let req = ServerRequest::Delete(BaseMessage {
        id: Self::uuid(),
        key,
    });
    self.send_request(&req).await
}

pub async fn subscribe(&mut self, key: String) -> tokio::io::Result<ServerResponse> {
Collaborator

All of this sets up a subscription (nice catch on the "you already subscribed to this key, go away").

The subscriptions are stored in NkvClient.subscriptions, which is a HashMap<String, watch::Receiver<Message>>. How does that get used? The call is to NkvClient.subscribe("somekey"), which just returns the server response. How do I set up a handler that gets triggered for each such change?

Owner Author

Yup, and those changes are sent to tokio::watch, which is a channel storing the latest update. I think it might make sense to make that HashMap public, so that clients could write their own handlers; idk how good a practice it is to pass handlers for subscriptions (the way we're doing it right now)
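The key property of `tokio::sync::watch` mentioned here is that a receiver only ever sees the latest value. A std-only way to sketch the same semantics (purely illustrative, not the nkv implementation) is to drain an mpsc channel and keep only the most recent message:

```rust
use std::sync::mpsc;

fn main() {
    // Stand-in for tokio::sync::watch: the reader only cares about the
    // most recent value, so it drains the queue and keeps the last one.
    let (tx, rx) = mpsc::channel();

    // Several updates arrive before the subscriber gets around to reading.
    for v in [1u32, 2, 3, 42] {
        tx.send(v).unwrap();
    }

    // try_iter() yields everything currently queued; .last() gives the
    // latest update, which is what watch::Receiver would hand out.
    let latest = rx.try_iter().last();
    println!("latest = {:?}", latest);
    assert_eq!(latest, Some(42));
}
```

With the real `watch::Receiver<Message>` stored in `NkvClient.subscriptions`, a caller would instead `.changed().await` and then `.borrow()` the current value.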

Collaborator

What would either of those look like? Sample code?

Owner Author

See here

Collaborator

I think passing handlers for subscriptions would be the right way to do it. You would need an interface trait defining the handler, and then the subscribe call would need to call it. Something like we do now (yeah, I didn't do it 100% right, don't care):

trait Handler {
    fn handle(&self, message: &Message);
}

pub async fn subscribe(&mut self, key: String, handler: Box<dyn Handler>) -> tokio::io::Result<ServerResponse>

Owner Author

But do you want 3 different handlers, for subscribe, update, and delete? I mean, are we talking about a general library again?

Collaborator

Why do we need handlers for client update and delete? The NkvClient user calls delete(); it either succeeds or fails.

I was thinking of a single handler for subscribe(), which receives all changes to key x.

Do you think we need separate ones for "there were updates to the keys", "keys were deleted", and "keys were created"? I think most systems that work async (mainly message buses) just have a "subscribe to this topic and I will tell you everything on it". Having separate handlers makes it more complicated; on the other hand, a single handler means you need boilerplate to work out what kind of update it was.

I would start with one handler for subscribe. Let the code calling NkvClient figure out what to do with the update based on its type. We can always add finer-grained handlers later.
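The "one handler per subscription, dispatch on message type" design could be sketched like this. All names here (`Message`, `Handler`, `PrintHandler`) are hypothetical stand-ins, not the actual nkv types:

```rust
// One handler receives every change for a key; the caller's code decides
// what each kind of message means by matching on the enum.
#[derive(Debug)]
enum Message {
    Update { value: Vec<u8> },
    Delete,
}

trait Handler {
    fn handle(&self, key: &str, msg: &Message);
}

struct PrintHandler;

impl Handler for PrintHandler {
    fn handle(&self, key: &str, msg: &Message) {
        // This match is the "boilerplate" mentioned above: the single
        // handler works out what kind of update it received.
        match msg {
            Message::Update { value } => println!("{} updated to {:?}", key, value),
            Message::Delete => println!("{} deleted", key),
        }
    }
}

fn main() {
    let h = PrintHandler;
    h.handle("service1.key1", &Message::Update { value: vec![42] });
    h.handle("service1.key1", &Message::Delete);
}
```

Finer-grained handlers (one per message kind) would just split the `match` arms into separate trait methods, which can be layered on top of this later.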

Owner Author

Okay, makes sense. I don't want to introduce too many knobs, because that leads to the boilerplate I want to avoid.

Owner Author

Created #7

@deitch
Collaborator

deitch commented Sep 2, 2024

I put some comments inline. One other important thing that I didn't address is the "keyspace". What is our keyspace? I see a few options:

  • Global: every put/get/delete/subscribe uses a single universal key space, so there is no difference between keys named "foo", "bar", "a.b.c", "this*1me"
  • Global but with character restrictions (no *, etc.)
  • Namespaced: either explicit via put(namespace, key, value), or implicit, like NATS, via dot-separated keys, as in put(namespace1.key1, value), which might be stored and thus searched differently. Both of these imply some ability to do get("namespace1.*") or getAll("namespace1"), and similarly for delete and subscribe (it doesn't make sense for put())

@uncleDecart
Owner Author

Regarding the "key space", I believe our best option would be to have it implicit like NATS, via dot-separated keys. Technically it won't change the way we store things; the only thing that would be added is a "get all by regex" special case. We can then update the documentation.

@deitch
Collaborator

deitch commented Sep 2, 2024

Are you sure that implicit doesn't change storage? There can be significant performance differences between get("foo.*") when it's all in a single table and you need to check each key, vs separate tables.

@uncleDecart
Owner Author

Are you sure that implicit doesn't change storage? There can be significant performance differences between get("foo.*") when it's all in a single table and you need to check each key, vs separate tables.

True, performance will be different, but it depends on how many keys we'll have. Say we are talking about 20 services, which are first-order namespaces; each has 25 subscriptions/publications, which are second-order namespaces, and each of those contains N values per Edge App (say 25 as well). So it's ~12,500 entries in total. How much would it improve performance to iterate over 625 entries instead of 12,500 for each request? Well, I need to add benchmarks so that we could see it in microseconds :D
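The comparison above (a linear scan over ~12,500 flat keys vs a 625-entry partition) can be sanity-checked with a quick std-only sketch. The key shapes (`svcN.topicN.valN`) and the timing here are purely illustrative, not a real benchmark:

```rust
use std::collections::HashMap;
use std::time::Instant;

// Count keys matching a prefix by scanning every key in the map,
// which is what a flat, single-table key space would have to do.
fn scan(map: &HashMap<String, u32>, prefix: &str) -> usize {
    map.keys().filter(|k| k.starts_with(prefix)).count()
}

fn main() {
    // 20 services x 25 topics x 25 values = 12,500 entries, as above.
    let mut flat = HashMap::new();
    for s in 0..20 {
        for t in 0..25 {
            for v in 0..25 {
                flat.insert(format!("svc{}.topic{}.val{}", s, t, v), 0u32);
            }
        }
    }
    assert_eq!(flat.len(), 12_500);

    let start = Instant::now();
    let hits = scan(&flat, "svc7.");
    let elapsed = start.elapsed();
    // One service owns 25 * 25 = 625 of the 12,500 entries.
    assert_eq!(hits, 625);
    println!("scanned {} keys, {} matches in {:?}", flat.len(), hits, elapsed);
}
```

A partitioned layout (one map per first-order namespace) would scan only the 625 entries of the relevant service; whether the difference matters at this scale is exactly what the planned benchmarks would show.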

@deitch
Collaborator

deitch commented Sep 2, 2024

You're thinking EVE. I'm thinking you're building a generic, high-performance, low-footprint KV store.

@uncleDecart
Owner Author

uncleDecart commented Sep 2, 2024

You're thinking EVE. I'm thinking you're building a generic, high-performance, low-footprint KV store.

The question then is: how many layers of nested namespaces should we allow in such a generic store?

@deitch
Collaborator

deitch commented Sep 2, 2024

Truth is, it doesn't matter. Define the API on day one; you can always optimize the backend later. You just need to think about what the API would constrain later.

@uncleDecart
Owner Author

Then making it implicit makes sense; we can change the implementation later, so the client can do wildcard masks on dot patterns, wdyt?

@deitch
Collaborator

deitch commented Sep 2, 2024

Sure. Start with that, use regex for linear search. Split it in the future.
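The "wildcard mask over a flat key space, linear search" idea could start out like the sketch below. To stay dependency-free it hand-rolls dot-segment matching instead of pulling in the regex crate suggested above; the function name `matches` is hypothetical:

```rust
// Match a dot-separated wildcard pattern like "service1.*" against a key.
// "*" matches exactly one segment; a real implementation could instead
// compile the pattern to a regex and do the same linear scan.
fn matches(pattern: &str, key: &str) -> bool {
    let p: Vec<&str> = pattern.split('.').collect();
    let k: Vec<&str> = key.split('.').collect();
    p.len() == k.len() && p.iter().zip(&k).all(|(ps, ks)| *ps == "*" || ps == ks)
}

fn main() {
    let keys = ["service1.key1", "service1.key2", "service2.key1"];
    let hits: Vec<&str> = keys
        .iter()
        .copied()
        .filter(|k| matches("service1.*", k))
        .collect();
    println!("{:?}", hits);
    assert_eq!(hits, ["service1.key1", "service1.key2"]);
}
```

Because the scan only touches keys, switching the backend later (per-namespace tables, a trie, etc.) would not change this API, which is the point made above about defining the API first.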

It is not needed anymore

Signed-off-by: Pavel Abramov <[email protected]>
@uncleDecart uncleDecart mentioned this pull request Sep 2, 2024
@uncleDecart
Owner Author

Put a description in #6; will address it in a separate PR, it'll need some love

// Storage with the ability to notify clients about changes
// made to a value. When created via new() it will try to
// load values from a folder. The underlying structure is a HashMap
// and is designed to be accessed synchronously.
Collaborator

Here it gets confusing. Is a "client" what NkvClient is? Or is NkvClient a network client that talks to some server structure, that wraps around NotifyKeyValue? I think you mean the second, and the "client" here is the server (which then handles network communication), but it is not 100% clear.

Collaborator

Regarding "synchronously":

Let me see if I get this, and then we can update the docs.

CLI client  --> NkvClient --> network --> CLI server --> NotifyKeyValue --> filesystem

Is that correct? If so:

  • is NotifyKeyValue an actual implementation? Are there traits it implements, so that it can be plugged in with another implementation in the future? I am thinking about testing but also pure in-memory, etc. We have learned from EVE pubsub that having pluggable implementations for the backend storage is very useful. Or is it that there is a trait that the server would expect, and NotifyKeyValue implements that trait? I think the other way around, as NotifyKeyValue describes the behaviour, and using the filesystem is a concrete implementation. Or is it already this way and I just missed it?
  • can we update the comment to make it clear what the role of it is, what we mean by a "client"?
  • synchronous, then, is a specific of the implementation?

Owner Author

Yes, this is correct. NotifyKeyValue is an actual implementation; we can extract traits from it (put, get, delete, subscribe, unsubscribe) so that we can create variations. The idea behind NotifyKeyValue is that it is just a container of keys and Values, which are actually a composition of PersistentValue and Notifier, so every time you update a value you notify subscribers and store the value on disk. Technically, we can implement a Builder pattern which builds you the necessary NotifyKeyValue with a PersistentValue or an in-memory Value, and with the Notifier being a TCPNotifier or a UnixSocketNotifier or whatever you want it to be.

It is sync under the hood, but from the user (or client) perspective, whenever you create a NotifyKeyValue object you interact with it via channels, so you can access NotifyKeyValue in an async manner; under the hood it's a queue of channel messages processed synchronously. So technically, from the NotifyKeyValue perspective, the Server is a client :D Meaning the Server creates a NotifyKeyValue instance and communicates with it via channels, and those channels are created for each connection the server handles.
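The pattern described here, a synchronous store driven through a command queue, can be sketched std-only as follows. The `Command` enum, its variants, and the thread layout are hypothetical illustrations of the idea, not the actual nkv code (which uses tokio channels):

```rust
use std::collections::HashMap;
use std::sync::mpsc;
use std::thread;

// Commands the "client" (the server, from the store's point of view)
// sends over the channel; Get carries its own reply channel, one per
// request, as each connection would have.
enum Command {
    Put { key: String, value: Vec<u8> },
    Get { key: String, reply: mpsc::Sender<Option<Vec<u8>>> },
    Stop,
}

fn main() {
    let (tx, rx) = mpsc::channel();

    // The store thread: a plain HashMap, mutated strictly one command
    // at a time -- sync under the hood, async at the channel boundary.
    let store = thread::spawn(move || {
        let mut map: HashMap<String, Vec<u8>> = HashMap::new();
        for cmd in rx {
            match cmd {
                Command::Put { key, value } => {
                    map.insert(key, value);
                }
                Command::Get { key, reply } => {
                    reply.send(map.get(&key).cloned()).unwrap();
                }
                Command::Stop => break,
            }
        }
    });

    // The client side only ever touches channels, never the map.
    tx.send(Command::Put { key: "k1".into(), value: vec![42] }).unwrap();
    let (reply_tx, reply_rx) = mpsc::channel();
    tx.send(Command::Get { key: "k1".into(), reply: reply_tx }).unwrap();
    let value = reply_rx.recv().unwrap();
    println!("{:?}", value);
    assert_eq!(value, Some(vec![42]));

    tx.send(Command::Stop).unwrap();
    store.join().unwrap();
}
```

This is also why pluggable backends fall out naturally: only the store loop would change if the HashMap were swapped for an in-memory-only or differently persisted implementation.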

src/nkv.rs (Outdated)
Comment on lines 476 to 505
let sub_resp = client.subscribe(key.clone()).await.unwrap();
assert_eq!(
    sub_resp,
    request_msg::ServerResponse::Base(request_msg::BaseResp {
        id: "0".to_string(),
        status: http::StatusCode::OK,
        message: "Subscribed".to_string(),
    })
);

// Give server time to subscribe
tokio::time::sleep(tokio::time::Duration::from_millis(100)).await;

let new_value: Box<[u8]> = Box::new([42, 0, 1, 0, 1]);
let resp = client.put(key.clone(), new_value.clone()).await.unwrap();
assert_eq!(
    resp,
    request_msg::ServerResponse::Base(request_msg::BaseResp {
        id: "0".to_string(),
        status: http::StatusCode::OK,
        message: "No Error".to_string(),
    })
);

let result = client.latest_state(&key).await;
assert!(result.is_ok());
match result {
    Ok(Message::Update { value }) => {
        assert_eq!(value, new_value)
    }
    _ => panic!("Expected no errors"),
}
Owner Author

@deitch here is what getting the latest state looks like; otherwise we can make state pub and basically allow the user to implement their own variation of latest_state

It's enough time to get the server up and running, and that
reduces test time

Signed-off-by: Pavel Abramov <[email protected]>
It is not needed

Signed-off-by: Pavel Abramov <[email protected]>
@uncleDecart
Owner Author

Actually I got too into it, so I finished the whole TODO list. @deitch I see one unresolved discussion about documentation; once it's done I think we're safe to merge this and proceed with other issues.

@uncleDecart uncleDecart merged commit 41a89ed into main Sep 4, 2024
1 check passed
@uncleDecart uncleDecart deleted the add-client branch September 4, 2024 13:23