-
-
Notifications
You must be signed in to change notification settings - Fork 93
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Investigate context deadline exceeded with DragonflyDB #389
Comments
Please let us know how it goes |
I was unable to reproduce this error with benchmarks, tried on MacOS (Docker) and Ubuntu 24.04 (with Docker and without Docker). Some other findings: Redis overperforms Dragonflydb up to order of magnitude in Centrifuge benchmarks. I am explaining this to myself that we use pipelining over a single connection in Centrifuge, and I guess DF batches calls over uring collecting them from different connections. And this means requests coming through a single conn are just waiting more time to be executed. This is actually not bad since in practice we have many Centrifuge nodes and eventually a higher throughput could be achieved potentially. I quickly tried running 10 instances of benchmarks in parallel, I see that a higher throughput may be achieved with DF in this case, so for me it proves the theory above. Still was far away from Redis throughput though. And CPU was like 450% compared to 100% of Redis. This is the limit for Redis, but it's clear how Redis provides the best throughput it can on a single core. Also, latencies are very unstable with DF when using several pipelining connections, running the same bench may result into 20k rps, then into 100k rps, then again 20k. While with Redis latencies are stable and benchmark rps is always consistent. For now I've run presence benchmarks, Centrifuge uses Lua in such requests. It's not very handy to experiment at this point, had to do many manual tweaks, so would be nice to automate various bench conditions. |
@FZambia , thank you for performing these tests. Can you instruct me on how to run a centrifuge benchmark? |
Yep - let me prepare sth suitable for reproducing various scenarios in convenient way, now it's not trivial. But a benchmark which just uses a single connection with pipelining may be run like this after cloning:
Redis bench (uses 6379 port):
Same but with Dragonfly (uses different port - 7379):
Go 1.21 or higher should be installed. These benches run many ops in parallel, all operations are then collected to a single pipeline. I'll try to find and implement a simple way to run benches which utilize several pipelines instead of one - to quickly experiment how it scales adding more connections. |
I will check it out, thanks! Does centrifuge usually open a single upstream connection from a single centrifuge process? |
Usually yes, it uses single pipeline. But that's what I've been talking in comments above - i tried to run multiple conns with separate pipelines and for now could not achieve good results, but I had super hacky bash scripts to experiment, will try to write a cleaner Go bench with more conns |
This seems related to centrifugal/centrifugo#925 – nothing about DragonflyDB, caused by deadlock in Centrifugo code which blocks Redis client read loop. |
The fix was shipped in Centrifuge v0.33.5, I am 99% sure it's the same reason here, closing. There is a chance that this may be re-opened if we have more cases in new versions. |
There was a report in community TG:
Current assumption is that benchmarks should help to reproduce.
The text was updated successfully, but these errors were encountered: