
Poll for messages using TaskExecutor #178

Merged

Conversation

@mimischi (Member)

We currently sleep for `pollInterval` when no new messages have been polled from the cluster. This leads to unnecessary slowness of the client. Instead of doing that, we now break up the polling of messages into two distinct steps:

  1. Attempt to poll synchronously: if a message is polled, we return it. If there is no message, we immediately go to step 2.
  2. We create a `DispatchQueue` and run the `consumerPoll` on it using `withTaskExecutorPreference`. We make the `consumerPoll` call wait for up to `pollInterval` before bailing.

This prevents us from sleeping on the running thread, and frees up cycles to do other work if required.
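To make the flow concrete, here is a minimal sketch of the two steps. The surrounding type and the argument-less synchronous poll in step 1 are assumptions; the step 2 call mirrors the `withTaskExecutorPreference` excerpt reviewed further down.

```swift
// Sketch only: not the exact implementation from this PR.
func nextMessage() async throws -> KafkaConsumerMessage? {
    // Step 1: non-blocking poll. If a message is already available,
    // return it straight away without leaving the current executor.
    if let message = try client.consumerPoll() {
        return message
    }

    // Step 2: hop onto the DispatchQueue-backed task executor and let the
    // poll block for up to `pollInterval`, keeping the cooperative thread
    // pool free to run other tasks in the meantime.
    return try await withTaskExecutorPreference(self.queue) {
        try client.consumerPoll(for: Int32(self.pollInterval.inMilliseconds))
    }
}
```

In the actual diff this path is guarded by `#if swift(>=6.0)`, since `withTaskExecutorPreference` and `TaskExecutor` are only available from Swift 6 on.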

Resolves #165

@mimischi added the 🔨 semver/patch (No public API change.) label on Nov 14, 2024
Can't use the current implementation before Swift 6.
@mimischi force-pushed the issue-165-nonblocking-poll-task-executor branch from 9c4b7dd to a84f1c0 on November 14, 2024 20:00
@mimischi (Member Author)

Haven't done any benchmarks just yet. Would like to do some, before we go about merging this.

@mimischi (Member Author)

The benchmark looks better, but the benchmark suite says all differences are negative—and it's setting a positive difference as the threshold, so while this PR is better, the benchmark ends up failing?


=====================================================================================================
Threshold deviations for SwiftKafkaConsumerBenchmarks:SwiftKafkaConsumer_basic_consumer_messages_1000
=====================================================================================================
╒══════════════════════════════════════════╤═════════════════╤═════════════════╤═════════════════╤═════════════════╕
│ Time (wall clock) (ms, %)                │            main │              PR │    Difference % │     Threshold % │
╞══════════════════════════════════════════╪═════════════════╪═════════════════╪═════════════════╪═════════════════╡
│ p90                                      │             642 │             149 │             -76 │              35 │
╘══════════════════════════════════════════╧═════════════════╧═════════════════╧═════════════════╧═════════════════╛

╒══════════════════════════════════════════╤═════════════════╤═════════════════╤═════════════════╤═════════════════╕
│ Time (total CPU) (ms, %)                 │            main │              PR │    Difference % │     Threshold % │
╞══════════════════════════════════════════╪═════════════════╪═════════════════╪═════════════════╪═════════════════╡
│ p90                                      │              30 │              17 │             -42 │              35 │
╘══════════════════════════════════════════╧═════════════════╧═════════════════╧═════════════════╧═════════════════╛

╒══════════════════════════════════════════╤═════════════════╤═════════════════╤═════════════════╤═════════════════╕
│ Throughput (# / s) (#, %)                │            main │              PR │    Difference % │     Threshold % │
╞══════════════════════════════════════════╪═════════════════╪═════════════════╪═════════════════╪═════════════════╡
│ p90                                      │               2 │               7 │            -250 │              35 │
╘══════════════════════════════════════════╧═════════════════╧═════════════════╧═════════════════╧═════════════════╛

╒══════════════════════════════════════════╤═════════════════╤═════════════════╤═════════════════╤═════════════════╕
│ Context switches (#, %)                  │            main │              PR │    Difference % │     Threshold % │
╞══════════════════════════════════════════╪═════════════════╪═════════════════╪═════════════════╪═════════════════╡
│ p90                                      │            1230 │             392 │             -68 │              35 │
╘══════════════════════════════════════════╧═════════════════╧═════════════════╧═════════════════╧═════════════════╛

╒══════════════════════════════════════════╤═════════════════╤═════════════════╤═════════════════╤═════════════════╕
│ (Alloc + Retain) - Release Δ (#, %)      │            main │              PR │    Difference % │     Threshold % │
╞══════════════════════════════════════════╪═════════════════╪═════════════════╪═════════════════╪═════════════════╡
│ p90                                      │            2004 │            1223 │             -39 │              20 │
╘══════════════════════════════════════════╧═════════════════╧═════════════════╧═════════════════╧═════════════════╛

╒══════════════════════════════════════════╤═════════════════╤═════════════════╤═════════════════╤═════════════════╕
│ Releases (K, %)                          │            main │              PR │    Difference % │     Threshold % │
╞══════════════════════════════════════════╪═════════════════╪═════════════════╪═════════════════╪═════════════════╡
│ p90                                      │              14 │              11 │             -22 │              20 │
╘══════════════════════════════════════════╧═════════════════╧═════════════════╧═════════════════╧═════════════════╛

=========================================================================================================
Threshold deviations for SwiftKafkaConsumerBenchmarks:SwiftKafkaConsumer_with_offset_commit_messages_1000
=========================================================================================================
╒══════════════════════════════════════════╤═════════════════╤═════════════════╤═════════════════╤═════════════════╕
│ Time (wall clock) (ms, %)                │            main │              PR │    Difference % │     Threshold % │
╞══════════════════════════════════════════╪═════════════════╪═════════════════╪═════════════════╪═════════════════╡
│ p90                                      │             635 │             119 │             -81 │              35 │
╘══════════════════════════════════════════╧═════════════════╧═════════════════╧═════════════════╧═════════════════╛

╒══════════════════════════════════════════╤═════════════════╤═════════════════╤═════════════════╤═════════════════╕
│ Time (total CPU) (ms, %)                 │            main │              PR │    Difference % │     Threshold % │
╞══════════════════════════════════════════╪═════════════════╪═════════════════╪═════════════════╪═════════════════╡
│ p90                                      │              38 │              15 │             -62 │              35 │
╘══════════════════════════════════════════╧═════════════════╧═════════════════╧═════════════════╧═════════════════╛

╒══════════════════════════════════════════╤═════════════════╤═════════════════╤═════════════════╤═════════════════╕
│ Throughput (# / s) (#, %)                │            main │              PR │    Difference % │     Threshold % │
╞══════════════════════════════════════════╪═════════════════╪═════════════════╪═════════════════╪═════════════════╡
│ p90                                      │               2 │               8 │            -300 │              35 │
╘══════════════════════════════════════════╧═════════════════╧═════════════════╧═════════════════╧═════════════════╛

╒══════════════════════════════════════════╤═════════════════╤═════════════════╤═════════════════╤═════════════════╕
│ Context switches (#, %)                  │            main │              PR │    Difference % │     Threshold % │
╞══════════════════════════════════════════╪═════════════════╪═════════════════╪═════════════════╪═════════════════╡
│ p90                                      │            1432 │             507 │             -64 │              35 │
╘══════════════════════════════════════════╧═════════════════╧═════════════════╧═════════════════╧═════════════════╛

╒══════════════════════════════════════════╤═════════════════╤═════════════════╤═════════════════╤═════════════════╕
│ (Alloc + Retain) - Release Δ (#, %)      │            main │              PR │    Difference % │     Threshold % │
╞══════════════════════════════════════════╪═════════════════╪═════════════════╪═════════════════╪═════════════════╡
│ p90                                      │            1994 │            1186 │             -40 │              20 │
╘══════════════════════════════════════════╧═════════════════╧═════════════════╧═════════════════╧═════════════════╛

New baseline 'PR' is BETTER than the 'main' baseline thresholds.

error: benchmarkThresholdImprovement
Retcode is 1
Benchmark failed

@@ -12,6 +12,7 @@
//
//===----------------------------------------------------------------------===//

import Dispatch
Contributor

Don't think we need this import here

@mimischi (Member Author) Nov 15, 2024

Whoops. Left over from a refactor. Is there a linter that can help point out unused imports?

Sources/Kafka/KafkaConsumer.swift (review thread resolved)
#if swift(>=6.0)
    // Wait on a separate thread for the next message.
    return try await withTaskExecutorPreference(queue) {
        try client.consumerPoll(for: Int32(self.pollInterval.inMilliseconds))
Contributor

What happens after the time out?

Member Author

We attempt to retrieve a message for self.pollInterval. If there's still no message, we return nil—the same behavior as in the above if let. I'd expect we get caught up in the while-loop on line 100 until we do receive a message eventually.
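For illustration, that surrounding loop could look roughly like this (a hypothetical sketch reusing the `nextMessage()` name from the earlier sketch, not the actual code on line 100):

```swift
// Hypothetical sketch of the calling loop: a nil result just means the
// timed poll expired without a message, so we go around and poll again
// until a message eventually arrives.
var message: KafkaConsumerMessage? = nil
while message == nil {
    message = try await self.nextMessage()
}
```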

Sources/Kafka/Utilities/NaiveQueueExecutor.swift (two outdated review threads, resolved)
public func enqueue(_ _job: consuming ExecutorJob) {
    let job = UnownedJob(_job)
    queue.async {
        job.runSynchronously(
Contributor

@ktoso Should we call this runSynchronously or the one that also takes isolatedTo?

@FranzBusch (Contributor)

> The benchmark looks better, but the benchmark suite says all differences are negative—and it's setting a positive difference as the threshold, so while this PR is better, the benchmark ends up failing?

Yes, the benchmarks improved significantly! Negative numbers mean fewer allocations.

@mimischi (Member Author)

> Negative numbers mean fewer allocations.

It's just funny that the benchmarking suite thinks the benchmark failed, because we are below the positive threshold with our negative numbers :)

@mimischi (Member Author)

Also, should this PR—once approved—update the benchmark baseline as well?

@FranzBusch (Contributor)

> Also, should this PR—once approved—update the benchmark baseline as well?

Yes. We intentionally fail the benchmarks when we improve, so we set new thresholds to make sure we don't regress again.

@hassila commented Nov 15, 2024

> Negative numbers mean fewer allocations.

> It's just funny that the benchmarking suite thinks the benchmark failed, because we are below the positive threshold with our negative numbers :)

There are different return codes so CI can choose how to handle that: for an expected improvement, one would check in new baselines manually anyway; for an unexpected improvement, something may be wrong with e.g. the benchmark setup.

But super nice improvements here 👍🏻

@mimischi (Member Author)

@hassila Oh, fair enough. I've not thought about the unexpected improvement situation. Thanks for bringing that up!

let job = UnownedJob(_job)
queue.async {
    job.runSynchronously(
        on: self.asUnownedTaskExecutor()
Contributor

If you're able to use the

public func runSynchronously(isolatedTo serialExecutor: UnownedSerialExecutor,
                             taskExecutor: UnownedTaskExecutor) {

here and pass the queue's UnownedSerialExecutor, that would be preferable, as it would AFAIR be more correct in tracking where this is isolated to.

If this is a pain because the queue does not conform to SerialExecutor on some platforms still... then perhaps conditionalize it to platforms where it is? Or leave as is and let me know and we'll chase fixing the conformance.

Contributor

Got you. Since the underlying queue here is owned by us, nothing should be isolated to it.
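Putting this thread together, a minimal version of such an executor could look like the following. This is a sketch assembled from the excerpts above, assuming the plain `runSynchronously(on:)` form that was settled on; like the rest of this code path it requires Swift 6.

```swift
import Dispatch

// Sketch: a task executor that runs every enqueued job on one privately
// owned DispatchQueue. Because nothing is isolated to the queue, the
// plain runSynchronously(on:) overload is sufficient here.
final class NaiveQueueExecutor: TaskExecutor {
    let queue: DispatchQueue

    init(_ queue: DispatchQueue) {
        self.queue = queue
    }

    func enqueue(_ _job: consuming ExecutorJob) {
        let job = UnownedJob(_job)
        self.queue.async {
            job.runSynchronously(on: self.asUnownedTaskExecutor())
        }
    }
}
```

A `DispatchQueue` could then be wrapped once, e.g. `NaiveQueueExecutor(DispatchQueue(label: "consumer-poll"))` (the label is hypothetical), and passed to `withTaskExecutorPreference` as in the `KafkaConsumer.swift` excerpt above.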

@ktoso (Contributor) left a comment

This looks good to me. Check the run func, but either way I think this is looking good.

@FranzBusch merged commit 4a74297 into swift-server:main on Nov 18, 2024
23 checks passed
@mimischi deleted the issue-165-nonblocking-poll-task-executor branch on November 18, 2024 16:04
Labels: 🔨 semver/patch (No public API change.)
Merging this pull request may close: Use blocking poll and task executor preferences
4 participants