New %codes for computing message forwarding delays #260

eduard-bagdasaryan · 2024-07-15T15:00:30Z

The new format codes log absolute times of an HTTP/FTP transaction stage:

received_request_first_byte_time
received_request_last_byte_time
sent_request_first_byte_time
sent_request_last_byte_time
received_response_first_byte_time
received_response_last_byte_time
sent_response_first_byte_time
sent_response_last_byte_time

These codes yield an approximate absolute time of event in
<full seconds since epoch>.<fractional seconds> format.
The times may be useful in triaging various delays caused by
Squid and/or peers.

Squid maintains these timers only for successful transactions. Most of
the timers are updated in an IO callback invoked during HTTP/FTP
data transfers.

There are many ways a transaction can fail. Many failures are not associated with a socket read(2). We can and perhaps should collect and report error timestamps, but doing so correctly is a lot of work _and_ does not help with monitoring primary SLA-related delays that usually focus on successful transactions (percentage of unforced errors is usually a separate metric that is not related to proxy-induced delays).

Instead, a new requestParseStart field is used for factoring in possible parsing delays. %request_first_read_time now shows the beginning of the first successful parse attempt (ignoring previous unsuccessful attempts due to insufficient data).

Also added another ReadFromClient() helper for updating statistics and timers.

Also added (missing) statCounter.client_http.kbytes_out statistics for FTP.

src/servers/Server.cc

eduard-bagdasaryan · 2024-07-28T21:00:14Z

src/tunnel.cc

-        statCounter.client_http.kbytes_in += len;
-    }
+
+    ReadFromClient(al, len, errcode);


As we discussed, ReadFromClient() and other three functions minimize duplication by collecting global statistics, but do not collect local (transaction-related) statistics as well. For example, in this context the local statistics is calculated by the client.bytesIn(len) call. In HttpStateData::readReply() it would be probably HttpStateData::payloadSeen variable. We have also ClientHttpRequest::out::size calculated in both HTTP and FTP code. Some of these context-related variables are used for other format codes, e.g., "<st" is gathered from al->http.clientReplySz, that in turn is calculated from ClientHttpRequest::out.
In general, all these transaction-related bits seem rather unsystematic: I do not see an (easy) way how we can neatly adjust further, moving them into ReadFromClient() (and similar) functions.

src/cf.data.pre

eduard-bagdasaryan · 2024-07-28T23:13:17Z

src/servers/FtpServer.cc

+    const auto http = pipeline.front()->http;
+    assert(http);
+    http->out.size += io.size;
+    WrittenToClient(http->al, io.size, false);


This (and 2 cases below) adds statCounter.client_http.kbytes_out statistics for FTP.

Also this change eliminates requestReadTimer special updating outside ReadFromClient() API. Also removed requestReadTimer special handling on EOF: in this case the truncated request is invalid (aborting the transaction?). For simplicity, we do not maintain requestReadTimer (and others) if there is a higher level problem (e.g., the entire transaction is aborted).

eduard-bagdasaryan · 2024-08-12T00:13:29Z

src/cf.data.pre

+
+	response_last_write: The latest observed event in a series of successful response write events. See response_first_write event for the definition of a successful response write event. If the entire response was delivered in a single socket write(2), then this event is the same as response_first_write event.
+
+


The PR implementation differs from the proposed definitions (for request_first_read/request_last_read):

request_first_read

valid requests: Just after request parsing starts (parseOneRequest())

invalid requests (created by abortRequestParsing(): Same as (1)

fake requests: When the fake request is created (buildFakeRequest())

request_last_read

valid requests: request_first_read (for parsed complete requests) or just after it reads the last request data block from the client socket (for parsed incomplete requests)

invalid requests (created by abortRequestParsing(): same as request_first_read

fake requests: When the fake request is created (buildFakeRequest())

request_first_write
Just after the first request data block is written to the peer socket (CommIoCbParams/IOCB callbacks)

request_last_write
Just after the last request data block is written to the peer socket (CommIoCbParams/IOCB callbacks)

response_first_read
Just after the first response data block is read from the peer socket (CommIoCbParams/IOCB callbacks)

response_last_read
Just after the last response data block is read from the peer socket (CommIoCbParams/IOCB callbacks)

response_first_write
Just after the first request data block is written to the client socket (CommIoCbParams/IOCB callbacks)

response_last_write
Just after the last request data block is written to the client socket (CommIoCbParams/IOCB callbacks)

We could simplify request_first_read/request_last_read calculation, ignoring parsing time and measuring them from ClientHttpRequest constructor. However, this approach would minimize request_first_read/request_last_read difference in some cases, concealing this useful information from the admin.

eduard-bagdasaryan · 2024-08-12T12:49:31Z

src/servers/Http1Server.cc

@@ -72,6 +72,8 @@ Http::One::Server::noteMoreBodySpaceAvailable(BodyPipe::Pointer)
 Http::Stream *
 Http::One::Server::parseOneRequest()
 {
+    TimeScope parseContext(requestParseStart);


We cannot easily simplify here (getting rid of TimeScope wrap) because requestParseStart may be used (incorrectly) if we buildFakeRequest() somewhere further (within the same clientParseRequests() iteration). In buildFakeRequest() request_first_read should be 'now'.

src/servers/Server.cc

src/cf.data.pre

These or similar improvements are needed, but they are far from trivial and deserve dedicated PR(s) that would address several problems: * inconsistent treatment of ERR_CLOSING * inconsistent treatment of read EOF * inconsistent treatment of I/O errors * inconsistent treatment of StoreEntry aborts during I/O wait * abuse of statCounter.client_http for FTP traffic? * missing statCounter.client_http.kbytes_out update in Ftp::Server::wroteReply() * basic low-level I/O stats collected by higher-level I/O handling code and/or post-TLS/decryption stats misrepresenting actual socket I/Os Bundling statCounter and ALE::cache updates also complicated ReadFromClient() API and logic, requiring an error-prone hasError parameter and special treatment of zero-size I/O (that was duplicating caller checks in some cases). At the same time, some other rather similar stats updates were not bundled (e.g., hier.notePeerWrite()). The in-scope changes "inlined" in this commit are still unpolished; their location may also change. Some are suffering from similar inconsistencies. Some are missing ALE presence checks.

Ideally, we want requestReadTimer to include the time it takes Squid to parse a fully accumulated request header. However, the proposed (and now removed) solution was problematic on several levels: * It led to virtual parseOneRequest() code duplication. * It crashed two existing ClientHttpRequest(nullptr) callers. * It contradicted existing cache.start_time initialization code in the same ClientHttpRequest constructor. It would be odd for %tS to be later than %request_first_read (that _name_ reflects neither removed nor this/new/current code and will be changed, of course). * A different solution will be needed when we start creating ClientHttpRequest objects earlier, _before_ we start parsing. And we should create those object earlier instead of accumulating information at ConnStateData level and then passing it to ClientHttpRequest later, including on parsing errors. Many other problems will be solved if we create ClientHttpRequest objects earlier!

XXX: Besides benign compilation problems, this solution is complex but probably incomplete and probably hits other/existing code problems. We should find a better approach.

Revert previous branch commit due to problems documented in that commit message.

Underlying principle: We want to capture the moment when a successful transaction(*) received all data it was waiting for -- the time of the last socket read associated with that transaction. Some of these EOF events abort the current transaction. We do not care about such events because we only care about _successful_ transactions. We (greatly) simplify code by not chasing EOF effects to distinguish successful EOF events from killer ones (see two previous branch commits for a negative example of such chasing). On a client-to-Squid side, a socket read may read data for multiple (current and/or future) transactions, but at most _one_ of them would be the current transaction waiting for those bytes. The rest are future transactions that have not even started yet (from this project events point of view); a future transaction read timer will _start_ when that transaction is created, after accumulating enough data to parse its request header. (*): Here, "transaction" is a request-response exchange (on either client-to-Squid or Squid-to-peer connection).

N.B. Legacy comm_read() callers may receive these three io.flags values: ERR_CLOSING, OK, and COMM_ERROR. No ENDFILE and INPROGRESS for them (those flags are specific to readers converted to Comm::ReadNow() API).

I am not sure zero-writes are possible, but if they are possible, I see no reason to ignore them because the transaction is still waiting for that write(2) system call.

... instead of 'request header parsed' or 'last header byte' timestamp. I think both timestamps are valuable, but we should start with "first byte" because * 'first byte' feels more "fundamental" than 'last header byte'. * 'first byte' "matches" 'last byte' timestamp this branch is adding. * 'first request byte' "matches" 'first response byte' this branch is adding. On the Squid-to-server connection, measuring 'first response byte' is easier/simpler, and it is good to start with something simpler. * 'first byte' covers header parsing overheads that are, in most cases, Squid overheads that admins want to include in SLAs and such. This commit partially restores ideas and associated problems removed in commit c0dd69b: We make this code more complex again, reintroduce a cache.start_time conflict, and add code that will have to be refactored when ClientHttpRequest objects are created earlier.

... until we can prove it (and assert it in FwdState ctor).

IME, it is best to use std::optional wrapping to indicate lack of information rather than to define information type. The difference is usually very subtle. For example, in this specific case, the only problematic areas (fixed by this commit) were MessageTimer conversion constructor (that arguably should take a known time value) and ExtractRequestStartTime() (that definitely should return a known time value rather than a never-nil std::optional). There are exceptions to this rule of thumb, of course. Sometimes, the possibility of missing information _is_ an intrinsic property of a C++ type. I do not think those exceptions apply to MessageTimer::Time.

Also formatted new %code docs.

We claim that "CONNECT tunnel establishment ... happen earlier". FwdState::successfullyConnectedToPeer() does reset the timer, but tunnel.cc (which uses Http::Tunneler) does not go through FwdState, and if feels odd to update a timer that is guaranteed to be reset anyway. Tunnel establishment handling is one of those areas where "message activity" (i.e. what we are trying to measure on this branch) differs from "socket I/O".

since doneWithControlMsg() is also used for caller notification (i.e., on unsuccessful write).

eduard-bagdasaryan · 2024-08-27T17:58:18Z

src/auth/basic/NIS/nis_support.h

@@ -8,9 +8,6 @@
 #ifndef SQUID_SRC_AUTH_BASIC_NIS_NIS_SUPPORT_H
 #define SQUID_SRC_AUTH_BASIC_NIS_NIS_SUPPORT_H

-#ifndef SQUID_SRC_AUTH_BASIC_NIS_NIS_SUPPORT_H
-#define SQUID_SRC_AUTH_BASIC_NIS_NIS_SUPPORT_H
-


This change is out of scope - it fixes a build error.

rousskov and others added 22 commits June 28, 2024 16:57

Incomplete and messy feature documentation draft

334d87a

fixup: Simplified and generalized

f19be20

fixup: Clarified adaptation timing w.r.t. byte events

98ccecb

Switched from _byte to _read terms to be able to add _write events

b34e422

Reduce "first request" vs "first read" ambiguity

e843568

Document all 8 cases: send/receive first/last request/response byte

44b7842

Added %request_first_read_time and %request_last_read_time codes

fa1b0ad

Introduced a new MessageTimer class

330ef13

Added %request_first_write_time and %request_last_write_time codes

8a8da4b

Polished

380f33a

Removed an unused type

bfb0102

Reworked 8a8da4b, covering some missed FTP cases

a4f5a99

Cleanup

e4fad62

Added %response_first_read_time and %response_last_read_time codes

38038ad

Update requestReadTimer when uploading FTP data

109de07

Update responseReadTimer on EOF

507ab43

Update requestReadTimer on EOF

38daef3

Update timers on EOF in tunnel.cc

57fda97

Also added another ReadFromClient() helper for updating statistics and timers.

Added %response_first_write_time and %response_last_write_time codes

a9c51e7

Also added (missing) statCounter.client_http.kbytes_out statistics for FTP.

Added a missing WrittenToClient() in FTP

3b23fd3

eduard-bagdasaryan changed the title ~~%codes for computing message forwarding delays~~ New %codes for computing message forwarding delays Jul 28, 2024

eduard-bagdasaryan commented Jul 28, 2024

View reviewed changes

src/servers/Server.cc Outdated Show resolved Hide resolved

eduard-bagdasaryan commented Jul 28, 2024

View reviewed changes

src/cf.data.pre Outdated Show resolved Hide resolved

Added a TODO

0a19226

eduard-bagdasaryan commented Jul 28, 2024

View reviewed changes

eduard-bagdasaryan added 2 commits August 9, 2024 16:32

Addressed a TODO about duplication

626598c

eduard-bagdasaryan commented Aug 12, 2024

View reviewed changes

rousskov requested changes Aug 12, 2024

View reviewed changes

src/servers/Server.cc Outdated Show resolved Hide resolved

src/cf.data.pre Outdated Show resolved Hide resolved

eduard-bagdasaryan and others added 19 commits August 12, 2024 20:17

Adjusted new format codes to be compatible with %tS

bf09e96

WIP: A failed attempt to time end-of-request discovery

7ab3b89

XXX: Besides benign compilation problems, this solution is complex but probably incomplete and probably hits other/existing code problems. We should find a better approach.

Revert "WIP: A failed attempt to time end-of-request discovery"

ed0be1a

Revert previous branch commit due to problems documented in that commit message.

fixup: Simplified a bit

3fb4e85

N.B. Legacy comm_read() callers may receive these three io.flags values: ERR_CLOSING, OK, and COMM_ERROR. No ENDFILE and INPROGRESS for them (those flags are specific to readers converted to Comm::ReadNow() API).

Unified *WriteTimer.update() calls

ff53998

I am not sure zero-writes are possible, but if they are possible, I see no reason to ignore them because the transaction is still waiting for that write(2) system call.

fixup: Treat FTP data connection establishment as I/O

235e422

fixup: Polished EOF-without-transactions event description

692aaf8

fixup: I/O event timings should not depend on Store state

12d9909

fixup: Do not assume FwdState::al is always available

9d3b7e5

... until we can prove it (and assert it in FwdState ctor).

fixup: Fixed timer name mixed up by my branch commit 88c458c

d8af1f5

Renamed and redocumented new %codes to better match code

9dd2725

Also formatted new %code docs.

Update timers on control messages

b70587b

Update responseWriteTimer only if the control message was sent

cfff89f

since doneWithControlMsg() is also used for caller notification (i.e., on unsuccessful write).

eduard-bagdasaryan commented Aug 27, 2024

View reviewed changes

Polished

3317bda

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New %codes for computing message forwarding delays #260

New %codes for computing message forwarding delays #260

eduard-bagdasaryan commented Jul 15, 2024 •

edited

Loading

eduard-bagdasaryan Jul 28, 2024

eduard-bagdasaryan Jul 28, 2024

eduard-bagdasaryan Aug 12, 2024

eduard-bagdasaryan Aug 12, 2024

eduard-bagdasaryan Aug 27, 2024


		response_last_write: The latest observed event in a series of successful response write events. See response_first_write event for the definition of a successful response write event. If the entire response was delivered in a single socket write(2), then this event is the same as response_first_write event.

New %codes for computing message forwarding delays #260

Are you sure you want to change the base?

New %codes for computing message forwarding delays #260

Conversation

eduard-bagdasaryan commented Jul 15, 2024 • edited Loading

eduard-bagdasaryan Jul 28, 2024

Choose a reason for hiding this comment

eduard-bagdasaryan Jul 28, 2024

Choose a reason for hiding this comment

eduard-bagdasaryan Aug 12, 2024

Choose a reason for hiding this comment

eduard-bagdasaryan Aug 12, 2024

Choose a reason for hiding this comment

eduard-bagdasaryan Aug 27, 2024

Choose a reason for hiding this comment

eduard-bagdasaryan commented Jul 15, 2024 •

edited

Loading