Only retrieve updated query statement metrics #19321

sethsamuel · 2024-12-27T22:37:41Z

What does this PR do?

Updates MySQL statement collection to only gather statements that have executed since the last collection. This functionality is behind a flag to allow for slow rollout and testing before hopefully making it standard.

Four variant builds were compared:

latest, a control with current live behavior
digest-only, query for the same records as latest but only get the digest, then perform a second query for the digest text if it is not already cached
last-seen, query only for digests of statements that have executed since the last collection, then query for digest text if not cached
last-seen-with-text, query for statements executed since last collection but with digest text.

Testing was performed on orders app with 1 minute bursts of high cardinality queries every 5 minutes.

Somewhat surprisingly, querying for the digest and then for digest text made little difference in performance, even with large random statements (~1kb) executed during bursts. When querying for only updated rows, the much smaller row count allows for fetching the text as well, avoiding a second database query.

CPU usage is greatly reduced during normal load, and somewhat reduced during high cardinality bursts.

Query time and overall statement metrics collection time (note that y-axis is in seconds, not ms) are greatly reduced.

Motivation

Customers have complained about CPU usage by the MySQL check. This change uses a similar technique to the other database integrations to minimize the amount of unnecessary data retrieved for each statement collection.

Review checklist (to be filled by reviewers)

Feature or bugfix MUST have appropriate tests (unit, integration, e2e)
Add the qa/skip-qa label if the PR doesn't need to be tested during QA.
If you need to backport this PR to another branch, you can add the backport/<branch-name> label to the PR and it will automatically open a backport PR once this one is merged

codecov · 2024-12-27T22:40:36Z

Codecov Report

Attention: Patch coverage is 94.44444% with 2 lines in your changes missing coverage. Please review.

Project coverage is 87.65%. Comparing base (aa9bd4e) to head (2600722).
Report is 12 commits behind head on master.

Additional details and impacted files

Flag	Coverage Δ
activemq	`?`
cassandra	`?`
hive	`?`
hivemq	`?`
ignite	`?`
jboss_wildfly	`?`
kafka	`?`
mysql	`89.51% <94.44%> (-0.01%)`	⬇️
presto	`?`
solr	`?`

Flags with carried forward coverage won't be shown. Click here to find out more.

lu-zhengda · 2025-01-02T15:41:15Z

mysql/datadog_checks/mysql/statements.py

+            hostname=self._check.resolved_hostname,
+        )
+
+        monotonic_rows = self._filter_query_rows(monotonic_rows)


Is there a reason to filter the rows after retrieving from the database? can you still leverage WHERE digest_textNOT LIKE 'EXPLAIN %' ORdigest_text IS NULL with last_seen filter?

We could but it slows down the query and since we're not limiting anymore it's simple enough to filter them on the client. In practice it seems like EXPLAINs are never a substantial number of rows.

sethsamuel added 7 commits December 23, 2024 12:52

Cache digest text for MySQL

24120d6

WIP

6c09f70

Metric

1cf651f

Fix time

cbcf5ba

Last seen

f422cd0

WIP

16df724

Fixed

724de2b

datadog-assets bot added agent/review-requested ecosystems/review-requested product/review-requested labels Dec 27, 2024

sethsamuel changed the title ~~Cache digest text for MySQL~~ Only retrieve updated query statement metrics Dec 27, 2024

sethsamuel added 5 commits December 30, 2024 09:42

Clean

940b27d

Clean

3a69651

Feature flag

5ae0a83

Changelog

279ba93

Clean

bdacf12

sethsamuel added the qa/skip-qa Automatically skip this PR for the next QA label Dec 30, 2024

lu-zhengda reviewed Jan 2, 2025

View reviewed changes

sethsamuel marked this pull request as ready for review January 2, 2025 16:01

sethsamuel requested review from a team as code owners January 2, 2025 16:01

sethsamuel added 3 commits January 2, 2025 11:39

Handle case of skipped queries between runs

4b69943

Clean

263a4cc

Clean

2600722

lu-zhengda approved these changes Jan 2, 2025

View reviewed changes

steveny91 approved these changes Jan 2, 2025

View reviewed changes

datadog-assets bot added agent/approved and removed agent/review-requested labels Jan 2, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Only retrieve updated query statement metrics #19321

Only retrieve updated query statement metrics #19321

sethsamuel commented Dec 27, 2024 •

edited

Loading

codecov bot commented Dec 27, 2024 •

edited

Loading

lu-zhengda Jan 2, 2025

sethsamuel Jan 2, 2025

Only retrieve updated query statement metrics #19321

Are you sure you want to change the base?

Only retrieve updated query statement metrics #19321

Conversation

sethsamuel commented Dec 27, 2024 • edited Loading

What does this PR do?

Motivation

Review checklist (to be filled by reviewers)

codecov bot commented Dec 27, 2024 • edited Loading

Codecov Report

lu-zhengda Jan 2, 2025

Choose a reason for hiding this comment

sethsamuel Jan 2, 2025

Choose a reason for hiding this comment

sethsamuel commented Dec 27, 2024 •

edited

Loading

codecov bot commented Dec 27, 2024 •

edited

Loading