Allow (controlled) reuse of parsers again #14025

athornton · 2023-09-29T17:09:22Z

Use Case

After #13886 was merged, and we deployed a rebased telegraf instance, our performance took a nosedive. We realized it was because we were making a network call to our Schema Registry with every single measurement, because now we're creating a new Avro parser object each measurement, and therefore the schema cache, which is inside the parser, is always empty.

Expected behavior

We expected performance to stay about the same: one call to the registry per schema ID (there are many, many more measurements than distinct schemas).

Actual behavior

We're hitting the schema registry on each measurement, which is very slow, creates a huge load on the schema registry, and gobbles up a lot of memory for all of the simultaneously-open TCP connections.

Additional info

A little discussion on the InfluxDB Slack resulted in this suggested from @srebhan :

This is really another instance of "stateful parser"... :disappointed: In my view we need to introduce a Clone() telegraf.Parser interface to allow the parser to decide what shall be copied to a new instance and what shouldn't. This way we will get rid of special handling for e.g. CSV etc. In your case Clone() would simply return the present parser instance if if parsing is thread-safe (which was the reason for #13886). In other cases we might need to copy the whole parser or parts of it...

So I intend to add the Clone() interface to the Avro parser, and then take a look at what needs doing to plumb that into the generic create-a-parser logic.

The text was updated successfully, but these errors were encountered:

athornton · 2023-09-29T20:12:32Z

I wanted to get some feedback from @powersj and @srebhan about how to implement this.

So, should Clone() be a zero-argument method you call on an initialized Parser object, returning a pointer to a Parser? The superclass implementation gets you a new Parser, calls Init() on it, and returns a pointer to that object, while the one for the Avro parser just returns a pointer to the extant object?

athornton · 2023-10-05T20:10:07Z

So...my next question is...is there any easy way to tell which parsers are thread-safe? Is there any easy way to tell which fields for a given parser can be shallow-copied and where I need a deep copy?

The idea of implementing Clone() for every parser class fills me with a certain amount of dread, especially if it means that I'm going to have to learn how each one works internally.

AlbertasB · 2023-10-18T10:38:15Z

Just wanted to add that I also encountered HUGE performance hit after #13886 has been merged

powersj · 2023-10-18T18:57:43Z

@athornton, @srebhan, and myself met to discuss this today.

The next steps involve:

Revert of fix(inputs.kafka_consumer): Use per-message parser to avoid races #13886
Blocking around the json_v2 parse function
Create benchmark functions for parsing functions

The conclusion we reached was that the json_v2 has known issues around parsing data in a thread safe manner, resulting in #13886. However, this has impacted performance and other parsers that have state too much. Hence the revert.

For json_v2, we are going to add some blocking into the parse function in order to mitigate issues that were seen, without impact other parsers for now. Then additional work can occur later to better understand what is wrong specifically with json_v2.

Finally, in order to have real data around the parser performance, we want to add some benchmark functions around each parse function. This way we can get some idea of the performance for any given release.

athornton added the feature request Requests for new plugin and for new features to existing plugins label Sep 29, 2023

athornton mentioned this issue Oct 5, 2023

fix(inputs.kafka_consumer): Use per-message parser to avoid races #13886

Merged

3 tasks

srebhan mentioned this issue Oct 19, 2023

test(parsers): Add benchmarks #14148

Merged

3 tasks

athornton mentioned this issue Oct 19, 2023

fix(parsers.json_v2): Prevent race condition in parse function #14149

Merged

3 tasks

srebhan closed this as completed in #14149 Oct 30, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow (controlled) reuse of parsers again #14025

Allow (controlled) reuse of parsers again #14025

athornton commented Sep 29, 2023

athornton commented Sep 29, 2023

athornton commented Oct 5, 2023

AlbertasB commented Oct 18, 2023

powersj commented Oct 18, 2023

Allow (controlled) reuse of parsers again #14025

Allow (controlled) reuse of parsers again #14025

Comments

athornton commented Sep 29, 2023

Use Case

Expected behavior

Actual behavior

Additional info

athornton commented Sep 29, 2023

athornton commented Oct 5, 2023

AlbertasB commented Oct 18, 2023

powersj commented Oct 18, 2023