Feat/fetch retry and error handling #21

TheMagoo73 · 2020-08-17T10:22:59Z

A slightly naive implementation of retry logic to mitigate transient errors communicating with the intake. Can be improved down the line by filtering HTTP response codes such as 401 that should not cause a retry as they're not transient.

TheMagoo73 · 2020-08-17T15:21:04Z

Found some time today to improve the retry logic slightly. It now only retries HTTP 500 and greater errors. Anything in the 400 range is considered a 'persistent' error and isn't retried - the POST bails immediately. There are a couple of test cases to cover each scenario.

itsfadnis

@TheMagoo73 My apologies for the delay in getting back at this PR

While I think retrying requests is a good idea, a better idea would be to expose a callback that the consumer of datadog-winston could call if a request fails.

This way the consumer can decide what they want to do with a log when it fails, rather than us trying to resend it to datadog.

We could probably expose something like an onFailure hook, so it could be used like:

const transport = new DatadogTransport({
  // options
  onFailure: (data) => {
	// Log transport failed,
    // Do whatever you want with the failed log
  }
})

What are your thoughts on this?

TheMagoo73 · 2020-09-25T21:42:48Z

Apologies for the delay in getting back - holiday season!

I like the idea of a callback, it definitely adds more flexibility, I'm playing with some ideas around it at the moment. The key for me, I think, is to allow the caller to have enough information to handle retries, back offs etc. easily. My thinking is that an enriched version of the original log request that includes the number of failures would probably no the job.

I think supplying an 'out-of-the-box' retry implementation that can be used for the hook handler is also something we should consider, so users have a really simple, zero(ish) config to just get something reliable working.

Will try and get something up this weekend!

TheMagoo73 added 6 commits August 17, 2020 10:10

Slightly naive retry mechanism

7dc5f1b

Correct fetch result status

38b138a

Correct fetch result status

46d78eb

Retry test and fixes

8f1a7ef

re-enable standard for tests

6f82672

Add improved retry logic

878f366

itsfadnis reviewed Sep 6, 2020

View reviewed changes

jonathanmorley mentioned this pull request Mar 1, 2021

dont use async #26

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/fetch retry and error handling #21

Feat/fetch retry and error handling #21

TheMagoo73 commented Aug 17, 2020

TheMagoo73 commented Aug 17, 2020

itsfadnis left a comment

TheMagoo73 commented Sep 25, 2020

Feat/fetch retry and error handling #21

Are you sure you want to change the base?

Feat/fetch retry and error handling #21

Conversation

TheMagoo73 commented Aug 17, 2020

TheMagoo73 commented Aug 17, 2020

itsfadnis left a comment

Choose a reason for hiding this comment

TheMagoo73 commented Sep 25, 2020