
delivery tag in the observations leading to too many timers generated -> leading to memory leak #2914

Open
jensbaitingerbosch opened this issue Nov 25, 2024 · 2 comments

Comments


jensbaitingerbosch commented Nov 25, 2024

In what version(s) of Spring AMQP are you seeing this issue?

3.2.0

Describe the bug

When upgrading our app to the latest Spring version, we discovered that the new version produces far more metrics than before. While the old version produced about 20k time series in Prometheus, the new one generates 200k per instance. The main problematic metrics are:

  • spring_rabbit_listener_seconds_bucket
  • spring_rabbit_listener_active_seconds_bucket

which did not exist in the older version 3.1.x.

To Reproduce

Create an application with message listeners and activate exporting of histogram buckets (e.g. by adding a MeterFilter to the MeterRegistry).

Have observations active (e.g. to trace the message processing even when messages are processed asynchronously); a sketch of such a setup follows the meter filter code below.

import io.micrometer.core.instrument.Meter;
import io.micrometer.core.instrument.MeterRegistry;
import io.micrometer.core.instrument.config.MeterFilter;
import io.micrometer.core.instrument.distribution.DistributionStatisticConfig;
import org.jetbrains.annotations.NotNull;
import org.springframework.beans.factory.config.BeanPostProcessor;

public class MeterRegistryConfigurator implements BeanPostProcessor {

  @Override
  public Object postProcessAfterInitialization(@NotNull Object bean, @NotNull String beanName) {
    if (bean instanceof MeterRegistry meterRegistry) {
      // Export percentile histogram buckets for every timer/distribution summary.
      var config = meterRegistry.config();
      config.meterFilter(new MeterFilter() {
        @Override
        public DistributionStatisticConfig configure(
            @NotNull Meter.Id id,
            @NotNull DistributionStatisticConfig config
        ) {
          return config.merge(DistributionStatisticConfig.builder()
              .percentilesHistogram(true)
              .build());
        }
      });
    }
    return bean;
  }
}
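
For the observation part, a minimal sketch of such a setup (Spring Boot assumed; the class name, queue name and the setObservationEnabled call on the container factory are illustrative assumptions, not our actual code — if that setter is not available in your version, it can be set on the listener container itself):

import org.springframework.amqp.rabbit.annotation.RabbitListener;
import org.springframework.amqp.rabbit.config.SimpleRabbitListenerContainerFactory;
import org.springframework.amqp.rabbit.connection.ConnectionFactory;
import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;

@Configuration
public class RabbitObservationConfig {

  // Container factory used by the @RabbitListener below; turning observations on
  // here is what produces the spring.rabbit.listener timers in the first place.
  @Bean
  SimpleRabbitListenerContainerFactory rabbitListenerContainerFactory(ConnectionFactory connectionFactory) {
    var factory = new SimpleRabbitListenerContainerFactory();
    factory.setConnectionFactory(connectionFactory);
    factory.setObservationEnabled(true); // assumption: available on the factory in 3.x
    return factory;
  }

  // Placeholder listener; the queue name is illustrative only.
  @RabbitListener(queues = "xxx.queue")
  public void handle(String payload) {
    // process the message
  }
}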

Expected behavior
A relatively small number of timers is exported (at most one per queue/error/listener instance).

Actual behaviour

A new timer is created per delivery tag, which leads to an enormous number of timers:

error="none", messaging_destination_name=xxx.queue", messaging_rabbitmq_message_delivery_tag="1", spring_rabbit_listener_id="org.springframework.amqp.rabbit.RabbitListenerEndpointContainer#2"}

Reference

The RabbitMQ documentation refers to the delivery tag as

monotonically growing positive integers

Therefore it is not suitable as a 'low cardinality key value'.

Workaround

Add an observation filter that removes the key messaging.rabbitmq.message.delivery_tag,

e.g. by adding it in a bean post processor:

@Bean
open fun observationFilterPostProcessor() = object : BeanPostProcessor {
    override fun postProcessAfterInitialization(bean: Any, beanName: String): Any {
        if (bean is ObservationRegistry) {
            bean.observationConfig().observationFilter { context ->
                if (context.name == "spring.rabbit.listener") {
                    // Drop the per-message delivery tag so it cannot blow up the timer count.
                    context.removeLowCardinalityKeyValue("messaging.rabbitmq.message.delivery_tag")
                }
                context
            }
        }
        return bean
    }
}
jensbaitingerbosch changed the title from "Too many metrics generated leading to memory leak" to "delivery tag in the observations leading to too many timers generated -> leading to memory leak" on Nov 26, 2024
@jensbaitingerbosch (Author) commented:

In a previous state of this bug report I also mentioned another metric. That was caused by the inheritance of the low cardinality values by child observations.

@artembilan (Member) commented:

Yes, it feels like a bug: this messaging.rabbitmq.message.delivery_tag has to be a high cardinality value. If that is the case, please consider contributing the fix. I'm on vacation until next week.
