Why does rabbitmq's consumption experience jagged pauses #14567

ponponon · 2025-09-18T03:01:00Z

ponponon
Sep 18, 2025

Describe the bug

Whether it's the publisher or the consumer, the rate drops to zero at fixed intervals. Is this due to the RabbitMQ server pausing for garbage collection?

If observing a single queue leads to an unreliable conclusion about GC pauses, the same pattern holds globally. The entire RabbitMQ instance seems to pause approximately every 30 seconds. Why is this happening? Is RabbitMQ under excessive load? How can we specifically analyze and troubleshoot whether these GC pauses are caused by high load or other factors?

Reproduction steps

When using RabbitMQ 3.12.14 with a sufficiently large number of queues and consumers, the issue is 100% reproducible.

Expected behavior

I believe both sending consumption and consumption tasks should be smooth. Why does the consumption curve show metrics resetting to zero at fixed intervals? I want to understand the cause. If this is an abnormal phenomenon, I want to know how to prevent it.

Additional context

I deployed it using Debian 10 + Docker.

The machine configuration is 24 cores and 48GB RAM, with top-tier cloud SSD storage.

Answered by kjnilsson

Sep 18, 2025

3.12.4 is out of community support, you need to upgrade to 4.2.

That said the graph may just be a side effect of how metrics are calculated. You need to look at the throughout rate of your consumers and see if it matches what you see in the management UI. Upgrade to 4.2 and see if it still occurs.

View full answer

kjnilsson · 2025-09-18T06:15:28Z

kjnilsson
Sep 18, 2025
Maintainer

3.12.4 is out of community support, you need to upgrade to 4.2.

That said the graph may just be a side effect of how metrics are calculated. You need to look at the throughout rate of your consumers and see if it matches what you see in the management UI. Upgrade to 4.2 and see if it still occurs.

1 reply

mkuratczyk Sep 18, 2025
Maintainer

just a small correction: 4.2 has not been released yet. :) 4.1 is the latest available version.

If 4.1 behaves the same way, we still need more information - at the very least, what type of queues do you use?

lukebakken · 2025-09-18T14:11:09Z

lukebakken
Sep 18, 2025
Maintainer

@ponponon - what do you expect the RabbitMQ maintainers to do with what little information you provide, exactly? Do you expect them to rush to set up an environment, try to GUESS how you're using RabbitMQ, and report back to you, all for free? You're not even using a supported version of RabbitMQ.

If you want to get free support for your issue, I suggest you provide enough information to reproduce what you report.

First, reproduce your issue in your environment using the latest version of RabbitMQ and Erlang. If you see the same behavior, provide a git repository with the complete source code to start producers and consumers that mimics your workload and reproduces what you observe.

0 replies

michaelklishin · 2025-09-18T14:25:26Z

michaelklishin
Sep 18, 2025
Maintainer

@ponponon do you expect us to guess what your consumers do or do not do (like do not acknowledge deliveries in a timely manner or use a suitable prefetch value)? I'm afraid our small team cannot afford guessing, guessing is a very very time consuming approach to troubleshooting distributed infrastructure.

Is this due to the RabbitMQ server pausing for garbage collection?

The Erlang runtime does not suffer from "stop the world" pauses caused by GC because there is no global GC, every Erlang process (a connection, a channel or session, or queue or stream replica) has an independent heap and their garbage collections do not affect other processes.

Yes, there is a shared reference counted heap for larger binaries but its GC is not "stop the world" for the entire system.

As any heavy PerfTest user would confirm, when a stop-the-world Java GC in a consumer or producer process happens, you can usually tell by a drop in publishing or delivery/delivery acknowledgement metrics, even though RabbitMQ was not paused for GC.

One scenario where RabbitMQ is guaranteed to stop deliveries is when a consumer is delivered as many messages as its channel's prefetch, which by definition means that RabbitMQ should not deliver any more until some outstanding deliveries are acknowledged.

0 replies

michaelklishin · 2025-09-18T14:35:01Z

michaelklishin
Sep 18, 2025
Maintainer

How can we specifically analyze and troubleshoot whether these GC pauses are caused by high load or other factors?

By using monitoring data, ideally with a full set of Grafana dashboards (it can be inter-node connection congestion if the messages are large), and by asking the node how does it spend its CPU/scheduler time.

If this node has 1 CPU core, then a surge of activity in any part of the system (e.g. on a particular connection) inevitably can take CPU scheduler time from queues or channels (that serialize deliveries to be sent).

With an installation so old (it has reached EOL without any exceptions), I cannot rule out that these periodic background GC settings that were relevant for some workloads years ago could be enabled. They force a minor GC run for every single process in the system.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Why does rabbitmq's consumption experience jagged pauses #14567

Uh oh!

{{title}}

Uh oh!

Replies: 4 comments 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Why does rabbitmq's consumption experience jagged pauses #14567

Uh oh!

ponponon Sep 18, 2025

Describe the bug

Reproduction steps

Expected behavior

Additional context

Replies: 4 comments · 1 reply

Uh oh!

kjnilsson Sep 18, 2025 Maintainer

Uh oh!

mkuratczyk Sep 18, 2025 Maintainer

Uh oh!

Uh oh!

lukebakken Sep 18, 2025 Maintainer

Uh oh!

Uh oh!

michaelklishin Sep 18, 2025 Maintainer

Uh oh!

michaelklishin Sep 18, 2025 Maintainer

ponponon
Sep 18, 2025

Replies: 4 comments 1 reply

kjnilsson
Sep 18, 2025
Maintainer

mkuratczyk Sep 18, 2025
Maintainer

lukebakken
Sep 18, 2025
Maintainer

michaelklishin
Sep 18, 2025
Maintainer

michaelklishin
Sep 18, 2025
Maintainer