Azure Container Apps with KEDA + Azure Service Bus scaling to 0 even when messages are coming in

Question

Azure Container Apps with KEDA + Azure Service Bus scaling to 0 even when messages are coming in

Max Voloshyn 5

Hi,

I'm using Azure Container Apps with a custom KEDA-based scaling rule that listens to an Azure Service Bus queue.

Here’s my setup:

minReplicas: 0

maxReplicas: 2

pollingInterval: 30 seconds

cooldownPeriod: 300 seconds

The scaling generally works — it scales up when messages arrive and down when idle.

However, I’m seeing unexpected scale-to-zero behavior even when messages are still coming in (though sporadically).

📈 My scenario:

Messages arrive with irregular frequency: 1 message every 1 to 15 seconds.

Each message is processed very quickly, so the queue often appears empty when KEDA checks it.

As a result, the app sometimes scales to 0 before the full 300-second cooldown, even though messages continue to trickle in.

❓ My question:

Is my assumption correct that because messages are processed too quickly, KEDA may not "see" them during the polling intervals — causing it to believe the queue is empty and trigger scale-down?

If so, is there a recommended way to prevent this kind of premature scale-down for low-frequency but steady message traffic?

Thanks in advance!Hi,

I'm using Azure Container Apps with a custom KEDA-based scaling rule that listens to an Azure Service Bus queue.

Here’s my setup:

minReplicas: 0

maxReplicas: 2

pollingInterval: 30 seconds

cooldownPeriod: 300 seconds

The scaling generally works — it scales up when messages arrive and down when idle.

However, I’m seeing unexpected scale-to-zero behavior even when messages are still coming in (though sporadically).

📈 My scenario:

Messages arrive with irregular frequency: 1 message every 1 to 15 seconds.

Each message is processed very quickly, so the queue often appears empty when KEDA checks it.

As a result, the app sometimes scales to 0 before the full 300-second cooldown, even though messages continue to trickle in.

❓ My question:

Is my assumption correct that because messages are processed too quickly, KEDA may not "see" them during the polling intervals — causing it to believe the queue is empty and trigger scale-down?

If so, is there a recommended way to prevent this kind of premature scale-down for low-frequency but steady message traffic?

Thanks in advance!

Anurag Rohikar 205 Reputation points Microsoft External Staff Moderator

2025-08-05T10:44:39.15+00:00
Hello,

Thank you for the detailed explanation of your issue. You've correctly identified the core problem.

Your assumption is right: because your application processes messages so quickly, KEDA's 30-second polling interval is likely "missing" messages. By the time KEDA checks the queue, the message has already arrived and been processed, causing KEDA to think the queue is idle and begin the scale-down process.

This is a common scenario with low-frequency, high-speed workloads. Here are our recommendations to prevent this premature scale-to-zero behavior.

Recommended Solutions

Reduce the Polling Interval: The most direct solution is to decrease the pollingInterval in your KEDA configuration. For your message frequency (1 message every 1-15 seconds), a lower interval of 5 or 10 seconds would give KEDA a much better chance of "seeing" messages in the queue and keeping an instance active.

Set minReplicas to 1: If your workload requires instant processing of every message, you can set minReplicas:1. This ensures at least one instance of your container app is always running, eliminating cold starts and the risk of scaling to zero. This is a trade-off that guarantees responsiveness but incurs a continuous, though minimal, cost.

Consider the messageAge Scaler: For sporadic traffic like yours, the messageAge scaler can be an excellent fit. Instead of scaling based on the number of messages (queueLength ), it scales based on how long the oldest message has been waiting in the queue. This is often more reliable for low-volume queues where messages are processed individually but consistently.

Adjust the cooldownPeriod: You mentioned that your cooldown period is set to 300 seconds. Consider decreasing this value to ensure that KEDA doesn't scale down too aggressively when messages are being processed sporadically.

Documentation:

You can find more information on these settings in the official documentation:

Azure Container Apps - Always-On Instances: This documentation explains the purpose of minReplicas.

KEDA Scalers - Azure Service Bus: This page covers all the available metadata, including queueLength and messageAge.

KEDA Polling Interval: This link explains how to configure the pollingInterval.

Cooldown Period Configuration

Disclaimer: The above external document is not maintained by Microsoft. It is being shared solely for your convenience.

To help us fine-tune the best solution for your setup, could you confirm the queueLength threshold you currently have configured?
Looking forward to your response to assist you further. Thank You!
Max Voloshyn 5 Reputation points

2025-08-05T13:20:54.48+00:00

@Anurag Rohikar Thank you for the quick and detailed response.

To answer your question about the queueLength threshold — I don’t have that configured. My current KEDA scaling setup is as follows:

Type: Custom

Custom rule type: azure-servicebus

activationMessageCount: 0

messageCount: 1000

I also reviewed the KEDA documentation but couldn’t find anything related to a messageAge scaler. It seems that parameter isn't currently listed among the available options.
Anurag Rohikar 205 Reputation points Microsoft External Staff Moderator

2025-08-06T10:18:44.1966667+00:00

Hello Max Voloshyn, just checking to see if you have a chance to check my previous response and helped, do let me know if you have any further questions on this. Thank You!
Anurag Rohikar 205 Reputation points Microsoft External Staff Moderator

2025-08-07T06:06:26.3866667+00:00

Hello Max Voloshyn, just checking to see if you have a chance to check my previous response and helped, do let me know if you have any further questions on this. Thank you!

1 answer

Your answer

Max Voloshyn 5 Reputation points

2025-08-05T13:20:54.48+00:00

@Anurag Rohikar Thank you for the quick and detailed response.

To answer your question about the queueLength threshold — I don’t have that configured. My current KEDA scaling setup is as follows:

Type: Custom

Custom rule type: azure-servicebus

activationMessageCount: 0

messageCount: 1000

I also reviewed the KEDA documentation but couldn’t find anything related to a messageAge scaler. It seems that parameter isn't currently listed among the available options.
Anurag Rohikar 205 Reputation points Microsoft External Staff Moderator

2025-08-06T10:18:44.1966667+00:00

Hello Max Voloshyn, just checking to see if you have a chance to check my previous response and helped, do let me know if you have any further questions on this. Thank You!
Anurag Rohikar 205 Reputation points Microsoft External Staff Moderator

2025-08-07T06:06:26.3866667+00:00

Hello Max Voloshyn, just checking to see if you have a chance to check my previous response and helped, do let me know if you have any further questions on this. Thank you!

Answer 1

Hello Max Voloshyn, thank you for the quick follow-up and for providing the scaler configuration details, this really helps to narrow things down.

A couple of clarifications and next steps:

About messageAge Scaler:

You're absolutely right, there isn’t a native "messageAge" scaler available for Azure Service Bus in KEDA as of now. My earlier suggestion was more of a conceptual alternative that sometimes gets implemented via custom scalers or in other services where trigger-based execution models can handle message age/latency better. For Azure Container Apps + KEDA, we're currently limited to queue length-based metrics when using the Azure Service Bus scaler.

Analyzing Your Current Setup:

From your config:

activationMessageCount: 0 — This defines when the app should "wake up" from scale-to-zero. So, even a single message should activate the app.
messageCount: 1000 — This defines the scaling target; KEDA will scale linearly up to maxReplicas as the queue approaches this count.

However, with sporadic messages that arrive in small bursts (1-15 seconds apart) and get processed rapidly, it’s likely that:

The queue rarely accumulates enough messages for the messageCount trigger to activate.
KEDA may still miss messages during its polling intervals, especially at low queue lengths.

Recommended Actions:

The root cause seems to be a combination of a high polling interval and the messageCount being too large for your workload pattern. Based on this, here are the recommended adjustments:

Lower the messageCount value: Since your queue processes messages quickly and doesn’t build up to high volumes, setting messageCount to something like 5 or 10 may help KEDA react more sensitively to incoming traffic.
Reduce the pollingInterval further: Try setting it to 5s or 10s to catch those quick-arriving messages.
Consider minReplicas: 1: If reducing polling intervals is not sufficient or if you need zero cold-start delays, this will keep an instance warm at all times.

Let me know if this helps. Happy to assist further. Thank You!

Share via

Azure Container Apps with KEDA + Azure Service Bus scaling to 0 even when messages are coming in

1 answer

Your answer