> If a queue isn't approaching empty, then its input is exceeding its output. Therefore its size is trending toward infinity.
This is literally the slippery slope fallacy. You aren't accounting for minor fluctuations or finite timescales. If you were driving a car, you might say "if you aren't steering into oncoming traffic then you're steering towards the ditch" and conclude that no cars will ever safely reach their destination.
If the only valid state for a queue is empty, then why waste time implementing a queue?
It assumes the rate at which items are added to the queue never changes. In real life it speeds up (and the queue grows toward infinite items) and it slows down (and the queue drains toward empty). A more meaningful measurement would be something like the average number of items in the queue and the average time in the queue for any given item.
If you feel like you have enough spare capacity and any given item isn't taking too long to process, then it doesn't matter whether the queue is ever empty.
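For what it's worth, here's a minimal sketch (Python, names are mine) of tracking those two averages from enqueue/dequeue timestamps. By Little's law, the average number of items in the queue should come out roughly equal to the arrival rate times the average time in the queue, which makes a handy sanity check:

```python
import time
from collections import deque

class InstrumentedQueue:
    """FIFO queue that tracks a time-weighted average occupancy and the
    average time each item spends in the queue. Illustrative only."""

    def __init__(self):
        self._items = deque()       # (enqueue_time, item) pairs
        self._total_wait = 0.0      # summed time-in-queue of dequeued items
        self._dequeued = 0          # count of dequeued items
        self._occupancy_area = 0.0  # integral of queue length over time
        self._last_event = time.monotonic()
        self._start = self._last_event

    def _advance(self):
        # Accumulate (current length) * (elapsed time) before any change.
        now = time.monotonic()
        self._occupancy_area += len(self._items) * (now - self._last_event)
        self._last_event = now
        return now

    def put(self, item):
        now = self._advance()
        self._items.append((now, item))

    def get(self):
        now = self._advance()
        enqueued_at, item = self._items.popleft()
        self._total_wait += now - enqueued_at
        self._dequeued += 1
        return item

    def avg_items_in_queue(self):
        self._advance()
        elapsed = self._last_event - self._start
        return self._occupancy_area / elapsed if elapsed else 0.0

    def avg_time_in_queue(self):
        return self._total_wait / self._dequeued if self._dequeued else 0.0
```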
You're on the right track, but I personally find averages to be not a useful measure...on average.
Corner conditions are where I start to care:
* What is the worst-case latency? From the point at which an item enters the queue, how long until its work can be started, and how long until it can finish?
* What is the worst-case number of items in the queue? How large does the queue need to be?
* What is the maximum utilization over some time period with some granularity? How spiky is the workload? A queue is a rate-matching tool. If your input rate exactly equals your output rate at all times, you don't need a queue.
* What is the minimum utilization over some time period? Another view on how spiky the workload is.
Minimums and maximums, I find, are much more illustrative than averages or even mean squared error. Minimums and maximums bound performance; an average doesn't tell you where your boundary conditions are.
In general you don't want your queue to fully fill up, but like the other poster said, it's some tradeoff between utilization and latency, and the two are diametrically opposed.
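Rough sketch of pulling those corner-condition numbers out of enqueue/dequeue timestamps (Python; the helper and its inputs are hypothetical, and arrivals-per-window stands in for utilization):

```python
def queue_corner_stats(events, window=1.0):
    """Corner-condition stats from (enqueue_time, dequeue_time) pairs.

    `events` is assumed to come from whatever instrumentation you already
    have; `window` is the granularity in seconds."""
    # Worst-case time any single item spent waiting in the queue.
    worst_latency = max(deq - enq for enq, deq in events)

    # Worst-case queue depth: sweep enqueue (+1) and dequeue (-1) events in
    # time order, counting enqueues first when timestamps tie.
    timeline = sorted([(enq, +1) for enq, _ in events] +
                      [(deq, -1) for _, deq in events],
                      key=lambda e: (e[0], -e[1]))
    depth = max_depth = 0
    for _, delta in timeline:
        depth += delta
        max_depth = max(max_depth, depth)

    # Spikiness: count arrivals per window, then take the min and max.
    t0 = min(enq for enq, _ in events)
    buckets = {}
    for enq, _ in events:
        idx = int((enq - t0) // window)
        buckets[idx] = buckets.get(idx, 0) + 1
    arrivals = [buckets.get(i, 0) for i in range(max(buckets) + 1)]

    return {
        "worst_case_latency": worst_latency,
        "max_queue_depth": max_depth,
        "min_arrivals_per_window": min(arrivals),
        "max_arrivals_per_window": max(arrivals),
    }
```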
Sure, I guess by average I just meant shorthand for some sort of measurement for making decisions about resources allocated to your queues. Median, mean, P99, whatever is useful.
We use queues for various jobs. Some are time-sensitive (e.g. emails), so the queue should be empty most of the time; some are background or re-processing jobs (e.g. re-indexing), so their queues can balloon and then take days to drain again.
The unifying aspect of using queues for us is that they let us load-balance jobs across workers and monitor throughput in a centralised fashion. We can set alerts on the time-sensitive queues and then use different thresholds for the background queues, but we're using the same metric data and the same alerting system.
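Roughly, that looks something like this (Python sketch; the queue names and thresholds are made up, and the metric is "age of the oldest item in the queue"):

```python
from datetime import timedelta

# Hypothetical per-queue alert thresholds on the same metric; values are illustrative.
ALERT_THRESHOLDS = {
    "emails":      timedelta(minutes=5),  # time-sensitive: should drain quickly
    "re-indexing": timedelta(days=2),     # background: allowed to balloon
}

def check_queue_alerts(oldest_item_age):
    """Return the queues whose oldest item has exceeded its threshold.

    `oldest_item_age` maps queue name -> timedelta since its oldest item
    was enqueued, collected by whatever monitoring is already in place."""
    return [name for name, age in oldest_item_age.items()
            if age > ALERT_THRESHOLDS.get(name, timedelta(hours=1))]
```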