Anything Can Be a Message Queue If You Use It Wrongly Enough (2023)

Posted4 months agoActive4 months ago

crescit_eundo

184 points

65 comments

xeiaso.netTech DiscussionstoryHigh profile

informativeneutral

Debate

40/100

Message QueuesSystem DesignProgramming

Key topics

Message Queues

System Design

Programming

The tongue-in-cheek assertion that "anything can be a message queue if you use it wrongly enough" sparked a lively discussion, with commenters playfully riffing on the idea that even seemingly unrelated systems can be repurposed as message queues. Some wryly observed that online comment sections, like the one they were participating in, could be viewed as makeshift message queues, with latency being a notable drawback. As the joke gained traction, commenters chimed in with humorous takes, from acknowledging the "ueue" in "queue" to poking fun at the idea of treating a comment section like a message queue. Amidst the levity, a few astute observers noted that, in a sense, a message queue is just a database being used in a particular way – a clever insight that underscored the original claim.

Snapshot generated from the HN discussion

Discussion Activity

Very active discussion

First comment

60m

Peak period

0-6h

Avg / period

9.3

Comment distribution65 data points

Loading chart...

Based on 65 loaded comments

Key moments

01Story posted
Aug 28, 2025 at 11:14 AM EDT
4 months ago
Step 01
02First comment
Aug 28, 2025 at 12:13 PM EDT
60m after posting
Step 02
03Peak activity
54 comments in 0-6h
Hottest window of the conversation
Step 03
04Latest activity
Aug 31, 2025 at 3:26 AM EDT
4 months ago
Step 04

Generating AI Summary...

Analyzing up to 500 comments to identify key contributors and discussion patterns

Discussion (65 comments)

Showing 65 comments

redbell

4 months ago

1 reply

On a totally unrelated topic, I once read a meme online that says: "If you ever felt useless, remember ueue in queue!"

aitchnyu

4 months ago

They are not useless, they are awaiting their turn.

pluto_modadic

4 months ago

1 reply

muahahahaha, muahahaha!

lstodd

4 months ago

so very true.

IIAOPSW

4 months ago

2 replies

Even HN comment sections?

tux3

4 months ago

2 replies

ACK

unmotivated-hmn

4 months ago

1 reply

Even HN comment sections?

therein

4 months ago

At least once delivery.

pwagland

4 months ago

Although latency is shockingly bad.

npteljes

4 months ago

Of course. A message queue is database, and software that handles it in a specific way to make it a message queue. So, HN could basically be that database backend for that imaginary software that turns it into a message queue.

I don't have fun examples with message queues, but I do remember some with filesystems - a popular target to connect cursed backends to. You can store data in Ping packets [0]. You can store data in the digits of Pi - achieving unbelievable compression [1]. You can store data in the metadata and other unused blocks of images - also known as steganography [2]. People wrote software to use Gmail emails as a file system [3].

That's just from the top of my head, and it really shows that sky's the limit with software.

[0] https://github.com/yarrick/pingfs

[1] https://github.com/ajeetdsouza/pifs

[2] https://en.wikipedia.org/wiki/Steganographic_file_system

[3] https://lwn.net/Articles/99933/

metadat

4 months ago

2 replies

Fiendishly outlandish idea, incredibly wrong that it should even be possible for the existence of Hoshino to even have ever been a thought, yet here we are. I love it!

On a related note, have you seen the prices at Whole Foods lately? $6 for a packet of dehydrated miso soup. This usually costs $2.50 served prepared at a sushi restaurant. AWS network egress fees are similarly blasphemous.

Shame on Amazon, lol. Though it's really capitalisms fault, if you think it through all the way.

sneak

4 months ago

4 replies

Why is it Amazon’s fault that people voluntarily choose to use Amazon?

Even with the massive margins, cloud computing is far cheaper for most SMEs than hiring an FTE sysadmin and racking machines in a colo.

The problem is that people forget to switch back to the old way when it’s time.

shermantanktop

4 months ago

2 replies

Another of my online lives is on guitar forums (TGP etc), populated by diverse set of non-geek characters. An eternal question that comes up is “why are they charging so much for this guitar? The parts can’t be that expensive. I bet I could just…”

And the only viable answer is the ol’ capitalist saw: they charge what buyers are willing to pay.

That never quite satisfies people though.

ecshafer

4 months ago

1 reply

Employing labor full time is incredibly expensive in the US. Once you include overhead, taxes, benefits, etc. you can easily be paying 2x wage for a worker. Not to mention buying the goods. So yeah the parts for the guitar might cost X, but then it costs Y to store them and Z for the space to assemble them then A to pay the workers and B to ship them and C to market. It adds up. Without jumping to the EVILS of "Capitalism" a business costs money to run. I can't imagine guitar manufacturer margins are anything close to techs, probably <5%. Gemini tells me industry is around 3.8% so I don't think I am far off.

BiteCode_dev

4 months ago

In this case, you would need to pay someone anyway. I never heard about an AWS account that didn't require at least one engineer in charge of it.

DougMerritt

4 months ago

1 reply

Why aren't they satisfied with merely pondering strats made in US vs Mexico vs Japan vs Indonesia? Careful reviews of quality versus price (which of course varied over time) always showed more correlation with sometimes-unwarranted reputation than with reality.

shermantanktop

4 months ago

Amongst this crowd, “my buddy said” is data. Correlation analysis is not in the picture.

mrkeen

4 months ago

1 reply

Using AWS was supposed the way to avoid the cost of an ops team.

Now every developer also has to be DevOps, learning docker, kubernetes and CI systems instead of just focusing on development.

Also we all still have ops teams.

ranger_danger

4 months ago

False equivalence IMO... devops/docker/CI were not required with "the old way" and neither are they now, why are you thinking that they are?

If your scale requires it, that's fine, but that would have been the case with or without AWS et al.

immibis

4 months ago

SMEs hire someone (an MSP) to manage their IT. They don't use AWS because AWS services are too low-level. AWS is chosen by people who should know better and mostly on the basis of marketing inertia.

Edit: And by people with too much money, which was until recently most tech companies.

devmor

4 months ago

> Even with the massive margins, cloud computing is far cheaper for most SMEs than hiring an FTE sysadmin and racking machines in a colo.

That very much depends on your use case and billing period. Most of my public web applications run in a colo in Atlanta on containers hosted by less than $2k in hardware and cached by Cloudflare. This replaced an AWS/Digitalocean combination that used to bill about $400/mo.

Definitely worth it for me, but there are some workloads that aren’t worth it and I stick with cloud services to handle.

I would estimate that a significant amount of services hosted on AWS are paid for by small businesses with less reliability and uptime requirements than I have.

BiteCode_dev

4 months ago

This is not a situation where you have zero alternatives. You have ton of cheap hosting out there. Most people using AWS don't need the level of reliability and scaling it provides, they pay the price for nothing.

ranger_danger

4 months ago

1 reply

sounds like "parasitic storage" and/or steganography

Kye

4 months ago

One of my favorite dinosaurs

stephenlf

4 months ago

2 replies

Remember when Amazon Video moved from serverless back to a monolith because they were using S3 for storing video streams for near realtime processing? This feels the same. Except Amazon Video is an actual company trying to build real software.

Amazon Video’s original blog post is gone, but here is a third party writeup. https://medium.com/@hellomeenu1/why-amazon-prime-video-rever...

lloydatkinson

4 months ago

1 reply

They deleted their own post?

It couldn’t possibly be because AWS execs were pissed or anything… /s

Simran-B

4 months ago

Archived blog post: https://web.archive.org/web/20240719152109/https://www.prime...

thrance

4 months ago

3 replies

IIRC they were storing individual frames in S3 buckets and hitting their own internal lambda limits. Funny story tbh.

LeifCarrotson

4 months ago

3 replies

You remember correctly:

> The main scaling bottleneck in the architecture was the orchestration management that was implemented using AWS Step Functions. Our service performed multiple state transitions for every second of the stream, so we quickly reached account limits. Besides that, AWS Step Functions charges users per state transition.

> The second cost problem we discovered was about the way we were passing video frames (images) around different components. To reduce computationally expensive video conversion jobs, we built a microservice that splits videos into frames and temporarily uploads images to an Amazon Simple Storage Service (Amazon S3) bucket. Defect detectors (where each of them also runs as a separate microservice) then download images and processed it concurrently using AWS Lambda. However, the high number of Tier-1 calls to the S3 bucket was expensive.

They were really deeply drinking the AWS serverless kool-aid if they thought the right way to stream video was multiple microservices accessing individual frames on S3...

zoogeny

4 months ago

I haven't read the entire article, but just based on the snippets you posted it doesn't look like they were streaming video using this process. It sounds like they were doing defect detection.

I would guess this was part of a process when new videos were uploaded and transcoded to different formats. Likely they were taking transcoded frames at some sample rate and uploading them to S3 where some workers were then analyzing the images to look for encoding artifacts.

This would most likely be a one-time sanity check for new videos that have to go through some conversion pipelines. However, once converted to their final form I would suspect the video files are statically distributed using a CDN.

pythonaut_16

4 months ago

It’s more honesty that you see from most service providers, both dogfooding the approach and not handwaving the costs.

wat10000

4 months ago

Every time they order Chinese takeout, two thousand cars show up, each carrying one grain of rice.

mikepurvis

4 months ago

1 reply

Has a lot of “orders from on high to dog food all the things” energy.

breppp

4 months ago

My guess was "no real cost accounting for service usage internally, until one day zero interest ends and a VP changes that"

moi2388

4 months ago

That’s hilarious

no_thank_you

4 months ago

1 reply

The truly cursed thing in the article is this bit near the end (unless this is part of the satire):

"Something amusing about this is that it is something that technically steps into the realm of things that my employer does. This creates a unique kind of conflict where I can't easily retain the intellectial property (IP) for this without getting it approved from my employer. It is a bit of the worst of both worlds where I'm doing it on my own time with my own equipment to create something that will be ultimately owned by my employer. This was a bit of a sour grape at first and I almost didn't implement this until the whole Air Canada debacle happened and I was very bored."

mananaysiempre

4 months ago

Yes, I guess this is how we learn that Tailscale will lay claim to things you do on your own time using your own machine.

spectraldrift

4 months ago

1 reply

People often forget a message queue is just a simple, high-throughput state machine.

It's tempting to roll your own by polling a database table, but that approach breaks down- sometimes even at fairly low traffic levels. Once you move beyond a simple cron job, you're suddenly fighting row locking and race conditions just to prevent significant duplicate processing; effectively reinventing a wheel, poorly (potentially 5 or 10 times in the same service).

A service like SQS solves this with its state management. A message becomes 'invisible' while being processed. If it's not deleted within the configurable visibility timeout, it transitions back to available. That 'fetch next and mark invisible' state transition is the key, and it's precisely what's so difficult to implement correctly and performantly in a database every single time you need it.

groone

4 months ago

2 replies

Message becomes invisible in a regular relational database when using `SELECT FOR UPDATE SKIP LOCKED`

spectraldrift

4 months ago

That's totally feasible, and works for small to medium traffic (SQS scales seamlessly from 1 message per year to millions per second).

In practice, I've never seen this implemented correctly in the wild- most people don't seem to care enough to handle the transactions properly. Additionally, if you want additional features like DLQs or metrics on stuck message age, you'll end up with a lot more complexity just to get parity with a standard queue system.

A common library could help with this though.

kerblang

4 months ago

Overall it's completely feasible to build a message queue with RDBMS _because_ they have locking. You might end up doing extra work compared to some other products that make message queueing easy/fun/so-simple-caveman-etc.

Now if SQS has some super-scalar mega-cluster capability where one instance can deliver 100 billion messages a day across the same group of consumers, ok, I'm impressed, because most MQ's can't, because... locking. Thus Kafka (which is not a message queue).

I think the RDBMS MQ should be treated as the "No worse than this" standard - if my fancy new message queueing product is even harder to set up, it isn't worth your trouble. But SQS itself IS pretty easy to use.

dwedge

4 months ago

1 reply

I thought the "multiple anime personalities explaining things to each other" style of tech blogging was so 2018

unmotivated-hmn

4 months ago

My first time seeing it. I was somewhat pleasantly confused.

rented_mule

4 months ago

3 replies

In the 1990s, I was at a startup that had a need for a message queue. The only thing we found at the time was a product from TIBCO that was priced way-way-way out of our reach. IIRC, it didn't even run on PCs, only mainframes and minis. Microsoft Exchange Server (Microsoft's email server) had just been released at the time, and we decided to use it as a message queue.

Message-submitting clients used SMTP libraries. Message-consuming clients used Exchange APIs. Consumers would only look at unread messages, they would mark messages as read when they started processing, and move them to a folder other than the Inbox if they succeeded. Many of the queues were multi-producer, but all queues were single-consumer (CPUs were pricey at the time - our servers were all Pentiums and Pentium Pros), which simplified things a lot.

Need a new queue / topic? Add an email address. Need to inspect a queue? Load up an email client. An unexpected benefit was that we could easily put humans in the loop for handling certain queues (using HTML in the messages).

It worked surprisingly well for the 5 years that the company was around. Latency was okay, but not great. Throughput was much better than we would have hoped for - Exchange was almost never the bottleneck.

supportengineer

4 months ago

I can assure you that various companies are still doing this. It still works and for all the same reasons as you list.

1718627440

4 months ago

Honestly that's not even abuse, this is what a mail delivery system truly is.

saghm

4 months ago

> Many of the queues were multi-producer, but all queues were single-consumer (CPUs were pricey at the time - our servers were all Pentiums and Pentium Pros), which simplified things a lot.

Oh good, I was worried about what would have happened if a rogue server accidentally hit "reply all"

adamcharnock

4 months ago

1 reply

I had a developer colleague a while back who was toying with an idea that would require emitting and consuming a _lot_ of messages. I think it was somewhere on the order of 10k-100k/second. He was looking at some pretty expensive solutions IIRC.

I asked if the messages were all under 1.5kb, he said yes. I asked if at-most-one delivery was ok, he said yes. So I proposed he just grab a router and fire messages through it as UDP packets, then use BGP/ECMP to balance the packets between receivers. Add some queues on the router, then just let the receivers pick up the packets as fast as they could. You'd need some kind of feedback to manage back pressure, but ¯\_(ツ)_/¯

A fairly cheap way to achieve 1M+ messages per second.

I never got the chance to flesh-out the idea fully, but the simplicity of it tickled me. Maybe it would have worked, maybe not.

HeyLaughingBoy

4 months ago

3 replies

Isn't a fundamental property of a queue that it's FIFO?

UDP message delivery order is not guaranteed. Hell, UDP delivery itself is not guaranteed (although IME, messages don't usually get dropped unless they cross subnets).

bdcravens

4 months ago

1 reply

> UDP delivery itself is not guaranteed

> I asked if at-most-one delivery was ok, he said yes.

Use case satisfied.

trelane

4 months ago

1 reply

>> UDP delivery itself is not guaranteed

>> I asked if at-most-one delivery was ok, he said yes.

> Use case satisfied.

No. https://wiki.wireshark.org/DuplicatePackets

"ConnectionlessProtocols such as UDP won't detect duplicate packets, because there's no information in, for example, the UDP header to identify a packet so that packets can be recognized as duplicates. The data from that packet will be indicated twice (or even more) to the application; it's the responsibility of the application to detect duplicates (perhaps by supplying enough information in its headers to do so) and process them appropriately, if necessary"

ranger_danger

4 months ago

1 reply

protection from duplicate packets was never a requirement... you also assume there is no other identifying info inside the packets that might provide an inherent protection anyways

trelane

4 months ago

1 reply

> protection from duplicate packets was never a requirement..

Nope. This is what "at-most-one" means. Either zero or one. Not two.

Three is right out.

> you also assume there is no other identifying info inside the packets that might provide an inherent protection anyways

Yes, but then if we're positing a system on top of UDP, then it doesn't matter what the protocol is, TCP or UDP. We can also assume the underlying system we're relying on can also detect missing requests, or even ensure we have exactly one.

ranger_danger

4 months ago

1 reply

> Nope. This is what "at-most-one" means. Either zero or one. Not two.

But "is at-most-one ok?" doesn't necessarily mean that more is also unacceptable. All the valid and invalid conditions are not completely specified IMO.

dragonwriter

4 months ago

1 reply

> But "is at-most-one ok?" doesn't necessarily mean that more is also unacceptable.

If you propose delivering an at-most-once solution, the customer agrees that that is an acceptable parameter, and then you deliver a solution that has no at-most-once guarantee, you haven't delivered what you proposed, and you certainly have no reason to believe you've met the customer's needs.

ranger_danger

4 months ago

The way OP framed it IMO sounded like a very informal conversation, not a written contract with formal specifications... I think it's possible the person agreeing at-most-one was "ok" quite possibly didn't care either way, and might even view supporting duplicates as a positive extra feature... but I don't think this is something either of us can speak to authoritatively.

> if we're positing a system on top of UDP, then it doesn't matter what the protocol is

And this is still true, and neither of us know if they would have designed it this way or not, but they could.

tomjen3

4 months ago

Even Azures native queue systems can guarantee that, because it gets hard to scale. You can only have one processor for any given queue, because otherwise one might take so long to process a message that others have finished processing others in the meantime.

But given OPs need for at-most-once, this doesn't really seem to matter in the first place.

adamcharnock

4 months ago

> UDP message delivery order is not guaranteed

My thinking was that ordering would be pretty unaffected when there is only a single hop. But yeah, we would have needed to test that under load.

DoneWithAllThat

4 months ago

1 reply

Corollary: every message queue can be a database if you use it wrongly enough.

rcleveng

4 months ago

Generalized a bit: everything can be a database if you use it wrongly enough.

devmor

4 months ago

This is utterly incredible and inspiring in the worst way. Mad engineering!

dang

4 months ago

Discussed at the time:

Anything can be a message queue if you use it wrongly enough - https://news.ycombinator.com/item?id=36186176 - June 2023 (239 comments)

stego-tech

4 months ago

This is beyond cursed and I love it.

packetlost

4 months ago

I once had a coworker use GitLab + a git repo + webhooks to implement a queued event system. Some change (I think it was in Jenkins) would call a webhook which would append to some JSON array in a repo, commit it, which would itself trigger something else downstream. It was horrifying and glorious.

1718627440

4 months ago

I don't really know how S3 works, but either its a single system, then you reinvented the use case of local sockets, or what is way more likely, it's already implemented on a TCP/IP network, then congratulations you just invented IP over IP.

View full discussion on Hacker News

ID: 45053234Type: storyLast synced: 11/20/2025, 6:33:43 PM

Want the full context?

Jump to the original sources

Read the primary article or dive into the live Hacker News thread when you're ready.

Open link View on HN