Use One Big Server (2022)
The article 'Use One Big Server' (2022) discusses the cost-effectiveness of using a single powerful server instead of multiple smaller ones or cloud services, sparking a debate among commenters about the pros and cons of this approach.
Snapshot generated from the HN discussion
Discussion Activity
Very active discussion
- First comment: 37m after posting
- Peak period: 81 comments in 0-12h
- Avg / period: 32
Based on 160 loaded comments
Key moments
- Story posted: Aug 31, 2025 at 1:29 PM EDT (4 months ago)
- First comment: Aug 31, 2025 at 2:06 PM EDT (37m after posting)
- Peak activity: 81 comments in 0-12h (hottest window of the conversation)
- Latest activity: Sep 7, 2025 at 7:15 AM EDT (4 months ago)
I'm not saying everybody should do this. There are of course a lot of services that can't afford even a minute of downtime. But there are also a lot of companies that would benefit from a simpler setup.
I'm not a better engineer, I just have drastically fewer failure modes.
I think you misread OP. "Single point of failure" doesn't mean the only failure modes are hardware failures. It means that if anything happens to your node, whether it's a hardware failure, a power outage, someone stumbling over your power/network cable, or even a single service crashing, you have a major outage on your hands.
These types of outages are trivially avoided with a basic understanding of well-architected frameworks, which explicitly address the risk represented by single points of failure.
Either way, stuff happens. Figuring out what your actual requirements are around uptime, time to response, and time to resolution is important before you build a nine-nines solution when eight eights is sufficient. :p
Are you serious? Have you ever built/operated/wired rack scale equipment? You think the power cables for your "short" server (vs the longer one being put in) are just hanging out in the back of the rack?
Rack wiring has been done and done correctly for ages. Power cables on one side (if possible), data and other cables on the other side. These are all routed vertically and horizontally, so they land only on YOUR server.
You could put a Mercedes Maybach above/below your server and nothing would happen.
We were their largest customer and they seemed honest even when they made mistakes that seemed silly, so we rolled our eyes and moved on with life.
Managed hosting means accepting that you can't inspect the racks and chide people for not cabling to your satisfaction. And mistakes by the managed host will impact your availability.
Firing a host where you've got thousands of servers is easier said than done. We did do a quote exercise with another provider that could have supported us, and it didn't end up very competitive ... and it wouldn't have been worth the transition. Overall, there were some derpy moments, but I don't think we would have been happier anywhere else, and we didn't want to rent cages and run our own servers.
You're not getting the point. The point is that if you use a single node to host your whole web app, you are creating a system where many failure modes, which otherwise could not even be an issue, can easily trigger high-severity outages.
> and even if, you could just run a provisioned secondary server (...)
Congratulations, you are no longer using "one big server", thus defeating the whole purpose behind this approach and learning the lesson that everyone doing cloud engineering work is already well aware.
References to "elastic Kubernetes whatever" is a red herring. You can have a dead simple load balancer spreading traffic across multiple bare metal nodes.
I'm baffled by your comment. Are you sure you read what I wrote?
Sigh.
In all those years, I’ve had precisely one actual hardware failure: a PSU went out. They’re redundant, so nothing happened, and I replaced it.
Servers are remarkably resilient.
EDIT: 100% uptime modulo power failure. I have a rack UPS and a generator, but I once discovered the hard way that the UPS batteries couldn't hold a charge long enough to keep the rack up while I brought the generator online.
We had a rack in a data center, and we wanted to put local UPS on critical machines in the rack.
But the data center went on and on about their awesome power grid (shared with a fire station, so no administrative power loss), on site generators, etc., and wouldn't let us.
Sure enough, one day the entire rack went dark.
It was the power strip on the data center's rack that failed. All the backup grids in the world can't get through a dead power strip.
(FYI, a family member lost their home due to a power strip, so, again, anecdotally, if you have any older power strips (5-7+ years) sitting under your desk at home, you may want to consider swapping them out for new ones.)
Re: power strips, thanks for the reminder. I’m usually diligent about that, but forgot about one my wife uses. Replacement coming today.
The number of production incidents on our corporate mishmash of lambda, ecs, rds, fargate, ec2, eks etc? It’s a good week when something doesn’t go wrong. Somehow the logging setup is better on the personal stuff too.
Today’s systems don’t fail nearly as often if you use high quality stuff and don’t beat the absolute hell out of SSD. Another trick is to overprovision SSD to allow wear leveling to work better and reduce overall write load.
Do that and a typical box will run years and years with no issues.
Is that more, less, or about the same as having an AWS/Azure/GCP consultant?
What's the difference in labour per hour?
> the risk of having such single point of failure.
At the prices they charge I can have two hot failovers in two other datacenters and still come out ahead.
A big pain point that I personally don't love is that this non-cloud approach normally means running my own database. It's worth considering a provider who also provides cloud databases.
If you go for an 'active/passive' setup, consider saving even more money by using a cloud VM with auto scaling for the 'passive' part.
In terms of pricing, the deals available these days on servers are amazing: you can get a 4GB RAM VPS with decent CPU and bandwidth for ~$6, or bare metal with 32GB RAM and a quad-core CPU for ~$90. It's worth using sites like serversearcher.com to compare.
SQLite uses one reader/writer lock over the whole database. When any thread is writing the database, no other thread is reading it. If one thread is waiting to write, new reads can't begin. Additionally, every read transaction starts by checking if the database has changed since last time, and then re-loading a bunch of caches.
This is suitable for SQLite's intended use case. It's most likely not suitable for a server with 256 hardware threads and a 50Gbps network card. You need proper transaction and concurrency control for heavy workloads.
Additionally, SQLite lacks a bunch of integrity checks, like data types and various kinds of constraints. And things like materialised views, etc.
SQLite is lite. Use it for lite things, not heavy things.
SQLite is easily the best scaling DB tech I've used. I've moved all my postgres workloads over to it and the gains have been incredible.
It's not a panacea and not the best in all cases, but it's a very sane default that I recommend everyone start with, only complicating their stack with an external DB when they start hitting real limits (which often never happens).
I moved several projects from sqlite to postgres because sqlite didn't scale enough for any of them.
The out of the box defaults for sqlite are terrible for web apps.
Most if not all of your concerns with SQLite are simply a matter of not using the default configuration. Enable WAL mode, enable strict mode, etc. and it's a lot better.
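As a rough illustration of what that non-default configuration looks like, here is a minimal Python sketch (the file name and table are made up, and STRICT tables need SQLite 3.37+):

    import sqlite3

    conn = sqlite3.connect("app.db")

    conn.execute("PRAGMA journal_mode=WAL")    # readers stop blocking the single writer
    conn.execute("PRAGMA synchronous=NORMAL")  # the usual pairing with WAL
    conn.execute("PRAGMA foreign_keys=ON")     # foreign key enforcement is off by default
    conn.execute("PRAGMA busy_timeout=5000")   # wait up to 5s instead of erroring on a locked db

    # "Strict mode" here means STRICT tables, which actually enforce column types.
    conn.execute("""
        CREATE TABLE IF NOT EXISTS events (
            id INTEGER PRIMARY KEY,
            payload TEXT NOT NULL
        ) STRICT
    """)
    conn.commit()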
Sqlite (properly configured) will outperform "proper databases" often by an order of magnitude in the context of a single box. You want a single writer for high performance as it lets you batch.
> 256 hardware threads...
Have you tried? I have. Others have too. [1]
> Additionally, SQLite lacks a bunch of integrity checks, like data types and various kinds of constraints. And things like materialised views, etc.
Sqlite has blobs so you can use your own custom encoding which is what you want in a high performance context.
Here's sqlite on a $5 shared VPS that can handle 10000+ checks per second over a billion checkboxes [2]. You're gonna be fine.
- [1] https://use.expensify.com/blog/scaling-sqlite-to-4m-qps-on-a...
- [2] https://checkboxes.andersmurphy.com
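To make the blob-encoding point concrete, here is a hypothetical sketch of packing many samples into a single blob with a custom binary format (the schema and encoding are invented for illustration):

    import sqlite3, struct

    conn = sqlite3.connect("metrics.db")
    conn.execute("CREATE TABLE IF NOT EXISTS samples (id INTEGER PRIMARY KEY, data BLOB)")

    # Pack (timestamp, value) pairs into one compact blob instead of one row each.
    pairs = [(1700000000 + i, float(i)) for i in range(1000)]
    blob = b"".join(struct.pack("<Id", ts, v) for ts, v in pairs)
    conn.execute("INSERT INTO samples (data) VALUES (?)", (blob,))
    conn.commit()

    # Reading back is a single row fetch plus an unpack loop (12 bytes per pair).
    (data,) = conn.execute("SELECT data FROM samples LIMIT 1").fetchone()
    decoded = [struct.unpack_from("<Id", data, off) for off in range(0, len(data), 12)]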
SQLite (actually SQL-ite, like a mineral) may be light, but so are many workloads these days. Even 1000 queries per second is quite doable with SQLite and modest hardware, and I've worked at billion dollar businesses handling fewer queries than that.
You can abuse git for it if you really want to cut corners.
Compare that with using your distro's packaged version where you can have version variations, variations in default config or file path locations, etc.
You don't need to buy server hardware(!), the article specifically mentions renting from eg Hetzner.
> The benefits of "just don't think about hardware" are real
Can you expand on this claim, beyond what the article mentioned?
I run a lambda behind a load balancer; hardware dies, it's redundant, it gets replaced. If a database server fails, it re-provisions without saturating read IO on the SAN and causing noisy neighbor issues.
I don't deal with any of it, I don't deal with depreciation, I don't deal with data center maintenance.
You don't deal with that either if you rent a dedicated server from a hosting provider. They handle the datacenter and maintenance for you for a flat monthly fee.
But the cloud premium needs reiteration: twenty five times. For the price of the cloud server, you can have twenty-five-way redundancy.
A medium to large size asteroid can cause mass extinction events - this happens sometimes - it's not a theoretical risk.
The risk of the people responsible for managing the platform messing up and losing some of your data is still a risk in the cloud. This thread has even already had the argument "if the cloud provider goes down, it's not your fault" as a cloud benefit. Either cloud is strong and stable and can't break, or cloud breaks often enough that people will just excuse you for it.
Yes, there is.
Honestly, it looks to me that this school of thought is mostly adopted by people that can't do arithmetic or use a calculator. But it does absolutely exist.
That said, no, servers are not nearly expensive enough to move the needle on a company nowadays. The room that goes around them often is, and that's why way more people rent the room than the servers in it.
I ran the IT side of a media company once, and it all worked on a half-empty rack of hardware in a small closet... except for the servers that needed bandwidth. These were colocated. Until we realized that the hoster did not have enough bandwidth, at which point we migrated to two bare metal servers at Hetzner.
The actual space isn't a big deal, but the entire environment has large fixed costs.
In practice, all that except connectivity is relatively easy to have on-site.
Connectivity is highly dependent on the business location, local providers, their business plans and their willingness to go out of their way to serve the clients.
And I am not talking only about bandwidth, but also reserve lines and latency.
I for one really miss being able to go see the servers that my code runs on. I thought data centers were really interesting places. But I don't see a lot of effort to decide things based on pure dollar cost analysis at this point. There's a lot of other industry forces besides the microeconomics that predetermine people's hosting choices.
Never underestimate the price people are willing to pay to evade responsibility. I estimate this is a multi-billion dollar market.
Yep, and it's mostly caused by the VC funding model - if your investors are demanding hockey-stick growth, there is no way in hell a startup can justify (or pay for) the resulting Capex.
Whereas a nice, stable business with near-linear growth can afford to price in regular small Capex investments.
An IBM z17 is effectively one big server too, but provides levels of reliability that are simply not available in most IT environments. It won't outperform the AMD rack, but it will definitely keep up for most practical workloads.
If you sit down and really think honestly about the cost of engineering your systems to an equivalent level of reliability, you may find the cost of the IBM stack to be competitive in a surprising number of cases.
ETA - fixed spelling error
Now, if you can live with the weird environment and your people know how to program what is essentially a distributed system described in terms no one else uses: I guess it's still ok, given the competition is all executing IBM's playbook too.
My understanding is that you usually subdivide into a few LPARs and then reboot the production ones on a schedule to prevent drift and ensure that, yes, unplanned IPLs will work.
They could have got the job done by hosting the service on a VPS with a multi-tenant database schema. Instead, they went about learning Kubernetes and drilling deep into the "cloud-native" stack, and spent a year trying to set up the perfect devops pipeline.
Not surprisingly the company went out of business within the next few years.
I mean, of the two, the PaaS route certainly burns more money, the exception being the rare shop that is so incompetent they can't even get their own infrastructure configured correctly, like in GP's situation.
There are guaranteed more shops that would be better off self-hosting and saving on their current massive cloud bills than the rare one-offs that actually save so much time using cloud services, it takes them from bankruptcy to being functional.
Does it? Vercel is $20/month and Neon starts at $5/month. That obviously goes up as you scale up, but $25/month seems like a fairly cheap place to start to me.
(I don't work for Vercel or Neon, just a happy customer)
And that’s before you factor in 500GB of storage.
But the engineers could find new jobs thanks to their acquired k8s experience.
Use one big server - https://news.ycombinator.com/item?id=32319147 - Aug 2022 (585 comments)
And when you need to move fast (or things break), you can't wait a day for a dedicated server to come up, or worse, have your provider run out of capacity (or have to pick a differently specced server).
IME, having to go multi cloud/provider is a way worse problem to have.
A multi-node system tends to be less reliable and have more failure points than a single-box system. Failures rarely happen in isolation.
You can do zero downtime deployment with a single machine if you need to.
Just like a lot of problems exist between keyboard and chair, a lot of problems exist between service A and service B.
The zero downtime deployment for my PHP site consisted of symlinking from one directory to another.
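For anyone who hasn't seen that pattern, a minimal sketch of the atomic symlink swap (paths are made up; the web server's docroot points at the "current" link):

    import os

    def activate_release(release_dir, current_link="/var/www/current"):
        # Create the new link beside the old one, then rename over it.
        # os.replace() is atomic on POSIX, so requests never see a missing docroot.
        tmp_link = current_link + ".tmp"
        if os.path.lexists(tmp_link):
            os.remove(tmp_link)
        os.symlink(release_dir, tmp_link)
        os.replace(tmp_link, current_link)

    # e.g. activate_release("/var/www/releases/2022-08-31")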
Honestly, we need to stop promoting prematurely making everything a network request as a good idea.
But how are all these "distributed systems engineers" going to get their resume points and jobs?
Picking an arbitrary price point of $200/mo, you can get 4(!) vCPUs and 16GB of RAM at AWS. Architectures are different etc., but this is roughly a mid-spec dev laptop of 5 or so years ago.
At Hetzner, you can rent a machine with 48 cores and 128GB of RAM for the same money. It's hard to overstate how far apart these machines are in raw computational capacity.
There are approaches to problems that make sense with 10x the capacity that don't make sense on the much smaller node. Critically, those approaches can sometimes save engineering time that would otherwise go into building a more complex system to manage around artificial constraints.
Yes, there are other factors like durability etc. that need to be designed for. But going the other way, dedicated boxes can deliver more consistent performance without worries of noisy neighbors.
Now, if you actually need to decouple your file storage and make it durable and scalable, or need to dynamically create subdomains, or any number of other things… The effort of learning and integrating different dedicated services at the infrastructure level to run all this seems much more constraining.
I’ve been doing this since before the “Cloud,” and in my view, if you have a project that makes money, cloud costs are a worthwhile investment that will be the last thing that constrains your project. If cloud costs feel too constraining for your project, then perhaps it’s more of a hobby than a business—at least in my experience.
Just thinking about maintaining multiple cluster filesystems and disk arrays—it’s just not what I would want to be doing with most companies’ resources or my time. Maybe it’s like the difference between folks who prefer Arch and setting up Emacs just right, versus those happy with a MacBook. If I felt like changing my kernel scheduler was a constraint, I might recommend Arch; but otherwise, I recommend a MacBook. :)
On the flip side, I’ve also tried to turn a startup idea into a profitable project with no budget, where raw throughput was integral to the idea. In that situation, a dedicated server was absolutely the right choice, saving us thousands of dollars. But the idea did not pan out. If we had gotten more traction, I suspect we would have just vertically scaled for a while. But it’s unusual.
This is because you are looking only at provisioning/deployment. And you are right -- node size does not impact DevOps all that much.
I am looking at the solution space available to the engineers who write the software that ultimately gets deployed on the nodes. And that solution space is different when the nodes have 10x the capability. Yes, cloud providers have tons of aggregate capability. But designing software to run on a fleet of small machines is very different from accomplishing the same tasks on a single large machine.
It would not be controversial to suggest that targeting code at an Apple Watch or Raspberry Pi imposes constraints on developers that do not exist when targeting desktops. I am saying the same dynamic now applies to targeting cloud providers.
This isn't to say there's a single best solution for everything. But there are tradeoffs that are not always apparent. The art is knowing when it makes sense to pay the Cloud Tax, and whether to go 100% Cloud vs some proportion of dedicated.
I’ve never had an issue with moving data.
I think you confuse Hetzner with bare metal. Hetzner has Hetzner Cloud, which is like AWS EC2 but much cheaper. (They also have bare metal servers which are even cheaper.) With Hetzner Cloud, you can use Terraform, GitHub Actions and whatever else you mentioned.
No network latency between nodes, less memory bandwidth latency/contention than there is in VMs, and no caching-architecture latency when you can just tell e.g. Postgres to use gigs of RAM and let Linux's disk caching take care of the rest (no separate caching architecture needed).
If you’re running Postgres locally you can turn off the TCP/IP part; nothing more to audit there.
SSH based copying of backups to a remote server is simple.
If not accessible via network, you can stay on whatever version of Postgres you want.
I’ve heard these arguments since AWS launched, and all that time I’ve been running Postgres (since 2004 actually) and have never encountered all these phantom issues that are claimed as being expensive or extremely difficult.
(what's "medium-size corp" and how did you come up with $100k ?)
It gets even easier now that you have cheap s3 - just upload the dump to s3 every day and set the s3 deletion policy to whatever is feasible for you.
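A rough sketch of that daily dump-and-upload, assuming pg_dump is on the box and AWS credentials are configured (bucket and database names are placeholders; expiry comes from an S3 lifecycle rule on the bucket, not the script):

    import datetime
    import subprocess
    import boto3

    def backup_to_s3(db="appdb", bucket="example-db-backups"):
        stamp = datetime.date.today().isoformat()
        dump_path = f"/tmp/{db}-{stamp}.sql.gz"

        # pg_dump piped through gzip keeps the upload and storage small.
        with open(dump_path, "wb") as out:
            dump = subprocess.Popen(["pg_dump", db], stdout=subprocess.PIPE)
            subprocess.run(["gzip", "-c"], stdin=dump.stdout, stdout=out, check=True)
            dump.wait()

        boto3.client("s3").upload_file(dump_path, bucket, f"postgres/{db}-{stamp}.sql.gz")

    # backup_to_s3() can then run from a daily cron entry.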
Either way: 1 day of a mid-level developer in the majority of the world (basically: anywhere except Zurich, NYC or SF) is between €208 and €291. (Yearly salary of €50-€70k)
A junior developer's time for setup and the cost of hardware is practically a one-off expense. It's a few days of work at most.
The alternative you're advocating for (a recurring SaaS fee) is a permanent rent trap. That money is gone forever, with no asset or investment to show for it. Over a few years, you'll have spent tens of thousands of dollars for nothing. The real cost is not what you pay a developer; it's what you lose by never owning your tools.
Not sure where I advocated for that. Could you point it out please?
For backups, including Postgres, I was planning on paying Veeam ~$500 a year for a software license to backup the active node and Postgres database to s3/r2. Standby node would be getting streaming updates via logical replication.
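The logical-replication half is mostly two SQL statements; a hypothetical sketch (hosts, credentials, and names are placeholders; the primary needs wal_level=logical and the standby needs the schema created up front):

    import psycopg2

    # Primary: publish the tables the standby should follow.
    primary = psycopg2.connect("host=primary dbname=appdb user=postgres")
    primary.autocommit = True
    primary.cursor().execute("CREATE PUBLICATION app_pub FOR ALL TABLES")

    # Standby: subscribe to that publication.
    # CREATE SUBSCRIPTION can't run inside a transaction block, hence autocommit.
    standby = psycopg2.connect("host=standby dbname=appdb user=postgres")
    standby.autocommit = True
    standby.cursor().execute(
        "CREATE SUBSCRIPTION app_sub "
        "CONNECTION 'host=primary dbname=appdb user=replicator' "
        "PUBLICATION app_pub"
    )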
There are free options as well but I didn’t want to cheap out on the backups.
It looks pretty turnkey. I am a software engineer, not a sysadmin, though. It's still just theory as well, as I haven't built it out yet.
[0] A normal sysadmin remains vaguely bemused at their job title and the way it changes every couple years.
Sometimes even the certified cloud engineers can't tell you why an RDS behaves the way it does, nor can they really fix it. Sometimes you really do need a DBA, but that applies equally to on-prem and cloud.
I'm a sysadmin, but have been labelled and sold as: Consultant (sounds expensive), DevOps engineer, Cloud Engineer, Operations Expert and right now a Site Reliability Engineer.... I'm a systems administrator.
It doesn't need someone who knows how to use the labyrinthine AWS services and console?
These comments sound super absurd to me, because RDS is difficult as hell to set up unless you do it very frequently or already have it in IaC form, since one needs to set up a VPC, subnets, security groups, an internet gateway, etc.
It's not like creating a DynamoDB, Lambda or S3 where a non-technical person can learn it in a few hours.
Sure, one might find some random Terraform file online to do this or vibe-code some CloudFormation, but that's not really a fair comparison.
RDS has a value. But for many teams the price paid for this value is ridiculously high when compared to other options.
I also totally understand why some people with a family to support and a mortgage to pay can't just walk away from a job at a FAANG or MAMAA type place.
Looking at your comparison, at this point it just seems like a scam.
Also in my experience more complex systems tend to have much less reliability/resilience than simple single node systems. Things rarely fail in isolation.
Eh, sort of. The difference is that the cloud can go find other workloads to fill the trough from off peak load. They won’t pay as much as peak load does, but it helps offset the cost of maintaining peak capacity. Your personal big server likely can’t find paying workloads for your troughs.
I also have recently come to the opposite conclusion for my personal home setup. I run a number of services on my home network (media streaming, email, a few personal websites and games I have written, my frigate NVR, etc). I had been thinking about building out a big server for expansion, but after looking into the costs I bought 3 mini pcs instead. They are remarkably powerful for their cost and size, and I am able to spread them around my house to minimize footprint and heat. I just added them all to my home Kubernetes cluster, and now I have capacity and the ability to take nodes down for maintenance and updates. I don’t have to worry about hardware failures as much. I don’t have a giant server heating up one part of my house.
It has been great.
163 more comments available on Hacker News