Zed's Pricing Has Changed: LLM Usage Is Now Token-Based
Posted 3 months ago · Active 3 months ago
zed.dev · Tech story · High profile
Tone: heated, mixed · Debate score: 80/100
Key topics
AI-Assisted Coding Tools
Pricing Models
Token-Based Pricing
Zed, a coding editor, has changed its pricing model to token-based for LLM usage, sparking debate among users about the fairness and implications of this change.
Snapshot generated from the HN discussion
Discussion Activity
Very active discussion · First comment: 18m
Peak period: 66 comments in 0-2h · Avg per period: 11.4
Comment distribution: 160 data points (based on 160 loaded comments)
Key moments
1. Story posted: Sep 24, 2025 at 12:13 PM EDT (3 months ago)
2. First comment: Sep 24, 2025 at 12:31 PM EDT (18m after posting)
3. Peak activity: 66 comments in 0-2h (the hottest window of the conversation)
4. Latest activity: Sep 25, 2025 at 2:38 PM EDT (3 months ago)
ID: 45362425 · Type: story · Last synced: 11/20/2025, 8:56:45 PM
I'm a big fan of Zed but tbf I'm just using Claude Code + Nvim nowadays. Zed's problem with their Claude integration is that it will never be as good as just using the latest from Claude Code.
The integration in Zed is limited by what the Claude Code SDK exposes. Since about half of the /commands are missing from the SDK, they don’t show up in Zed.
I think ACP was a good strategic move by Zed, but all I personally really need is Claude Code in a terminal pane with diffs of proposed edits in the absolutely wonderful multibuffer view
What is the core business plan then?
I understand that there's nothing you could do to protect me if I make a prompt that ends up using >$5 of usage but after that I would like Zed to reject anything except my personal API keys.
https://zed.dev/docs/ai/plans-and-usage#usage-spend-limits
Though from everything I've read online, Zed's edit prediction model is far, _far_ behind that of Cursor.
Usage pricing on something like aws is pretty easy to figure out. You know what you're going to use, so you just do some simple arithmetic and you've got a pretty accurate idea. Even with serverless it's pretty easy. Tokens are so much harder, especially when using it in a development setting. It's so hard to have any reasonable forecast about how a team will use it, and how many tokens will be consumed.
I'm starting to track my usage with a bit of a breakdown in the hope that I'll find a somewhat reliable trend.
I suspect this is going to be one of the next big areas in cloud FinOps.
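The forecasting problem described above can be made concrete with a little arithmetic. This is a minimal sketch with entirely hypothetical per-token prices and usage numbers, just to show how the estimate is built up:

```python
# Rough monthly-cost forecast for a team's LLM usage.
# All prices and usage figures below are hypothetical placeholders.

def monthly_cost(devs, sessions_per_day, in_tok, out_tok,
                 price_in_per_m=3.00, price_out_per_m=15.00, workdays=22):
    """Estimate monthly spend in dollars for a team."""
    daily_in = devs * sessions_per_day * in_tok
    daily_out = devs * sessions_per_day * out_tok
    cost_per_day = (daily_in / 1e6) * price_in_per_m \
                 + (daily_out / 1e6) * price_out_per_m
    return cost_per_day * workdays

# A 5-person team, 20 agent sessions a day each,
# ~50k input and ~5k output tokens per session:
print(round(monthly_cost(5, 20, 50_000, 5_000), 2))  # 495.0
```

The arithmetic itself is trivial; the hard part the commenter identifies is that `sessions_per_day` and the token counts per session are exactly the inputs nobody can predict for a development team.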
I don't see how it matters to you that you aren't saturating your $200 plan. You have it because you hit the limits of the $100/mo plan.
There is probably a lot of value in predictability. Meaning it might be viable, on a $200 plan, to offer more than $200 worth of tokens.
https://github.com/anthropics/claude-code/issues/1109
This is only true if you can find someone else selling them at cost.
If a company has a product that cost them $150, but they would ordinarily sell piecemeal for a total of $250, getting a stable recurring purchase at $200 might be worthwhile to them while still being a good deal for the customer.
When you get Claude Code's $20 plan, you get "around 45 messages every 5 hours". I don't really know what that means. Does that mean I get 45 total conversations? Do minor followups count against a message just as much as a long initial prompt? Likewise, I don't know how many messages I'll use in a 5 hour period. However, I do understand when I start bumping up against limits. If I'm using it and start getting limited, I understand that pretty quickly - in the same way that I might understand a processor being slower and having to wait for things.
With tokens, I might blow through a month's worth of tokens in an afternoon. On one hand, it makes more sense to be flexible for users. If I don't use tokens for the first 10 days, they aren't lost. If I don't use Claude for the first 10 days, I don't get 2,160 message credits banked up. Likewise, if I know I'm going on vacation later, I can't use my Claude messages in advance. But it's just a lot easier for humans to understand bumping up against rate limits over a more finite period of time and get an intuition for what they need to budget for.
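The 2,160 figure above checks out; here is the arithmetic spelled out:

```python
# Sanity-check the "2,160 banked messages" figure from the comment:
# 45 messages per 5-hour window, windows running continuously for 10 days.
messages_per_window = 45
window_hours = 5
days = 10

windows = days * 24 / window_hours        # 48 five-hour windows in 10 days
banked = windows * messages_per_window    # messages that would have "accrued"
print(int(banked))  # 2160
```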
My mental model is they’re assigning some amount of API credits to the account and billing the same way as if you were using tokens, shutting off at an arbitrary point. The point also appears to change based on load / time of day.
Edit: To be clear, I'm not talking about Zed. I'm talking about the companies that make the models.
I unfortunately have seen many AI-based tools being demoed with this approach. The goal is clearly to monetize every user action while piggybacking off of models provided by a third-party. The gross thing is that leadership from the director level up LOVES these demos, even when the models very clearly fuck up in the demo.
AI: "I have cleaned the formatting for all 4,650 records in your sample XML files. Let me know if there's anything else I can do to help!"
Me: "There are over 25,000 records in that data..."
AI: "You're absolutely right!"
https://forstarters.substack.com/p/for-starters-59-on-credit...
It already is. There’s been a lot of talk and development around FinOps for AI and the challenges that come with that. For companies, forecasting token usage and AI costs is non-trivial for internal purposes. For external products, what’s the right unit economic? $/token, $/agentic execution, etc? The former is detached from customer value, the latter is hard to track and will have lots of variance.
With how variable output size can be (and input), it’s a tricky space to really get a grasp on at this point in time. It’ll become a solved problem, but right now, it’s the Wild West.
The hard work is the high level stuff like deciding on the scope of the project, how it should fit in to the project, what kind of extensibility the feature might need to be built with, what kind of other components can be extended to support it, (and more), and then reviewing all the work that was done.
Agree with the sentiment, but I do think there are edge cases.
e.g. I could see a place like openrouter getting away with a tiny fractional markup based on the value they provide in the form of having all providers in one place
At scale, OpenRouter will instead get you the lower high-volume fees they themselves get from their different providers.
Why does that not justify charging a fraction of your spend on the LLM platform? This is pretty much how every service business operates.
This is not a new concern. And is not unique to Zed.
AI, just like cloud was before this, is being treated as "infrastructure". The reason people invest in roads is not to make money from the road itself, but rather to recoup all the losses from the new extremely profitable businesses that will come on top of this infrastructure. To the stock ticker, this looks like a boom+bust cycle. But really, it's a bust+boom cycle as far as the investors are concerned. Saudi Arabia doesn't give a hoot about transformers, but they know if they invest their large amounts of capital in the new infrastructure - huge models in expensive datacenters using cities' worth of power - they can invest in the many new hyper-profitable businesses on top. Also helps that it locks in demand for oil, I guess :P
A similar example for cloud would be the gajillion cloud SaaS, DevOps, cloud security whatever businesses that only exist today because the whole cloud infra segment ran unprofitable for a long time and created the infrastructure for all this.
These new businesses will not be multi hundred-billion dollar businesses, no. They will all be million dollar businesses. But you'll have a million such businesses. Everyone that's stuffing money into AI today is hoping that there will be a huge product layer [1] on top of this new infrastructure once it's all consolidated at a few major players allowing "infra" costs to drop considerably, which they can milk.
The stock of all these companies going high is just wall st making it easy for people to buy debt from this infrastructure. If they're indisciplined, they will co-mingle this debt with normal people's daily lives debt and sell it as one, and everything gets affected badly. If they're not, then the stock market will complain, but normal folks will be kind of OK.
[1] well, everyone has this same idea, meaning there are also a lot of short-term investors trying to make gains while openai/nvidia/etc are on their way up, sort of like greater-fool investing, but let's ignore them for the purposes of this argument.
[Note] of course, whether cloud/ai/whatever are actually useful infra that deserves to be force-created is up for debate.. many disagree, not me though.
You're making a huge upfront unprofitable investment into something, so that a lot of insanely profitable investments can be made 10 years in the future, that uses the result of todays investment as infrastructure.
Whether it's as important as roads, all of that is not relevant from an economic standpoint -- I'm just explaining the rationale behind the investments today.
> These companies
It's useful to mentally think of it as an implicit "team effort" among all relevant, rich companies. Openai (just an example) themselves are being "allowed" by everyone else in the "team" to not make money, because everyone knows that at the end there will be a gajillion new product companies they can all make their money back on. Openai/stargate is in that sense just a "front" for this massive infrastructure investment. Sama/Microsoft themselves will make money that way, either by building those products (just think of the ai integrations in MS enterprise in 10 years...) or by investing in others building those products. Defense folks have investments into this for the same reason.
That's why I said it's similar to roads, no one expects to make money from roads. But on those nice roads amazon will create a 1-day delivery product, charge you $10 a month for it, and investors will make money from that [1].
[1] yes yes this specific example is bunk and historically wrong, but wanted to drive home the parallel with a small example
I've been exclusively using the Claude Sonnet 4 model in VSCode, and so far I've used 90% of the premium quota by the end of the month. I can always use GPT4.1 or GPT5-mini for free if need be.
I'm curious if they have plans to improve edit prediction though. It's honestly kind of garbage compared to Cursor, and I don't think I'm being hyperbolic by calling it garbage. Most of the time its suggestions aren't helpful, but the 10-20% of the time it is helpful is worth the cost of the subscription for me.
It obsoleted making Vim macros and multiline editing for example. Now you just make one change and the LLM can derive the rest; you just press tab.
It's interesting that the Cursor team's first iteration is still better than anything I've seen in their competitors. It's been an amazing moat for a year(?) now.
Copilot is the best value by far
It's based on qwen 2.5 coder 7B and you should be able to run it locally quite easily since it would only require about 8 GB of VRAM for the 8-bit version. Not sure if Zed supports this though, I'm not a Zed user myself.
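The 8 GB claim is easy to check with back-of-envelope arithmetic (the 1 GB overhead allowance below is my own rough assumption for KV cache and activations, not a figure from the comment):

```python
# Back-of-envelope VRAM estimate for an 8-bit quantized 7B model.
params = 7e9
bytes_per_param = 1                           # 8-bit quantization: 1 byte/weight
weights_gb = params * bytes_per_param / 1e9   # ~7 GB just for the weights
overhead_gb = 1.0                             # rough allowance: KV cache, activations
print(round(weights_gb + overhead_gb, 1))     # 8.0
```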
The wording may sound AI generated but the gist of the comment is my true opinion
Why: LLMs are increasingly becoming multimodal, so an image "token" or video "token" is not as simple as a text token. Also, it's difficult to compare across competitors because tokenization is different.
Eventually prices will just be in $/Mb of data processed. Just like bandwidth. I'm surprised this hasn't already happened.
For autoregressive token-based multimodal models, image tokens are as straightforward as text tokens, and there is no reason video tokens wouldn't also be. (If models switch architectures and, say, multimodal diffusion models become more common, then, sure, a different pricing model tied more closely to that architecture's actual compute cost drivers is likely, but... even that isn't likely to be bytes.)
> Also, it's difficult to compare across competitors because tokenization is different.
That’s a reason for incumbents to prefer not to switch, though, not a reason for them to switch.
> Eventually prices will just be in $/Mb of data processed.
More likely they would be in floating-point operations expended processing them, but using tokens (which are the primary drivers for the current LLM architectures) will probably continue as long as the architecture itself is dominant.
In classical computing, there is a clear hierarchy: text < images <<< video.
Is there a reason why video computing using LLMs shouldn't be much more intensive and therefore costly than text or image output?
For text I just assume them to be word stems or, more like, word-family members (cat-feline-etc).
For images and videos I guess each character, creature, idea in it is a token? Blue sky, cat walking around, gentleman with a top hat, multiplied by the number of frames?
No, for images, tokens would, I expect, usually be asymptotically proportional to the area of the image (this is certainly the case with input tokens for OpenAI's models that take image inputs; outputs are more opaque); you probably won't have a neat one-to-one intuition for what one token represents, but you don't need that for it to be useful and straightforward for understanding pricing, since the mathematical relationship of tokens to size can be published and the size of the image is a known quantity. (And videos conceptually could be like images with an additional dimension.)
For images, you take patches of the image (say 16x16 patches[1]) and then directly pass them into the FFN+transformer machinery[2]. As such, there is no vocabulary of tokens for images[3]. So the billing happens per image patch, i.e., for large images, your cost will go up[2], since they have more px*py patches.
[1] x 3, due to RGB
[2] Up to a point; beyond a certain resolution it gets downsampled to lower quality. The downsampling happens in many ways... Qwen-VL uses a CNN, GPT iirc stuffs a downsampler after the embedding layer... as well as before. Anyway, they usually just take the average reduction from that downsampler and cut your billed tokens by that much in all these cases. OpenAI's bin-based billing is like this.
[3] Dall-E from way back when did have a discrete set of tokens and it mapped all patches of all images in the world to one from that, IIRC.
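The patch arithmetic described above is simple to sketch. This assumes one token per 16x16 patch with partial patches rounded up, which is an illustrative simplification; as noted, real providers downsample large images past some resolution cap:

```python
# Hypothetical image-token count: one token per 16x16 pixel patch,
# rounding partial patches up (ceiling division). Ignores the
# downsampling that real providers apply above a resolution cap.
import math

def image_tokens(width, height, patch=16):
    return math.ceil(width / patch) * math.ceil(height / patch)

print(image_tokens(512, 512))    # 1024 patches
print(image_tokens(1920, 1080))  # cost grows with area, not byte size
```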
There is very little value that a company that has to support multiple different providers, such as Cursor, can offer on top of tailored agents (and "unlimited" subscription models) by LLM providers.
A tool that is good for everyone is great for no one.
Also, I think we're seeing the limits on "value" of a chat interface already. Now they're all chasing developers since there's a real potential to improve productivity (or sadly cut-costs) there. But even that is proving difficult.
I am really tempted to buy ChatGPT Pro, and probably would have if I lived in a richer country (unfortunately purchasing power parity doesn't equalize for tech products). The problem with Windsurf (and presumably Cursor and others) is that you buy the IDE subscription and then still have to worry about usage costs. With Codex/Claude Code etc., yeah, it's expensive, but, as long as you're within the usage limits, which are hopefully reasonable for the most expensive plans, you don't have to worry about it. AND you get the web and phone apps with GPT 5 Pro, etc.
I get that the pro plan has $5 of tokens and the pricing page says that a token is roughly 3-4 characters. However, it is not clear:
- Are tokens input characters, output characters, or both?
- What does a token cost? I get that the pricing page says it varies by model and is "API list price +10%", but nowhere does it say what these API list prices are. Am I meant to go to the OpenAI, Anthropic, and other websites to get that pricing information? Shouldn't that be in a table on that page with each hosted model listed?
—
I’m only a very casual user of AI tools so maybe this is clear to people deep in this world, but it’s not clear to me just based on Zed's pricing page exactly how far $5 per month will get me.
It’s hard for me to conceptualise what a million tokens actually looks like, but I don’t think there’s a way around that aside from providing some concrete examples of inputs, outputs, and the number of tokens that actually is. I guess it would become clearer after using it a bit.
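One way to get a feel for what $5 buys: plug in a hypothetical API list price and the 10% markup the pricing page mentions. Every number here except the markup and the ~4-characters-per-token rule of thumb is an assumption, not Zed's actual pricing:

```python
# Purely illustrative: assume a hypothetical API list price of
# $3 per million input tokens and $15 per million output tokens,
# the stated 10% markup, and ~4 characters per token.
budget = 5.00
markup = 1.10
blended_per_m = (3.00 + 15.00) / 2 * markup   # crude 50/50 input/output blend

tokens = budget / blended_per_m * 1e6
chars = tokens * 4
print(f"~{tokens / 1e6:.2f}M tokens, ~{chars / 1e6:.1f}M characters")
```

Under these made-up numbers, $5 comes out to roughly half a million tokens, which is exactly the kind of concrete worked example the commenter is asking Zed's pricing page to provide.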
I think Zed had a lot of good concepts where they could make paid AI benefits optional longer term. I like that you can join your devs to look at different code files and discuss them. I might still pay for Zed's subscription in order to support them long term regardless.
I'm still upset so many hosted models don't just let you use your subscription in things like Zed or JetBrains AI. What's the point of a monthly subscription if I can only use your LLM in a browser?
This is yet another reason why CLI-based coding agents will win. Every editor out there trying to be the middleman between you and an AI provider is nuts.
https://agentclientprotocol.com/overview/introduction
I'm pretty sure that's only while it's in preview, just like they were giving away model access before that was formally launched. Get it while it's hot.
As a corporate purchaser, "bring your own key" is just about the only way we can allow our employees to stay close to the latest happenings in a rapidly moving corner of the industry.
We need to have a decent amount of trust in the model execution environment and we don't like having tons of variable-cost subscriptions. We have that trust in our corporate-managed OpenAI tenant and have good governance and budget controls there, so BYOK lets us have flexibility to put different frontends in front of our trusted execution environment for different use cases.
Just the other day I tried using it for something it sort of advertises itself as being superior at: loading a giant text file instantly and letting me work on it.
I then tried opening this 1 GB text file to do a simple find/replace on it, only to find macOS ran out of system memory, with Zed quickly using 20 GB of memory for that search operation.
I then switched to vscode, which, granted, opened it in a buffered sort of way with limited capability, but got the job done.
Maybe that was a me issue I don’t know, but aside from this one-off, it doesn’t have a good extensions support in the community for my needs yet. I hope it gets there!
TextEdit may be worth looking into as well? Haven’t tested it for large files before.
In my experience, BBEdit will open files that kill other editors: "Handling large files presents no intrinsic problems for BBEdit, though some specific operations may be limited when dealing with files over 2GB in size."
But, you can go faster depending on your usecase:
- If you're trying to manually look through the file, use `less`. You can scroll up and down, go quickly to the top and bottom of the file, and also search the file for strings quickly
- If you already know the string in the file that you're looking for, use ripgrep
- If you're trying to do a search and replace, and you already know what the strings are, use sed. (macOS's built-in sed isn't good, so install GNU sed through Homebrew; it's available as `gsed`)
In fact, I believe that is what vscode uses.
Apparently there’s been a neglected issue about bringing it to Zed. https://github.com/zed-industries/zed/issues/4560
Should be noted that the linked post is almost 15 years old at this point too, so perhaps not the most up to date either.
You can actually disable it in the settings if you want it to try and render the entire thing at once
My JetBrains IDEs (RustRover, Goland) probably would have choked out too.
You can open the kernel in CLion. Don't expect the advanced refactoring features to work, but it can deal with a ~40-million-line project folder, for example.
They'll index for a long time on huge codebases, but I only go through that like once a month max, I just have the editors as always open
I sometimes worry if we are moving too fast for no reason. Some things are becoming standards in an organic way but they feel suboptimal in my own little bias bubble corner.
Maybe I am getting old and struggling to adapt to the new generation way of getting work done, but I have a gut feeling that we need to revisit some of this stuff more deliberately.
I still see Agents as something that will be more like a background thread that yields rather than a first class citizen inside the Editor you observe as it goes.
I don't know about you, but I feel an existential dread whenever I prompt an Agent and turn into a vegetable watching it breathe. — am I using it wrong? Should I be leaving and coming back later? Should I pick a different file and task while it's doing its thing?
My complaints:
No Claude Code hooks support as of this writing. As someone who leverages hooks somewhat heavily, this is why I don't really use it all the time. Though I actually find it to be somewhat of a feature at times, because I can simply run the thing through Zed if I want to temporarily run with no hooks.
Performance is noticeably degraded, presumably because of the “ACP” protocol they invented. I usually work either directly in the Claude Code terminal using its edit tools, or using repoprompt’s MCP editor tools, and both are noticeably faster than running in Zed.
What seems to be a memory leak in the agent window causes sluggish performance, especially when scrolling. It’s not bad enough to make it unusable, but for an editor whose main advertisement is speed it feels particularly painful.
https://github.com/zed-industries/zed/issues/11676
https://github.com/zed-industries/zed/issues/7992
https://github.com/zed-industries/zed/issues/4334
Font rendering should be the most important feature of a text editor.
https://zed.dev/about
A downside that comes with the territory of building the rendering pipeline from scratch is needing to work through the long tail of complex and tradeoff-heavy font rendering issues on different displays, operating systems, drivers, etc.
I know it's taking a while to get through, but I agree it's important!
Every other piece of software in my toolbox is open-source. The scenarios I've described happened to some of those tools, and I maintain my own forks. Currently, Sublime is the single point of failure in my toolbox.
I would buy a source code license if I could.
Which ones?
https://zed.dev/releases/stable
(And that's even with almost none of the work on the massive Windows project being included in the changelog!)
The way I see it, we’re sort of living in a world where UX is king. (Looking at you Cursor)
I feel like there’s a general sentiment where folks just want a sense of home with their tools more than anything. Yes they need to work, but they also need to work for you in your way. Cursor reinvented autocomplete with AI and that felt like home for most, what’s next? I see so much focus on Agents but to me personally that feels more like it should live on the CI/CD layer of things. Editors are built for humans, something isn’t quite there yet, excited to see how it unfolds.
Saying that, token-based pricing has misaligned incentives as well: as the editor developer (charging a margin over the number of tokens) or AI provider, you benefit from more verbose input fed to the LLMs and of course more verbose output from the LLMs.
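The misaligned incentive above fits in a few lines of arithmetic. The API price and markup here are hypothetical placeholders, not Zed's actual figures:

```python
# With a fixed percentage markup over API cost, the middleman's margin
# scales directly with token volume, so verbosity is revenue.
def middleman_margin(tokens_millions, api_price_per_m=10.00, markup=0.10):
    """Dollars the intermediary keeps on top of the pass-through API cost."""
    return tokens_millions * api_price_per_m * markup

print(middleman_margin(1))   # margin on 1M tokens
print(middleman_margin(3))   # 3x the tokens, 3x the margin
```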
Not that I'm really surprised by the announcement though, it was somewhat obviously unsustainable
Presumably everyone is just aiming or hoping for inference costs to go down so much that they can do an unlimited-with-ToS model, like most home Internet access etc., because this intermediate phase of having to count your pennies to ask the matrix multiplier questions isn't going to be very enjoyable or stable, or encourage good companies to succeed.
https://news.ycombinator.com/item?id=45333425
I suspect I’m not alone on this. Zed is not the editor for hardcore agentic editing and that’s fine. I will probably save money on this transition while continuing to support this great editor for what it truly shines at: editing source code.
But as a mostly claude max + zed user happy to see my costs go down.
37 more comments available on Hacker News