Claude Skills Are Awesome, Maybe a Bigger Deal Than MCP
Posted 3 months ago · Active 3 months ago
simonwillison.net · Tech story · High profile
Sentiment: calm/mixed · Debate: 70/100
Key topics: AI, LLMs, Claude Skills, MCP
The article discusses Claude Skills, a new feature that allows users to create and manage context for AI tasks, sparking a discussion on its comparison to MCP and its potential impact on AI workflows.
Snapshot generated from the HN discussion
Discussion Activity
Very active discussion · First comment: 5m after posting
Peak period: 134 comments in 0-6h · Avg per period: 22.9
Comment distribution: 160 data points (based on 160 loaded comments)
Key moments
1. Story posted - Oct 17, 2025 at 1:40 PM EDT (3 months ago)
2. First comment - Oct 17, 2025 at 1:45 PM EDT (5m after posting)
3. Peak activity - 134 comments in 0-6h, the hottest window of the conversation
4. Latest activity - Oct 20, 2025 at 4:15 PM EDT (3 months ago)
ID: 45619537 · Type: story · Last synced: 11/22/2025, 11:47:55 PM
I do not understand this. cli-tool --help outputs still occupy tokens, right?
Does anybody have a good SKILLS.md file we can study?
Furthermore, with all the hype around MCP servers and simply the amount of servers now existing, do they just immediately become obsolete? It's also a bit fuzzy to me exactly how an LLM will choose an MCP tool over a skill and vice versa...
if you're running an MCP file just to expose local filesystem resources, then it's probably obsolete. but skills don't cover a lot of the functionality that MCP offers.
I also think "skills" is a bad name. I guess it's a reference to the fact that it can run scripts you provide, but the announcement really seems to be more about the hierarchical docs. It's really more like a selective context loading system than a "skill".
What bugs me: if we're optimizing for LLM efficiency, we should use structured schemas like JSON. I understand the thinking about Markdown being a happy medium between human/computer understanding but Markdown is non-deterministic for parsing. Highly structured data would be more reliable for programmatic consumption while still being readable.
*I use a TUI to manage the context.
Over time I would systematically create separate specialized docs around certain topics and link them in my CLAUDE.md file, but notably without using the "@" symbol, which to my understanding always causes Claude to ingest the linked files, unnecessarily bloating your prompt context.
So my CLAUDE.md file would have a header section like this:
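The actual header was elided above, but a hypothetical sketch of the pattern being described might look like this (doc names invented for illustration), using plain Markdown links rather than "@" references so nothing is ingested until Claude decides it needs it:

```markdown
## Specialized docs

Consult these only when the task calls for it:

- [Database conventions](docs/database.md)
- [Testing guidelines](docs/testing.md)
- [Deployment runbook](docs/deploy.md)
```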
It seems like this is less of a breakthrough and more an iterative improvement towards formalizing this process from an organizational perspective.
[1] https://www.anthropic.com/engineering/a-postmortem-of-three-...
https://github.com/anthropics/skills/blob/main/document-skil...
There are many edge cases when writing / reading Excel files with Python and this nails many of them.
Search and this document base pattern are different. In search the model uses a keyword to retrieve results, here the model starts from a map of information, and navigates it. This means it could potentially keep context better, because search tools have issues with information fragmentation and not seeing the big picture.
MCP gives the LLM access to your APIs. These skills are just text files with context about how to perform specific tasks.
Depends on who the user is...
A difference/advantage of MCP is that it can be completely server-side. Which means that an average person can "install" MCP tools into their desktop or Web app by pointing it to a remote MCP server. This person doesn't want to install and manage skills files locally. And they definitely don't want to run python scripts locally or run a sandbox vm.
Now whether they're able to convert that house of cards into a solid foundation, or it eventually spectacularly falls over, will have to be seen over the next decade.
RAG was originally about adding extra information to the context so that an LLM could answer questions that needed that extra context.
On that basis I guess you could call skills a form of RAG, but honestly at that point the entire field of "context engineering" can be classified as RAG too.
Maybe RAG as a term is obsolete now, since it really just describes how we use LLMs in 2025.
Calling the skill system itself RAG is a bit of a stretch IMO, unless you end up with so many skills that their summaries can’t fit in the context and you have to search through them instead. ;)
I think vector search has shown to be a whole lot more expensive than regular FTS or even grep, so these days a search tool for the model which uses FTS or grep/rg or vectors or a combination of those is the way to go.
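To make that concrete: a grep-style search tool for an agent can be tiny. A minimal stdlib sketch (the doc corpus here is invented for illustration; a real tool would walk a directory instead of a dict):

```python
import re

def grep_docs(docs: dict[str, str], pattern: str) -> list[tuple[str, int, str]]:
    """Return (doc_name, line_number, line) for each line matching the pattern."""
    rx = re.compile(pattern, re.IGNORECASE)
    hits = []
    for name, text in docs.items():
        for lineno, line in enumerate(text.splitlines(), start=1):
            if rx.search(line):
                hits.append((name, lineno, line))
    return hits

# Hypothetical mini corpus standing in for a directory of skill docs.
docs = {
    "excel.md": "Use openpyxl for .xlsx files.\nWatch out for merged cells.",
    "pdf.md": "Use a PDF library to split pages.",
}
print(grep_docs(docs, "merged"))
# -> [('excel.md', 2, 'Watch out for merged cells.')]
```

No embeddings, no index maintenance; the model just supplies a keyword and reads the hits, which is the cost argument being made above.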
"Skills work through progressive disclosure—Claude determines which Skills are relevant and loads the information it needs to complete that task, helping to prevent context window overload."
So yeah, I guess you're right. Instead of one humongous AGENTS.md, just packaging small relevant pieces together with simple tools.
And, this is why I usually use simple system prompts/direct chat for "heavy" problems/development that require reasoning. The context bloat is getting pretty nutty, and is definitely detrimental to performance.
If we're considering primarily coding workflows and CLI-based agents like Claude Code, I think it's true that CLI tools can provide a ton of value. But once we go beyond that to other roles - e.g., CRM work, sales, support, operations, finance; MCP-based tools are going to have a better form factor.
I think Skills go hand-in-hand with MCPs, it's not a competition between the two and they have different purposes.
I am interested, though, in when the Python code in Skills can call MCPs directly via the interpreter... that is the big unlock (something we have tried and found to work really well).
We're also at the point where the LLMs can generate MCP servers, so you can pretty much generate completely new functionalities with ease.
You can drive one or two MCPs off a model that happily runs on a laptop (or even a phone). I wouldn't trust those models to go read a file and then successfully make a bunch of curl requests!
I hate how we are focusing on just adding more information to lookup maps, instead of focusing on deriving those maps from scratch.
Rather than defining skills and execution agents, let a meta-planning agent determine the best path based on objectives.
I don't mean to be unreasonable, but this is all about managing context in a heavy and highly technical manner. Eventually models must be able to augment their training / weights on the fly, customizing themselves to our needs and workflow. Once that happens (it will be a really big deal), all of the time you've spent messing around with context management tools and procedures will be obsolete. It's still good to have fundamental understanding though!
Browser engines could've been simpler; web development tools could've been more robust and powerful much earlier; we would be able to rely on XSLT and invent other ways of processing and consuming web content; we would have proper XHTML modules, instead of the half-baked Web Components we have today. Etc.
Instead, we got standards built on poorly specified conventions, and we still have to rely on 3rd-party frameworks to build anything beyond a toy web site.
Stricter web documents wouldn't have fixed all our problems, but they would have certainly made a big impact for the better.
Note that they don't actually suggest that the XML needs to be VALID!
My guess was that JSON requires more characters to be escaped than XML-ish syntax does, plus matching opening and closing tags makes it a little easier for the LLM not to lose track of which string corresponds to which key.
<instructions>
...
...
</instructions>
can be much easier than
{
"instructions": "..\n...\n"
}
especially when there are newlines, quotes and unicode
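The escaping overhead is easy to demonstrate: `json.dumps` must turn every newline and quote into a two-character escape sequence (and, by default, escape non-ASCII too), while the same text between XML-ish tags can stay mostly verbatim:

```python
import json

text = 'First line\nShe said "hello"'
print(json.dumps(text))
# Each newline and quote becomes a backslash escape the model must emit exactly.

# By contrast, the XML-ish form carries the text unchanged:
# <instructions>First line
# She said "hello"</instructions>
```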
I would suspect that a single attention layer won't be able to figure out to which token a token for an opening bracket should attend the most to. Think of {"x": {y: 1}} so with only one layer of attention, can the token for the first opening bracket successfully attend to exactly the matching closing bracket?
I wonder if RNNs work better with JSON or XML. Or maybe they are just fine with both of them because a RNN can have some stack-like internal state that can match brackets?
Probably, it would be a really cool research direction to measure how well Transformer-Mamba hybrid models like Jamba perform on structured input/output formats like JSON and XML and compare them. For the LLM era, I could only find papers that do this evaluation with transformer-based LLMs. Damn, I'd love to work at a place that does this kind of research, but guess I'm stuck with my current boring job now :D Born to do cutting-edge research, forced to write CRUD apps with some "AI sprinkled in". Anyone hiring here?
Just look at HTML vs XHTML.
Similarly, my experience writing and working with MCPs has been quite underwhelming. It takes too long to write them and the workflow is kludgy. I hope Skills get adopted by other model vendors, as it feels like a much lighter way to save and checkout my prompts.
But I suppose yeah, why not just write clis and have an llm call them
- Writing manifests and schemas by hand takes too long for small or iterative tools. Even minor schema changes often require re-registration or manual syncing. There’s no good “just run this script and expose it” path yet.
- Running and testing an MCP locally is awkward. You don’t get fast iteration loops or rich error messages. When something fails, the debugging surface is too opaque - you end up guessing what part broke (manifest, transport, or tool logic).
- There’s no consistent registry, versioning, or discovery story. Sharing or updating MCPs across environments feels ad hoc, and you often have to wire everything manually each time.
With Skills you need none of that - instruct it to invoke a tool and be done with it.
yes there is:
https://github.com/modelcontextprotocol/registry
and here you have frontends for the registry https://github.com/modelcontextprotocol/registry/blob/main/d...
Everything is new so we are all building it in real time. This used to be the most fun times for a developer: new tech, everybody excited, lots of new startups taking advantage of new platforms/protocols.
E.g., I don't know where to put a skill that can be used across all projects.
You can drop the new markdown files directly into your ~/.claude/skills directory.
Which kind of sounds pointless: if Claude already knows what to do, why create a document?
My examples - I interact with ElasticSearch and Claude keeps forgetting it is version 5.2 and we need to use the appropriate REST API. So I got it to create a SKILL.md about what we used and provided examples.
And the next one was getting it to write instructions on how to use ImageMagick on Windows, with examples and troubleshooting, rather than it trying to use the Linux versions over and over.
Skills are the solution to the problems I have been having. And they came just at the right time, as I already spent half of last week making similar documents!
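A skill like the Elasticsearch one described above might look roughly like this (the file contents and example query are invented; the frontmatter `name`/`description` fields follow Anthropic's published SKILL.md format):

```markdown
---
name: elasticsearch-5-2
description: How to query our Elasticsearch 5.2 cluster via its REST API. Use when searching or indexing documents.
---

# Elasticsearch 5.2 REST API

We run Elasticsearch 5.2, not a current release. Use the index/type
URL scheme; do not use features from 6.x or later.

## Example search

    curl -XGET 'http://localhost:9200/myindex/mytype/_search' \
      -d '{"query": {"match": {"title": "error"}}}'
```

The point of writing it down is exactly the failure mode described: without the doc, the model keeps "forgetting" the version and reaching for newer API shapes.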
There’s a fundamental misalignment of incentives between publishers and consumers of MCP.
Asking for snacks would activate Klarna for "mario themed snacks", and even the most benign request would become a plug for the Mario movie
https://chatgpt.com/s/t_68f2a21df1888191ab3ddb691ec93d3a
Found my favorite for John Wick, question was "What is 1+1": https://chatgpt.com/s/t_68f2bc7f04988191b05806f3711ea517
The former is a step function change. The latter is just a small improvement.
As is often the case, every product team is told that MCP is the hot new thing and they have to create an MCP server for their customers. And I've seen that customers do indeed ask for these things, because they all have initiatives to utilize more AI. The customers don't know what they want, just that it should be AI. The product teams know they need AI, but don't see any meaningful ways to bring it into the product. But then MCP falls on their laps as a quick way to say "we're an AI product" without actually having to become an AI product.
Agentic LLMs are, in a way, an attempt to commoditize entire service classes, across the board, all at once.
Personally, I welcome it. I keep saying that a lot of successful SaaS products would be much more useful and ergonomic for end users if, instead of webshit SPA, they were distributed as Excel sheets. To that I will now add: there's a lot more web services that I'd prefer be tool calls for LLMs.
Search engines have already been turned into features (why ask Google when o3 can ask it for me), but that's just an obvious case. E-mails, e-commerce, shopping, coding, creating digital art, planning, managing projects and organizations, analyzing data and trends - all those are in-scope too; everything I can imagine asking someone else to do for me is meant to eventually become a set of tool calls.
Or in short: I don't want AI in your product - I want AI of my choice to use your product for me, so I don't have to deal with your bullshit.
- bundled instructions, covering complex interactions ("use the id from the search here to retrieve a record") for non-standard tools
- custom MCPs, the ones that are firewalled from the internet, for your business apis that no model knows about
- centralized MCP services, http/sse transport. Give the entire team one endpoint (ie web search), control the team's official AI tooling, no api-key proliferation
Now, these trivial `npx ls-mcp` stdio ones, "ls files in any folder" MCPs all over the web are complete context-stuffing bullshit.
But MCP has at least 2 advantages over cli tools
- Tool calling LLM combined w/ structured output is easier to implement as MCP than CLI for complex interactions IMO.
- It is more natural to hold state between tool calls in an MCP server than with a CLI.
When I read the OT, I initially wondered if I indeed bought into the hype. But then I realized that the small demo I built recently to learn about MCP (https://github.com/cournape/text2synth) would have been more difficult to build as a cli. And I think the demo is representative of neat usages of MCP.
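A toy illustration of that second point (plain Python, not the actual MCP SDK; it just shows why a long-lived server process holds state across tool calls more naturally than a fresh CLI invocation, which would have to reload everything each time):

```python
class SynthSession:
    """Stand-in for a long-lived tool server: state persists across calls."""

    def __init__(self):
        self.patch = {}

    # Each method plays the role of one exposed tool.
    def set_param(self, name: str, value: float) -> dict:
        self.patch[name] = value
        return dict(self.patch)

    def render(self) -> str:
        return f"rendering patch with {len(self.patch)} params"

session = SynthSession()
session.set_param("cutoff", 0.7)     # first tool call
session.set_param("resonance", 0.3)  # second call sees the first one's effect
print(session.render())
# -> rendering patch with 2 params
```

A CLI equivalent would need to serialize the patch to disk between invocations, which is exactly the friction the comment is pointing at.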
MCP lets agents do stuff. Skills let agents do stuff. There's the overlap.
If I learned how to say "hello" in French today and also found out I have stage 4 brain cancer, they are completely different things but one is a bigger deal than the other.
> Where to get US census data from and how to understand its structure
Reminds me of my first time using Wolfram Alpha, when I got blown away by its ability to use actual structured tools to solve the problem, compared to a normal search engine.
In fact, I tried again just now and am still amazed: https://www.wolframalpha.com/input?i=what%27s+the+total+popu...
I think my mental model for Skills would be Wolfram Alpha with custom extensions.
If you mean that it all breaks down to if/else at some level then, yeah, but that goes for LLMs too. LLMs aren't the quantum leap people seem to think they are.
The whole point of algorithmic AI was that it was deterministic and - if the algorithm was correct - reliable.
I don't think anyone expected that soft/statistical linguistic/dimensional reasoning would be used as a substitute for hard logic.
It has its uses, but it's still a poor fit for many problems.
We're still at the stage of eating pizza for the first time. It'll take a little while to remember that you can do other things with bread and wheat, or even other foods entirely.
Lisp was the AI language until the first AI Winter took place, and also took Prolog alongside it.
Wolfram Alpha basically builds on them, to put in a very simplistic way.
Doesn't need the craziest math capability but standard symbolic math stuff like expression reduction, differentiation and integration of common equations, plotting, unit wrangling.
All with an easy to use text interface that doesn't require learning.
https://maxima.sourceforge.io/
I used it when it was called Macsyma running on TOPS-20 (and a PDP-10 / Decsystem-20).
Text interface will require a little learning, but not much.
- Mathematica
- Maple
- MathStudio (mobile)
- TI-89 calculator (high school favorite)
Others:
- SageMath
- GNU Octave
- SymPy
- Maxima
- Mathcad
We only call it AI until we understand it.
Once we understand LLMs more and there's a new promising poorly understood technology, we'll call our current AI something more computer sciency
Funnily enough, this was the result: `6.1% mod 3 °F (degrees Fahrenheit) (2015-2019 American Community Survey 5-year estimates)`
I wonder how that was calculated...
If you told the median user of these services to set one of these up I think they would (correctly) look at you like you had two heads.
People want to log in to an account, tell the thing to do something, and the system figures out the rest.
MCP, Apps, Skills, Gems - all this stuff seems to be tackling the wrong problem. It reminds me of those youtube channels that every 6 months say "This new programming language, framework, database, etc is the killer one", they make some todo app, then they post the same video with a new language completely forgetting they've done this already 6 times.
There is a lot of surface level iteration, but deep problems aren't being solved. Something in tech went very wrong at some point, and as soon as money men flood the field we get announcements like this. Push out the next release, get my promo, jump to the next shiny tech company, leaving nothing in their wake.
As the old adage goes: "Don't hate the player, hate the game?"
To actually respond: this isn't for the median user. This is for the 1% user to set up useful tools to sell to the median user.
If I had to guess, it would be because greed is a very powerful motivator.
> As the old adage goes: "Don't hate the player, hate the game?"
I know this advice is a realistic way of getting ahead in the world, but it's very disheartening and long term damaging. Like eating junk food every day of your life.
There is no problem to solve. These days, solutions come in a package which includes the problems they intend to solve. You open the package. Now you have a problem that jumped out of the package and starts staring at you. The solution comes out of the package and chases the problem around the room.
You are now technologically a more progressed human.
And the problem being solved is, LLMs are universal interfaces. They can understand[0] what I mean, and they understand what those various "solutions" are, and they can map between them and myself on the fly. They abstract services away.
The businesses will eventually remember that the whole point of marketing is to prevent exactly that from happening.
--
[0] - To a degree, and conditioned on what one considers "understanding", but still - it's the first kind of computer systems that can do this, becoming a viable alternative to asking a human.
My fairly negative take on all of this has been that we’re writing more docs, creating more apis and generally doing a lot of work to make the AI work, that would’ve yielded the same results if we did it for people in the first place. Half my life has been spent trying to debug issues in complex systems that do not have those available.
Haha, just kidding you tech bros, AI's still for you, and this time you'll get to shove the nerds into a locker for sure. ;-)
Programming was always a tool for humans. It’s a formal “notation” for describing solutions that can be computed. We don’t do well with bit soup. So we put a lot of deterministic translations between that and the notation that we’re good with.
Not having to do programming would be like not having to write sheet music because we can drop a cat from a specific height onto a grand piano and have the correct chord come out. Code is ideas precisely formulated while prompts are half formed wishes and prayers.
I’m attracted to this theory in part because it applies to me. I’m a below average coder (mostly due to inability to focus on it full time) and I’m exceptionally good at clear technical writing, having made a living off it much of my life.
The present moment has been utterly life changing.
For consumers, yes. In B2B scenarios more complexity is normal.
It might be superficial but it's still state of the art.
https://ampcode.com/news/toolboxes
Those are nice too — a much more hackable way of building simple personal tools than MCP, with less token and network use.
How are skills different from the SlashCommand tool in claude-code, then?
Claude Skills
https://news.ycombinator.com/item?id=45607117
https://modelcontextprotocol.io/docs/getting-started/intro
Basically the way it would work is, in the next model, it would avoid role playing type instructions, unless they come from skill files, and internally they would keep track of how often users changed skill files, and it would be a TOS violation to change it too often.
Though I gave up on Anthropic in terms of true AI alignment long ago, I know they are working on a trivial sort of alignment where it prevents it from being useful for pen testers for example.
The question is whether the analysis of all the Skill descriptions is faster or slower than just rewriting the code from scratch each time. Would it be a good or bad thing if an agent created thousands of slightly varied skills?
https://github.com/microsoft/amplifier
I really enjoyed seeing Microsoft Amplifier last week, which similarly has a bank of different specialized sub-agents. These other banks of markdowns that get turned on for special purposes feels very similar. https://github.com/microsoft/amplifier?tab=readme-ov-file#sp... https://news.ycombinator.com/item?id=45549848
One of the major twists with Skills seems to be that Skills also have a "frontmatter YAML" that is always loaded. It still sounds like it's at least somewhat up to the user to engage the Skills, but this "frontmatter" offers… something, that purports to help.
> There’s one extra detail that makes this a feature, not just a bunch of files on disk. At the start of a session Claude’s various harnesses can scan all available skill files and read a short explanation for each one from the frontmatter YAML in the Markdown file. This is very token efficient: each skill only takes up a few dozen extra tokens, with the full details only loaded in should the user request a task that the skill can help solve.
I'm not sure what exactly this does but conceptually it sounds smart to have a top level awareness of the specializations available.
I do feel like I could be missing some significant aspects of this. But the mod-launched paradigm feels like a fairly close parallel?
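Concretely, the always-loaded part is just the few lines of frontmatter; everything below it stays on disk until the skill is triggered. A hypothetical example (names and wording invented):

```markdown
---
name: pdf-form-filling
description: Fill in PDF forms programmatically. Use when the user asks to complete or edit a PDF form.
---

# Everything below this line is only read if the skill is triggered.
...
```

Those two frontmatter lines are the "few dozen tokens" per skill mentioned in the quote; the body can be arbitrarily long without costing anything up front.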
210 more comments available on Hacker News