Apps SDK
Posted 3 months ago · Active 3 months ago
developers.openai.com · Tech · story · High profile
skeptical · mixed
Debate
80/100
Key topics
OpenAI
Apps SDK
AI-Powered Apps
Model Context Protocol
OpenAI has launched an Apps SDK, sparking debate among developers and users about its potential, limitations, and implications for the future of AI-powered apps and interfaces.
Snapshot generated from the HN discussion
Discussion Activity
Very active discussion
First comment: 15m
Peak period: 121 comments (0-6h)
Avg / period: 22.9
Comment distribution: 160 data points (based on 160 loaded comments)
Key moments
- 01 Story posted: Oct 6, 2025 at 2:27 PM EDT (3 months ago)
- 02 First comment: Oct 6, 2025 at 2:42 PM EDT (15m after posting)
- 03 Peak activity: 121 comments in the 0-6h window, the hottest stretch of the conversation
- 04 Latest activity: Oct 9, 2025 at 3:47 AM EDT (3 months ago)
ID: 45494558 · Type: story · Last synced: 11/20/2025, 8:14:16 PM
Now, I realize that the best argument for MCP vs function calls in my case, is that I want to allow external products/agents/chatbots to interface with my app. MCP is that standard. I will implement very carefully, but that's what I need to do.
So far, it seems that if you give an LLM a few tools to create projects and other entities, it is very good at using them. The user gets the option of a chat-driven UI for our app, with not that much work for a limited feature set.
Currently building internal MCP servers to make that easy. But I can imagine having a public one in the future.
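To make the "few tools" idea concrete, here is a minimal sketch of what such an internal MCP server could look like, assuming the official Python MCP SDK (the `mcp` package with its FastMCP helper); the `create_project` tool and its fields are hypothetical, not anything from OpenAI's docs.

```python
# Minimal internal MCP server sketch (assumes the official Python MCP SDK).
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("internal-projects")

@mcp.tool()
def create_project(name: str, description: str = "") -> dict:
    """Create a project and return its identifier (hypothetical tool)."""
    # A real server would call the app's existing service layer here.
    project_id = f"proj_{abs(hash(name)) % 10_000}"
    return {"id": project_id, "name": name, "description": description}

if __name__ == "__main__":
    # Runs over stdio; an MCP-capable LLM client launches and connects to it.
    mcp.run()
```

An MCP client discovers `create_project` from the tool listing and can call it when the chat asks to create a project; the same server could later be exposed publicly behind proper auth.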
The problem with this approach is precisely that these apps/widgets have hard-coded input and output schema. They can work quite well when the user asks something within the widget's capabilities, but the brittleness of this approach starts showing quickly in real-world use. What if you want to use more advanced filters with Zillow? Or perhaps cross-reference with StreetEasy? If those features aren't supported by the widget's hard-coded schema, you're out of luck as a user.
What I think is much more exciting is the ability to create completely generative UI answers on the fly. We'll have more to say on this soon from Phind (I'm the founder).
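To illustrate the brittleness described above, here is a hedged sketch (Python MCP SDK assumed; the listings tool and its parameters are invented for illustration) of a widget-style tool whose input schema is fixed at build time. Any filter the user asks for that is not one of these parameters simply cannot be expressed.

```python
# Sketch of a tool with a hard-coded input schema (illustrative, not a real widget).
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("listings-widget")

@mcp.tool()
def search_listings(city: str, min_price: int = 0, max_price: int = 10_000_000,
                    bedrooms: int | None = None) -> list[dict]:
    """Search listings with a fixed filter schema."""
    # A request like "near a dog park" or "cross-reference with StreetEasy" has no
    # corresponding parameter, so the model has nowhere to put that intent.
    return [{"city": city, "price": 850_000, "bedrooms": bedrooms or 2}]
```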
That said, I used it a lot more a year ago. Lately I’ve been using regular LLMs since they’ve gotten better at searching.
Conversational user interfaces are opaque; they lack affordances. https://en.wikipedia.org/wiki/Affordance
> Affordances represent the possibilities in the world for how an agent (a person, animal, or machine) can interact with something. Some affordances are perceivable, others are invisible. Signifiers are signals. Some signifiers are signs, labels, and drawings placed in the world, such as the signs labeled “push,” “pull,” or “exit” on doors, or arrows and diagrams indicating what is to be acted upon or in which direction to gesture, or other instructions. Some signifiers are simply the perceived affordances, such as the handle of a door or the physical structure of a switch. Note that some perceived affordances may not be real: they may look like doors or places to push, or an impediment to entry, when in fact they are not.
With Norman's definition, if a conversational interface can perform an action, it affords that action. The fact that you don't know that it affords that action means there's a lack of a signifier.
As you say, this is a matter of definition, I'm just commenting on Norman's specific definition from the book.
I immediately knew the last generation of voice assistants was dead garbage when there was no way to know what they could do; they just expected you to try 100 things until one worked randomly.
For a concrete example, think a search result listing that can be broken down into a single result or a matrix to compare results, as well as a filter section. So you could ask for different facets of your current context, to iterate over a search session and interact with the results. Dunno, I’m still researching.
Have you written somewhere about your experience with Phind in this area?
Now that models have gotten much more capable, I'd suggest to give the executing model as much freedom with setting (and even determining) the schema as possible.
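One hedged way to give the model that freedom (again assuming the Python MCP SDK; the open-ended `filters` parameter is an illustration, not an established pattern) is to accept a freeform object and let the executing model decide which keys to send.

```python
# Sketch of a tool that lets the model shape its own filters.
from typing import Any

from mcp.server.fastmcp import FastMCP

mcp = FastMCP("flexible-search")

@mcp.tool()
def search(query: str, filters: dict[str, Any] | None = None) -> dict:
    """Search with model-determined filters.

    The model picks the filter keys (e.g. {"near": "dog park", "max_price": 800});
    the server applies what it recognizes and reports what it ignored.
    """
    filters = filters or {}
    recognized = {k: v for k, v in filters.items()
                  if k in {"near", "min_price", "max_price"}}
    ignored = sorted(set(filters) - set(recognized))
    return {"query": query, "applied_filters": recognized, "ignored_filters": ignored}
```

The trade-off is that validation moves out of the schema and into the server, and the model gets less upfront guidance about which filters actually exist.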
Chat paired with the pre-built and on-demand widgets addresses this limitation.
For example, in the keynote demo, they showed how the chat interface lets you perform advanced filtering that pulls together information from multiple sources, like filtering for only Zillow houses near a dog park.
I think most software will follow this trend and become generated on-demand over the next decade.
The only place I can see this working is if the LLM is generating a rich UI on the fly. Otherwise, you're arguing that a text-based UX is going to beat flashy, colourful things.
Sure, but deploying a website or app doesn't mean anyone's going to use it, does it?
I could make an iOS app, I could make a website, I could make a ChatGPT app... if no one uses it, it doesn't matter how big the userbase of iOS, the internet, or ChatGPT is...
Per the docs: 'Every app comes from a verified developer who stands behind their work and provides responsive support'
That's thinly veiled corporate speak for "Fortune 500 or GTFO".
Personally, I hope that's not the future.
For a large number of tasks that cleanly generalize into a stream of tokens, command line or chat is probably superior. We'll get some affordances like tab auto-completion to help remember the names of certain bots or MCP endpoints that can be brought in as needed...
But for anything that involves discovery, graphical interaction feels more intuitive and we'll probably get bespoke interfaces relevant to that particular task at hand with some sort of partially hidden layers to abstract away the token stream?
I was really hoping Apple would make some innovations on the UX side, but they certainly haven’t yet.
They want to be the platform in which you tell what you want, and OAI does it for you. It's gonna connect to your inbox, calendar, payment methods, and you'll just ask it to do something and it will, using those apps.
This means OAI won't need ads. Just rev share.
Ads are definitely there. Just hidden so deeply in the black box which is generating the useful tips :)
One could be, for example, people asking online which tools they should use to build something and constantly being recommended to do it with Next.js.
Another could be: how much of the code that was used to train the LLM was written in Next.js.
Generally, the answer is probably something along the lines of "next.js is kind of the most popular choice at the time of training".
You may have started seeing this when LLMs seem to promote things based entirely on marketing claims and not on real-world functionality.
More or less, SEO spam V2.
In my (non-lawyer) understanding, each message potentially containing sponsored content (which would be every message, if the bias is encoded in the LLM itself,) would need to be marked as an ad individually.
That would make for an odd user interface.
If OpenAI thinks there’s sweet, sweet revenue in email and calendar apps, just waiting to be shared, their investors are in for a big surprise.
OpenAI’s moat will only come from the products they build on top. Theoretically their products will be better because they’ll be more vertically integrated with the underlying models. It’s not unlike Apple’s playbook with regard to hardware and software integration.
[1] This is an example. Which model was the best when is not important.
[0] https://www.nber.org/system/files/working_papers/w34255/w342...
They obviously want both. In fact they are already building an ad team.
They have money they have to burn, so it makes sense to throw all the scalable business models in history (e.g., app store, algorithmic feed) at the wall and see what sticks.
In 2024, iOS App Store generated $1.3T in revenue, 85% of which went to developers.
Edit: yes I understand it is correct, but still it sounds like an insane amount
It is now evident why Flash was murdered.
This is a stupid conspiracy given Apple decided not to support Flash on iPhone since before Jobs came around on third-party apps. (The iPhone was launched with a vision of Apple-only native apps and HTML5 web apps. The latter's performance forced Cupertino's hand into launching the App Store. Then they saw the golden goose.)
HTML5 was new and not widely supported, and the web was WAY more fragmented back then. To put things in perspective, Internet Explorer still had the largest market share, by far. The only thing that could provide the user with a rich interactive experience was Flash; it was also ubiquitous.
Flash was the biggest threat to Apple's App Store; this wasn't a conspiracy. It was evident back then, but I can see why it is not evident to you in 2025. Jobs' open letter was just a formal declaration of war.
Yes. It was a bad bet on the open web by Apple. But it was the one they took when they decided not to support Flash with the original iPhone's launch.
> Flash was the biggest threat to Apple's App Store
Flash was not supported since before there was an App Store. Since before Apple deigned to tolerate third-party native apps.
You can argue that following the App Store's launch, Apple's choice to not start supporting Flash was influenced by pecuniary interests. But it's ahistoric to suggest the reason for the original decision was based on interests Cupertino had ruled out at the time.
That 1T figure is real, but it includes things like if you buy a refrigerator using the Amazon iOS app.
I'm genuinely surprised these companies went with usage-based versus royalty pricing.
Connecting these apps will, at times, require authentication. Where it does not require payment, it's a fantastic distribution channel.
"Find me hotels in Capetown that have a pool by the beach .Should cost between 200 dollars to 800 dollars a night "
I don't see how this is a significant upgrade over the many existing hotel-finder tools. At best it slightly augments them as a first pass, but I would still rather look at an actual map of options than trust a stream of generated, ad-augmented text.
The UI 'cards' will naturally keep growing, and soon you end up back with a full app inside ChatGPT, or ChatGPT just becomes an app launcher.
The only advantage I can see is if ChatGPT can use data from other apps/ chats in your searches e.g. find me hotels in NYC for my upcoming trip (and it already knows the types of hotels you like, your budget and your dates)
However, it might be useful for people who do want to use that instead.
Instead, the model will provide you with a list of (in chat) “apps” that can fulfill your request. SEO becomes AISO (AI Search Optimization). Sites can partly expose data to entice you to choose them.
This time will be different?
I personally prefer well curated information.
The LLM will do the curation.
e.g. Coursera can send back a video player
I could see chat apps becoming dominant in Slack-oriented workplaces. But, like, chatting with an AI to play a song is objectively worse than using Spotify. Dynamically-created music sounds nice until one considers the social context in which non-filler music is heard.
Getting an AI to play "that song that goes hmm hmmm hmmm hmmm ... uh, it was in some commercials when I was a kid" tho
Also their playlists are made by real people (mostly...), so they don't completely suck ass.
Also, following the Beatport top 100 tech house playlist, and hearing how many tracks aren't actually tech house makes me wonder about who makes that particular playlist.
That's how I feel about a lot of AI stuff.
Like... It's neat. It's a fun novelty. It makes a good party trick. It's the software equivalent of a knick knack.
Like 90% of the Pixel AI features. There's some good ones in there, sure, but most of them you play around with for a day and then forget exist.
This isn't me making a cute little website in my free time. This is thousands of developers, super computers out the wazoo, and a huge chunk of the western economy.
Like, a snowglobe is cute. They don't do much, but they're cute. I'd buy one for ten dollars.
I would not buy a snowglobe for 10 million dollars.
Absolutely. The point is this is a specialised and occasional use case. You don't want to have to go through a chat bot every time you want to play a particular song just because sometimes you might hum at it.
The closest we've come to a widely adopted AR interface is AirPods. Critically, however, they work by mimicking how someone would speak to a real human next to them.
There's a whole bizarre subculture in computing that fails to recognize what it is about computers that people actually find valuable.
Everyone wants the next device category. They covet it. Every other company tries to will it into existence.
I’m not very bullish on people wanting to live in the ChatGPT UI, specifically, but the concept of dynamic apps embedded into a chat-experience I think is a reasonable direction.
I’m mostly curious about if and when we get an open standard for this, similar to MCP.
The former is like a Waymo, the latter is like my car suddenly and autonomously deciding that now is a good time to turn into a Dollar Tree to get a COVID vaccine when I'm on my way to drop my kid off at a playdate.
What users want, which various entities religiously avoid providing to us, is a fair price comparison and discovery mechanism for essentially everything. A huge part of the value of LLMs to date is in bypassing much of the obfuscation that exists to perpetuate this, and that's completely counteracted by much of what they're demonstrating here.
So perhaps chatbots are an excellent method for building out a prototype in a new field while you collect usage statistics to build a more refined UX - but it is bizarre that so many businesses seem to be discarding battle tested UXes for chatbots.
Thing is, for those who paid attention to the last chatbot hype cycle, we already knew this. Look at how Google Assistant was portrayed back in 2016. People thought you'd be buying Starbucks via the chat. Turns out the Starbucks app has a better UX.
The only reason for the voice interface is to facilitate the production of a TV show. By having the characters speak their requests aloud to the computer as voice commands, the show bypasses all the issues of building visual effects for computer screens and making those visuals easy to interpret for the audience, regardless of their computing background. However, whenever the show wants to demonstrate a character with a high level of computer mastery, the demonstration is almost always via the touchscreen (this is most often seen with Data), not the voice interface.
TNG had issues like this figured out years ago, yet people continue to fall into the same trap because they repeatedly fail to learn the lessons the show had to teach.
Maybe this is how we all get our own offices again and the open floor plan dies.
"...and that is why we need the resources. Newline, end document. Hey, guys, I just got done with my 60 page report, and need-"
"SELECT ALL, DELETE, SAVE DOCUMENT, FLUSH UNDO, PURGE VERSION HISTORY, CLOSE WINDOW."
Here's hoping this at least gets us back to cubes.
>changes bass to +4 because the unit doesn't do half increments
“No, volume up to 3.5, do not touch the EQ”
>adjusts volume to 4 because the unit doesn’t do half increments
> I reach over, grab my remote, and do it myself
We have a grandparent who really depends on their Alexa, and let me tell you, repeatedly going “hey Alexa, volume down. Hey Alexa, volume down. Hey Alexa, volume down” gets really old, lol. We just walk over and start using the touch interface.
A dedicated UX is better, and a separate app or website feels like exactly the right separation.
Booking flights => browser => Skyscanner => destination typing => evaluating options with AI suggestions on top and a UX to fine-tune if I have out-of-the-ordinary wishes (don’t want to get up so early).
I can’t imagine a human or an AI being better than this specialized UX.
This general concept (embedding third parties as widgets in a larger product) has been tried many times before. Google themselves have done this - by my count - at least three separate times (Search, Maps, and Assistant).
None have been successful in large part because the third party being integrated benefits only marginally from such an integration. The amount of additional traffic these integrations drive generally isn't seen as being worth the loss of UX control and the intermediation in the customer relationship.
MCP standardizes how LLM clients connect to external tools—defining wire formats, authentication flows, and metadata schemas. This means apps you build aren't inherently ChatGPT-specific; they're MCP servers that could work with any MCP-compatible client. The protocol is transport-agnostic and self-describing, with official Python and TypeScript SDKs already available.
That said, the "build our platform" criticism isn't entirely off base. While the protocol is open, practical adoption still depends heavily on ChatGPT's distribution and whether other LLM providers actually implement MCP clients. The real test will be whether this becomes a genuine cross-platform standard or just another way to contribute to OpenAI's ecosystem.
The technical primitives (tool discovery, structured content return, embedded UI resources) are solid and address real integration problems. Whether it succeeds likely depends more on ecosystem dynamics than technical merit.
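As a hedged illustration of those primitives (Python MCP SDK assumed; the resource URI, tool name, and template placeholder are hypothetical, and the way a client binds the widget to the tool output is deliberately glossed over), a server can expose both a structured tool result and an embedded UI resource that a capable client may choose to render:

```python
# Sketch: structured tool output plus an embedded UI resource (illustrative only).
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("course-catalog")

@mcp.resource("ui://widgets/course-player.html")
def course_player() -> str:
    # An HTML snippet a UI-capable client could render; {{video_url}} is a
    # hypothetical placeholder the client would substitute.
    return "<html><body><video controls src='{{video_url}}'></video></body></html>"

@mcp.tool()
def find_course(topic: str) -> dict:
    """Return structured data a client can show as text or feed into a widget."""
    return {
        "topic": topic,
        "title": f"Intro to {topic}",
        "video_url": "https://example.com/videos/intro.mp4",
    }
```

Whether a given client actually renders the resource, and how it wires the tool result into that template, is client-specific; that binding is precisely the layer ChatGPT's Apps SDK adds on top of plain MCP.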
“CEO” Fidji Simo must really need something to do.
Maybe I’m cynical about all of this, but it feels like a whole lot of marketing spin for an MCP standard.
https://lukew.com/ff/entry.asp?2122
222 more comments available on Hacker News