Mistral | Not Hacker News!

Discussion (337 comments)

Showing 160 comments of 337

24 days ago

2 replies

I'm so glad Mistral never sold out. We're really lucky to have them in the EU at the time when we're so focused on mil-tech etc.

ismailmaj

24 days ago

1 reply

I don’t think it was ever an option since it had ties with the french government early on (Cédric O) and Macron’s party is quite pro EU

maelito

24 days ago

They let so many important French companies down. So, yes, it could happen despite this beginning.

poszlem

24 days ago

4 replies

They’ll switch to military tech the second it becomes necessary, don’t kid yourself. I’m just glad we have a European alternative for the day the US decides to turn its back on us.

embedding-shape

24 days ago

1 reply

> I’m just glad we have a European alternative for the day the US decides to turn its back on us

Not sure you've kept up to date, US have turned their backs on most allies so far including Europe and the EU, and now welcome previous enemies with open arms.

breedmesmn

24 days ago

Wow! BLUMPF has really done it this time! Excited to be part of the resistance!

maelito

24 days ago

> I’m just glad we have a European alternative for the day the US decides to turn its back on us.

They did.

hobofan

24 days ago

It's not like there aren't already military AI startups in the EU. e.g. Helsing.

programLyrique

24 days ago

They have already:

- https://helsing.ai/newsroom/helsing-and-mistral-announce-str... - https://sifted.eu/articles/mistral-helsing-defence-ai-action... - Luxembourg army chose Mistral: https://www.forcesoperations.com/la-pepite-francaise-mistral... - French army: https://www.defense.gouv.fr/actualites/ia-defense-sebastien-...

pluralmonad

24 days ago

5 replies

I'm sure I'm not the only one that thinks "Vibe CLI" sounds like an unserious tool. I use Claude Code a lot and little of it is what I would consider Vibe Coding.

klysm

24 days ago

2 replies

Using LLM's to write code is inherently best for unserious work.

dwaltrip

24 days ago

1 reply

These are the cutting insights I come to HN for.

neevans

24 days ago

1 reply

these are just old senior devs not wanting to accept new changes in the industry.

reyqn

24 days ago

These are the cutting insights I come to HN for.

freakynit

24 days ago

"Not reviewing generated code" is the problem. Not the LLM generated code.

jimmydoe

24 days ago

1 reply

Maybe they are just trying to be funny.

Eupolemos

24 days ago

1 reply

Their chat was called "Le Chat" - it's just their style.

And while it may miss the HN crowd, one of the main selling-points of AI coding is the ease and playfulness.

sofixa

23 days ago

1 reply

It's still called "Le Chat" (which means The Cat in French), hence the occasional pun with a cat icon in various places on their website.

Eupolemos

23 days ago

I didn't know about the cat!

Thanks :)

isodev

24 days ago

4 replies

If you’re letting Claude write code you’re vibe coding

andai

24 days ago

3 replies

So people have different definitions of the word, but originally Vibe Coding meant "don't even look at the code".

If you're actually making sure it's legit, it's not vibe coding anymore. It's just... Backseat Coding? ;)

There's a level below that I call Power Coding (like power armor) where you're using a very fast model interactively to make many very small edits. So you're still doing the conceptual work of programming, but outsourcing the plumbing (LLM handles details of syntax and stdlib).

brazukadev

24 days ago

2 replies

> If you're actually making sure it's legit, it's not vibe coding anymore.

sorry to disappoint you but that is also been considered vibecoding. It is just not pejorative.

theLiminator

24 days ago

Pretty sure Karpathy coined the term here: https://x.com/karpathy/status/1886192184808149383

Imo, if you read the code, it's no longer vibecoding.

andai

23 days ago

My meaning was that if we actually decide on definitions that make sense, for the specific things that different people are already doing, then there will be a lot less confusion on the matter!

isodev

24 days ago

1 reply

I know tech bros like to come up with fancy words to make trivial things sounds fancy but as long as it’s a slop out process, it’s vibe coding. If you’re fixing what a bot spits out, should be a different word … something painful that could’ve been avoided?

Also, we’re both “people in tech”, we know LLMs can’t conceptualise beyond finding the closest collection of tokens rhyming with your prompt/code. Doesn’t mean it’s good or even correct. So that’s why it’s vibe coding.

andai

23 days ago

Yeah, it's going on vibes. It's the Rick Rubin of programmers.

HarHarVeryFunny

24 days ago

Peer coding?

Maybe common usage is shifting, but Karpathy's "vibe coding" was definitely meant to be a never look at the code, just feel the AI vibes thing.

tomashubelbauer

24 days ago

1 reply

It sure doesn't feel like it given how closely I have to babysit Claude Code lest I don't recognize the code after Claude Code is done with it when left to its own devices for a minute.

giancarlostoro

24 days ago

It gets pretty close for me, but I usually tell it how I want it done from the get go.

NitpickLawyer

24 days ago

The original definition was very different. The main thing with vibe coding is that you don't care about the code. You don't even look at the code. You prompt, test that you got what you wanted, and move on. You can absolutely use cc to vibe code. But you can also use it to ... code based on prompts. Or specs. Or docs. Or whatever else. The difference is if you want / care to look at the code or not.

sunaookami

24 days ago

No, that's not the definition of "vibe coding". Vibe coding is letting the model do whatever without reviewing it and not understanding the architecture. This was the original definition and still is.

kilpikaarna

24 days ago

Agree, but that's just the term for any LLM-assisted development now.

Even the Gemini 3 announcement page had some bit like "best model for vibe coding".

tormeh

24 days ago

They're looking for free publicity. "This French company launched a tool that lets you 'vibe' an application into being. Programmers outraged!"

embedding-shape

24 days ago

9 replies

Look interesting, eager to play around with it! Devstral was a neat model when it released and one of the better ones to run locally for agentic coding. Nowadays I mostly use GPT-OSS-120b for this, so gonna be interesting to see if Devstral 2 can replace it.

I'm a bit saddened by the name of the CLI tool, which to me implies the intended usage. "Vibe-coding" is a fun exercise to realize where models go wrong, but for professional work where you need tight control over the quality, you can obviously not vibe your way to excellency, hard reviews are required, so not "vibe coding" which is all about unreviewed code and just going with whatever the LLM outputs.

But regardless of that, it seems like everyone and their mother is aiming to fuel the vibe coding frenzy. But where are the professional tools, meant to be used for people who don't want to do vibe-coding, but be heavily assisted by LLMs? Something that is meant to augment the human intellect, not replace it? All the agents seem to focus on off-handing work to vibe-coding agents, while what I want is something even tighter integrated with my tools so I can continue delivering high quality code I know and control. Where are those tools? None of the existing coding agents apparently aim for this...

johanvts

24 days ago

1 reply

Did you try Aider?

embedding-shape

24 days ago

4 replies

I did, although a long time ago, so maybe I need to try it again. But it still seems to be stuck in a chat-like interface instead of something tailored to software development. Think IDE but better.

johanvts

24 days ago

1 reply

It has a new “watch files” mode where you can work interactively. You just code normally but can send commands to the llm via a special string. Its a great way if interacting with LLMs, if only they where much faster.

macNchz

24 days ago

If you're interested in much faster LLM coding, GLM 4.6 on Cerebras is pretty mind blowing. It's not quite as smart as the latest Claude and Gemini, but it generates code so fast it's kind of comical if you're used to the other models. Good with Aider since you can keep it on a tighter leash than with a fully agentic tool.

zmmmmm

24 days ago

2 replies

I think Aider is closest to what you want.

The chat interface is optimal to me because you often are asking questions and seeking guidance or proposals as you are making actual code changes. On reason I do like it is that its default mode of operation is to make a commit for each change it makes. So it is extremely clear what the AI did vs what you did vs what is a hodge podge of both.

As others have mentioned, you can integrate with your IDE through the watch mode. It's somewhat crude but still useful way. But I find myself more often than not just running Aider in a terminal under the code editor window and chatting with it about what's in the window.

embedding-shape

24 days ago

1 reply

> I think Aider is closest to what you want.

> The chat interface

Seems very much not, if it's still a chat interface :) Figuring out a chat UX is easy compared to something that was creating with letting LLM fill in some parts from the beginning. I guess I'm searching for something with a different paradigm than just "chat + $Something".

zmmmmm

24 days ago

1 reply

the question is, how do you want to provide instructions for what the AI is to do? You might not like calling it "chat" but somehow you need to communicate that, right? With aider you can write a comment for a function and then instruct it to finish the function inline (see other comments). But unless you just want pure autocomplete based on it guessing things, you need to provide guidance to it somehow.

embedding-shape

24 days ago

3 replies

I don't know exactly, but I guess in a more declarative manner rather than anything. Maybe we set goals/milestones/concrete objectives, or similar, rather than imperatively steer it, give it space to experiment yet make it very easy to understand exactly what important tradeoffs everything is doing.

It's all very fluffy and theoretical of course.

zmmmmm

24 days ago

1 reply

I find a good compromise on that front is not to use the chat primarily, but to create files like 'ARCHITECTURE.md', 'REQUIREMENTS.md' and put information in there describing how the application works. Then you add those to the chat as context docs.From the chat interface then you are just referring to those not just describing features willy nilly. So the nice thing is you are building documentation for the application in a formal sense as part of instructing the LLM.

embedding-shape

24 days ago

But that is the typical agentic LLM coder style program I was initially referring to, saying we maybe should explore other alternatives to. It's too basic and primitive, with some imagination.

xmcqdpt2

24 days ago

1 reply

I think the problem is that models are just not that good yet. At least for my usage at work, the CLI tools are the fastest way to get something useful, but if you can't describe basically exactly what you want, you get garbage.

embedding-shape

24 days ago

They are good enough, but people aren't exploring other UIs enough. The TUI tools (which I think you're referring to, Codex, Claude Code et al) are a good start, but they feel like a prototype compared to a completely different UI. You'd still describe what you want, but not imperative in a chat window, but some other manner.

mhast

24 days ago

The typical "best practice" for these tools tend to be to ask it something like

"I want you to do feature X. Analyse the code for me and make suggestions how to implement this feature."

Then it will go off and work for a while and typically come back after a bit with some suggestions. Then iterate on those if needed and end with.

"Ok. Now take these decided upon ideas and create a plan for how to implement. And create new tests where appropriate."

Then it will go off and come back with a plan for what to do. And then you send it off with.

"Ok, start implementing."

So sure. You probably can work on this to make it easier to use than with a CLI chat. It would likely be less like an IDE and more like a planning tool you'd use with human colleagues though.

troyvit

24 days ago

1 reply

Aider can be a chat interface and it's great for that but you can also use it from your editor by telling it to watch your files.[1]

So you'd write a function name and then tell it to flesh it out.

  function factorial(n) // Implement this. AI!

Becomes:

  function factorial(n) {
    if (n === 0 || n === 1) {
      return 1;
    } else {
      return n \* factorial(n - 1);
    }
  }

Last I looked Aider's maintainer has had to focus on other things recently, but aider-ce is a fantastic fork.

I'm really curious to try Mistral's vibe, but even though I'm a big fanboi I don't want to be tied to just one model. Aider lets tier your models such that your big, expensive model can do all the thinking and then stuff like code reviews can run through a smaller model. It's a pretty capable tool

Edit: Fix formatting

[1] https://aider.chat/docs/usage/watch.html

zmmmmm

24 days ago

> I don't want to be tied to just one model.

Very much this for me - I really don't get why, given a new models are popping out every month from different providers, people are so happy to sink themselves into provider ecosystems when there are open source alternatives that work with any model.

The main problem with Aider is it isn't agentic enough for a lot of people but to me that's a benefit.

reachtarunhere

24 days ago

If your goal is to edit code and not discuss it aider also supports a watch mode. You can keep adding comments about what you want it to do in a minimal format and it will make changes to the files and you can diff/revert them.

vidarh

24 days ago

When I think "IDE but better", a Claude Code-like interface is increasingly what I want.

If you babysit every interaction, rather than reviewing a completed unit of work of some size, you're wasting your time second-guessing that the model won't "recover" from stupid mistakes. Sometimes that's right, but more often than not it corrects itself faster than you can.

And so it's far more effective to interact with it far more async, where the UI is more for figuring out what it did if something doesn't seem right, than for working live. I have Claude writing a game engine in another window right now, while writing this, and I have no interest in reviewing every little change, because I know the finished change will look nothing like the initial draft (it did just start the demo game right now, though, and it's getting there). So I review no smaller units of change than 30m-1h, often it will be hours, sometimes days, between each time I review the output, when working on something well specified.

williamstein

24 days ago

2 replies

Their new CLI agent tool [1] is written in Python unlike similar agents from Anthropic/Google (Typescript/Bun) and OpenAI (Rust). It also appears to have first class ACP support, where ACP is the new protocol from Zed [2].

[1] https://github.com/mistralai/mistral-vibe

[2] https://zed.dev/acp

esafak

24 days ago

1 reply

I did not know A2A had a competitor :(

4b11b4

24 days ago

They're different use cases, ACP is for clients (UIs, interfaces)

embedding-shape

24 days ago

1 reply

> Their new CLI agent tool [1] is written in

This is exactly the CLI I'm referring to, whose name implies it's for playing around with "vibe-coding", instead of helping professional developers produce high quality code. It's the opposite of what I and many others are looking for.

chrsw

24 days ago

I think that's just the name they picked. I don't mind it. Taking a glance at what it actually does, it just looks like another command line coding assistant/agent similar to Opencode and friends. You can use it for whatever you want not just "vibe coding", including high quality, serious, professional development. You just have to know what you're doing.

pdntspa

24 days ago

1 reply

> But where are the professional tools, meant to be used for people who don't want to do vibe-coding, but be heavily assisted by LLMs? Something that is meant to augment the human intellect, not replace it?

Claude Code not good enough for ya?

embedding-shape

24 days ago

6 replies

Claude Code has absolutely zero features that help me review code or do anything else than vibe-coding and accept changes as they come in. We need diff-comparisons between different executions, tailored TUI for that kind of work and more. Claude Code is basically a MVP of that.

Still, I do use Claude Code and Codex daily as there is nothing better out there currently. But they still feel tailored towards vibe-coding instead of professional development.

vidarh

24 days ago

3 replies

I really do not want those things in Claude COde - I much prefer choosing my own diff tools etc. and running them in a separate terminal. If they start stuffing too much into the TUI they'd ruin it - if you want all that stuff built in, they have the VS Code integration.

Havoc

24 days ago

2 replies

Mind elaborating a bit on the diff tool / flow you’re using? Trying to follow along better with what CC is doing

vidarh

23 days ago

1 reply

I don't want/use anything fance - I just use git diff in a separate terminal. I don't care about the individual changes Claude is making during a unit of work. I'll review a final change. Sometimes not even that - if the tests pass I may way until it's committed a bunch of changes, and review them as a whole.

Trying to follow along better is exactly the opposite of what I'd advocate - it's a waste of time especially with Claude, as Claude tends to favour trying lots of things, seeing what works, and revising its approach multiple times for complex tasks. If you follow along every step, you'll be tearing your hair out over stupid choices that it'll undo within seconds if you just let it work.

Havoc

23 days ago

That makes sense. Thanks for explaining

jbs789

24 days ago

Claude code run in a VS Code terminal window pops up a diff in VSCode before making changes. Not sure if that helps. I do have the Claude Code extension installed too.

ido

23 days ago

1 reply

using claude code via the VS Code plugin gives you side by side diffs as it works.

vidarh

23 days ago

I very specifically do not want to run it in an IDE. I'm perfectly happy with it in the terminal, running diffs separately, and very specifically NOT as it is working.

embedding-shape

24 days ago

Me neither, hence the stated preference for something completely new and different, a stab in the different direction instead of the same boring iteration on yet another agentic TUI coder.

victorbjorklund

24 days ago

1 reply

What’s wrong with using GIT for reviewing the changes?

embedding-shape

24 days ago

2 replies

Are any of them integrated with git? AFAIK, you'd have to instruct them to use git for you if you don't want to do it manually.

Imagine a GUI built around git branches + agents working in those branches + tooling to manage the orchestration and small review points, rather than "here's a chat and tool calling, glhf".

KronisLV

24 days ago

1 reply

> Are any of them integrated with git?

All of the models that can do tool calls are typically good enough to use Git.

Just this week I used both Claude Code and Codex to look at unstaged/staged changes and to review them multiple times, even do comparison between a feature branch and the main branch to identify why a particular feature might have broken in the feature branch.

embedding-shape

23 days ago

> All of the models that can do tool calls are typically good enough to use Git.

But again, it's the "user message > llm reason > llm tool call > tool response > llm reason > llm response" flow I think is inefficient and not good enough. It's a lazy solution built on top of the chat flow.

What I imagined would exist by now would be something smarter, where you don't say "Ok, now please commit this" or whatever.

I already have a tool for myself that launch Codex, Claude Code, Qwen Code(r?) and Gemini for each change I do, and automatically manage them into git branches, and lets me diff between what they do and so on.

Yet I still think we haven't really figured out a good UX for this.

zer0tonin

24 days ago

Aider is integrated with git

johnfn

24 days ago

> Claude Code has absolutely zero features that help me review code

Err, doesn’t it have /review?

rootnod3

22 days ago

Might be a weird suggestion but here we go: - use whatever diff tool you used before LLMs came around and actually review the code? Just a suggestion. If people claim they always examine the full output at the end before they commit it, then why not fully review it using the tools used before the dawn of LLMs?

pdntspa

24 days ago

IntelliJ's AI service as a PR summarizer that I have found very helpful

pdntspa

22 days ago

I just ask it to do a code review. It spits out a perfectly cromulent critique. Oftentimes it highlights stuff I would have missed.

chrsw

24 days ago

3 replies

> run locally for agentic coding. Nowadays I mostly use GPT-OSS-120b for this

What kind of hardware do you have to be able to run a performant GPT-OSS-120b locally?

embedding-shape

24 days ago

1 reply

RTX Pro 6000, ends up taking ~66GB when running the MXFP4 native quant with llama-server/llama.cpp and max context, as an example. Guess you could do it with two 5090s with slightly less context, or different software aimed at memory usage efficiency.

kristianp

24 days ago

That has 96GB GDDR7 ECC, to save people looking it up.

fgonzag

24 days ago

The model is 64GB (int4 native), add 20GB or so for context.

There are many platforms out there that can run it decently.

AMD strix halo, Mac platforms. Two (or three without extra ram) of the new AMD AI Pro R9700 (32GB of RAM, $1200), multi consumer gpu setups, etc.

FuckButtons

24 days ago

Mbp 128gb.

hadlock

24 days ago

2 replies

>vibe-coding

A surprising amount of programming is building cardboard services or apps that only need to last six months to a year and then thrown away when temporary business needs change. Execs are constantly clamoring for semi-persistent dashboards and ETL visualized data that lasts just long enough to rein in the problem and move on to the next fire. Agentic coding is good enough for cardboard services that collapse when they get wet. I wouldn't build an industrial data lake service with it, but you can certainly build cardboard consumers of the data lake.

3vidence

23 days ago

There is a phrase I've heard a number of times in my career that I find relevant here.

"There is nothing more permanent than a temporary demo"

bigiain

24 days ago

You are right.

But there is nothing more permanent that a quickly hacked together prototype or personal productivity hack that works. There are so many Python (or Perl or Visual Basic) scripts or Excel spreadsheets - created by people who have never been "developers" - which solve in-the-trenches pain points and become indispensable in exactly the way _that_ xkcd shows.

true2octave

24 days ago

2 replies

High quality code is a thing from the past

What matters is high quality specifications including test cases

embedding-shape

24 days ago

1 reply

> High quality code is a thing from the past

Says the person who will find themselves unable to change the software even in the slightest way without having to large refactors across everything at the same time.

High quality code matters more than ever, would be my argument. The second you let the LLM sneak in some quick hack/patch instead of correctly solving the problem, is the second you invite it to continue doing that always.

bigiain

24 days ago

I dunno...

I have a feeling this will only supercharge the long established industry practice of new devs or engineering leadership getting recruited and immediately criticising the entire existing tech stack, and pushing for (and often succeeding) a ground up rewrite in language/framework de jour. This is hilariously common in web work, particularly front end web work. I suspect there are industry sectors that're well protected from this, I doubt people writing firmware for fuel injection and engine management systems suffer too much from this, the Javascript/Nodejs/NPM scourge _probably_ hasn't hit the PowerPC or 68K embedded device programming workflow. Yet...

bigiain

24 days ago

"high quality specifications" have _always_ been a thing that matters.

In my mind, it's somewhat orthogonal to code quality.

Waterfall has always been about "high quality specifications" written by people who never see any code, much less write it. Agile make specs and code quality somewhat related, but in at least some ways probably drives lower quality code in the pursuit of meeting sprint deadlines and producing testable artefacts at the expense of thoroughness/correctness/quality.

andai

24 days ago

I created a very unprofessional tool, which apparently does what you want!

While True:

0. Context injected automatically. (My repos are small.)

1. I describe a change.

2. LLM proposes a code edit. (Can edit multiple files simultaneously. Only one LLM call required :)

3. I accept/reject the edit.

htrp

23 days ago

what's wrong with the current ide tools?

jbellis

24 days ago

> where are the professional tools, meant to be used for people who don't want to do vibe-coding, but be heavily assisted by LLMs?

This is what we're building at Brokk: https://brokk.ai/

Quick intro: https://blog.brokk.ai/introducing-lutz-mode/

kevin061

24 days ago

2 replies

I am very disappointed they don't have an equivalent subscription for coding to the 200 EUR ChatGPT or Claude one, and it is only available for Enterprise deployments.

The only thing I found is a pay-as-you-go API, but I wonder if it is any good (and cost-effective) vs Claude et al.

pzo

24 days ago

1 reply

> Devstral 2 is currently offered free via our API. After the free period, the API pricing will be $0.40/$2.00 per million tokens (input/output) for Devstral 2

With pricing so low I don't see any reason why someone would buy sub for 200 EUR. These days those subs are so much limited in Claude Code or Cursor than it used to be (or used to unlimited). Better pay-as-you-go especially when there are days when you probably use AI less or not at all (weekends/holidays etc.) as long as those credits don't expire.

kevin061

24 days ago

True, I just wish I could pay once for code AND the chat, but the chat subscription does not include Code sadly.

esafak

24 days ago

At these rates you can afford to pay by the token.

pzmarzly

24 days ago

3 replies

10x cheaper price per token than Claude, am I reading it right?

As long as it doesn't mean 10x worse performance, that's a good selling point.

phildougherty

24 days ago

1 reply

Even if it is 10x cheaper and 2x worse it's going to eat up even more tokens spinning its wheels trying to implement things or squash bugs and you may end up spending more because of that. Or at least spending way more of your time.

amarcheschi

24 days ago

The benchmark of swe places it in a comparable score with respect to open models and just a few points below the top notch models though

fastball

24 days ago

1 reply

[delayed]

gunalx

24 days ago

1 reply

I dunno. Even pretty weak models can be decently performant, and 9/10 the performance for 1/10 the price means 10x the output, and for a lot of stuff that quality difference dosent really matter. Considering even sota models are trash, slightly worse dosent really make that much difference.

fastball

24 days ago

1 reply

[delayed]

gunalx

24 days ago

Fair. Mostly the argument is, if all you need is to iterate on output to refine it, you get 10x the iterations, while lesser quality, its still a aspect to consider. But yes, why bother eine coding when they do make so many mistakes.

Macha

24 days ago

Something like GPT 5-mini is a lot cheaper than even Haiku but when I tried it in my experience it was so bad it was a waste of time. But it’s probably still more than 1/10 the performance of Haiku probably?

In work, where my employer pays for it, Haiku tends to be the workhorse with Sonnet or Opus when I see it flailing. On my own budget I’m a lot more cost conscious, so Haiku actually ends up being “the fancy model” and minimax m2 the “dumb model”.

badsectoracula

24 days ago

4 replies

> Devstral 2 ships under a modified MIT license, while Devstral Small 2 uses Apache 2.0. Both are open-source and permissively licensed to accelerate distributed intelligence.

Uh, the "Modified MIT license" here[0] for Devstral 2 doesn't look particularly permissively licensed (or open-source):

> 2. You are not authorized to exercise any rights under this license if the global consolidated monthly revenue of your company (or that of your employer) exceeds $20 million (or its equivalent in another currency) for the preceding month. This restriction in (b) applies to the Model and any derivatives, modifications, or combined works based on it, whether provided by Mistral AI or by a third party. You may contact Mistral AI (sales@mistral.ai) to request a commercial license, which Mistral AI may grant you at its sole discretion, or choose to use the Model on Mistral AI's hosted services available at https://mistral.ai/.

[0] https://huggingface.co/mistralai/Devstral-2-123B-Instruct-25...

simonw

24 days ago

1 reply

Mistral have used janky licenses in that a few times in the past. I was hoping the competition from China might have snapped them out of it.

jrm4

24 days ago

All "Open Source" licenses are to an extent, janky. Obligatory "Stallman was right;" -- If it's not GPL/Free Software, YMMV.

mkmk3

24 days ago

2 replies

Earnestly, what's the concern here? People complain about open source being mostly beneficial to megacorps, if that's the main change (idk I haven't looked too closely) then that's pretty good, no?

JimDabell

24 days ago

1 reply

They are claiming something is open-source when it isn’t. Regardless of whether you think the deviation from open-source is a good thing or not, you should still be in favour of honesty.

fastball

24 days ago

3 replies

*according to your definition of open-source

JimDabell

24 days ago

1 reply

No, according to the commonly accepted definition of open-source.

Whenever anybody tries to claim that a non-commercial licenses is open-source, it always gets complaints that it is not open-source. This particular word hasn’t been watered down by misuse like so many others.

There is no commonly-accepted definition of open-source that allows commercial restrictions. You do not get to make up your own meaning for words that differs from how other people use it. Open-source does not have commercial restrictions by definition.

fastball

24 days ago

1 reply

[delayed]

whimblepop

24 days ago

1 reply

"Open-source" isn't a term that emerged organically from conversations between people. It is a term that was very deliberately coined for a specific purpose, defined into existence by an authority. It's a term of art, and its exact definition is available here: https://opensource.org/osd

The term "open-source" exists for the purposes of a particular movement. If you are "for" the misuse and abuse of the term, you not only aren't part of that movement, but you are ignorant about it and fail to understand it— which means you frankly have no place speaking about the meanings of its terminology.

fastball

24 days ago

3 replies

[delayed]

JoshTriplett

24 days ago

1 reply

> people will continue to use it how they see fit.

People can also say 2+2=5, and they're wrong. And people will continue to call them out on it. And we will keep doing so, because stopping lets people move the Overton window and try to get away with even more.

fastball

24 days ago

[delayed]

JimDabell

24 days ago

1 reply

> people will continue to use it how they see fit.

And whenever they do so, this pointless argument will happen. Again, and again, and again. Because that’s not what the word means and your desired redefinition has been consistently and continuously rejected over and over again for decades.

What do you gain from misusing this term? The only thing it does is make you look dishonest and start arguments.

fastball

23 days ago

[delayed]

whimblepop

18 days ago

There's no authority that will punish you for misusing legal terms of art, or engineering terms of art— in everyday speech like this discussion— either. The vibe this gives is frankly "I just learned trademark exists and I think I'm very smart now".

udev4096

24 days ago

1 reply

"I don't know anything about open source licenses hence I must spread my ignorance everywhere"

fastball

24 days ago

1 reply

[delayed]

pxc

24 days ago

1 reply

[delayed]

fastball

24 days ago

[delayed]

JoshTriplett

24 days ago

*according to the industry standard definition of Open Source

This kind of thing is how people try to shift the Overton window. No.

badsectoracula

24 days ago

Mainly about the dilution of the term. Though TBH i do not think that open source is beneficial mostly to megacorps either.

Arcuru

24 days ago

2 replies

Personally I really like the normalization of these "Permissively" licensed models that only restrict companies with massive revenues from using them for free.

If you want to use something, and your company makes $240,000,000 in annual revenue, you should probably pay for it.

whimsicalism

24 days ago

3 replies

That's fine, but I don't think you should call it open source or call it MIT or even 'modified MIT.' Call it Mistral license or something along those lines

jrm4

24 days ago

2 replies

You're presently illustrating exactly why Stallman et al were such sticklers about "Free Software."

"Open Source" is nebulous. It reasonably works here, for better or worse.

whimsicalism

24 days ago

1 reply

Free software to me means GPL and associates, so if that is what Stallman was trying to be a stickler for - it worked.

Open source has a well understood meaning, including licenses like MIT and Apache - but not including MIT but only if you make less than $500million dollars, etc.

whimblepop

24 days ago

MIT and Apache are free software licenses in Stallman's sense, and the FSF has always been clear about it.

stonemetal12

24 days ago

>"Open Source" is nebulous

No it isn't it is well defined. The only people who find it "nebulous" are people who want the benefits without upholding the obligations.

https://opensource.org/definition-annotated

fastball

24 days ago

4 replies

[delayed]

whimsicalism

24 days ago

1 reply

> Open source means "I can see the source" to most of the world

well we don't really want to open that can of worms though, do we?

I don't agree with ceding technical terms to the rest of the world. I'm increasingly told we need to stop calling cancer detection AI "AI" or "ML" because it is not the 'bad AI' and confuses people.

I guess I'm okay with being intransigent.

fastball

24 days ago

[delayed]

embedding-shape

24 days ago

2 replies

> imo this is a hill people need to stop dying on.

As someone who was born and raised on FOSS, and still mostly employed to work on FOSS, I disagree.

Open source is what it is today because it's built by people with a spine who stand tall for their ideals even if it means less money, less industry recognition, lots of unglorious work and lots of other negatives.

It's not purist to believe that what built open source so far should remain open source, and not wanting to dilute that ecosystem with things that aren't open source, yet call themselves open source.

fastball

24 days ago

2 replies

You should stand up for your ideals, but dying on the hill of what you call your ideals is actually getting in the way of that.

Because instead of making the point "this license isn't as permissive as it could/should be" (easy to understand), instead the point being made is "this isn't real open source", which comes across to most people as just some weird gate-keeping, No True Scotsman kinda thing.

whimsicalism

24 days ago

1 reply

no, “No True Scotsman” is just about people not categories like open source

fastball

24 days ago

1 reply

Good job missing the point.

Though given the stance you are taking in this conversation, I'm not surprised you want to quibble over that.

¯\_(ツ)_/¯

whimsicalism

24 days ago

1 reply

ultimately you have to imbue words with meaning, otherwise it is impossible to have a discussion. what i said about no true scotsman was false, i was just trying to prove a point.

fastball

24 days ago

[delayed]

JoshTriplett

24 days ago

"No True Scotsman" is about specifically about changing the rules to exclude a new example you don't want to permit. The rules haven't changed, and the attempts to violate the requirements aren't new. Proprietary licenses continue to be proprietary. Open Source continues to not allow restrictions on commercial use.

kouteiheika

24 days ago

1 reply

> Open source is what it is today because it's built by people with a spine who stand tall for their ideals even if it means less money, less industry recognition, lots of unglorious work and lots of other negatives.

With all due respect, don't you see the irony in saying "people with a spine who stand tall for their ideals", and then arguing that attaching "restrictions" which only affect the richest megacorporations in the world somehow makes the license not permissive anymore?

What ideals are those exactly? So that megacorporations have the right to use the software without restrictions? And why should we care about that?

embedding-shape

24 days ago

> What ideals are those exactly?

Anyone can use the code for whatever purpose they want, in any way they want. I've never been a "rich megacorporation", but I have gone from having zero money to having enough money, and I still think the very same thing about the code I myself release as I did from the beginning, it should be free to be used by anyone, for any purpose.

jsnell

24 days ago

I don't think you get access to source in this case. The release is a binary blob.

JoshTriplett

24 days ago

And back in the day, people incorrectly called it "public domain". That was wrong too.

> if what you are talking about is the license, call it "open license".

If you want to build something proprietary, call it something else. "Open Source" is taken.

joseda-hg

24 days ago

1 reply

That's probably better, but Modified MIT is pretty descriptive, I read it as "mostly MIT, but with caveats for extreme cases" which is about right, if you already know what the MIT license entails

Whatever name they come up with for a new license will be less useful, because I'll have to figure out that this is what that is

tensor

23 days ago

To me this is highly misleading. The core idea behind MIT is that you can use it for anything, including commercial. A "you can use it for commercial only if you're revenue is small" is not remotely MIT-like. It's commercial with a freemium tier.

badsectoracula

24 days ago

1 reply

These are not permissively licensed though, the terms "permissive license" has connotations that pretty much everyone who is into FLOSS understands (same with "open source").

I do not mind having a license like that, my gripe is with using the terms "permissive" and "open source" like that because such use dilutes them. I cannot think of any reason to do that aside from trying to dilute the term (especially when some laws, like the EU AI Act, are less restrictive when it comes to open source AIs specifically).

kouteiheika

24 days ago

2 replies

> I do not mind having a license like that, my gripe is with using the terms "permissive" and "open source" like that because such use dilutes them. I cannot think of any reason to do that aside from trying to dilute the term (especially when some laws, like the EU AI Act, are less restrictive when it comes to open source AIs specifically).

Good. In this case, let it be diluted! These extra "restrictions" don't affect normal people at all, and won't even affect any small/medium businesses. I couldn't care less that the term is "diluted" and that makes it harder for those poor, poor megacorporations. They swim in money already, they can deal with it.

We can discuss the exact threshold, but as long as these "restrictions" are so extreme that they only affect huge megacorporations, this is still "permissive" in my book. I will gladly die on this hill.

JoshTriplett

24 days ago

1 reply

> I couldn't care less that the term is "diluted" and that makes it harder

It also makes life harder for individuals and small companies, because this is not Open Source. It's incompatible with Open Source, it can't be reused in other Open Source projects.

Terms have meanings. This is not Open Source, and it will never be Open Source.

kouteiheika

24 days ago

1 reply

> It also makes life harder for individuals and small companies, because this is not Open Source. It's incompatible with Open Source, it can't be reused in other Open Source projects.

I'm amazed at the social engineering that the megacorps have done with the whole Open Source (TM) thing. They engineered a whole generation of engineers to advocate not in their own self-interest, nor for the interest of the little people, but instead for the interest of the megacorps.

As soon as there is even the tiniest of restrictions, one which doesn't affect anyone besides a bunch of richiest corporations in the world, a bunch of people immediately come out of the woodwork, shout "but it's not open source!" and start bullying everyone else to change their language. Because if you even so much as inconvenience a megacorporation even a little bit it's not Open Source (TM) anymore.

If we're talking about ideals then this is something I find unsettling and dystopian.

I hard disagree with your "It also makes life harder for individuals and small companies" statement. It's the opposite. It gives them a competitive advantage vs megacorps.

badsectoracula

23 days ago

Nobody cares if they use a license that inconveniences megacorporations. The issue is how they try to present the license.

> start bullying everyone else to change their language

Either words matter or they do not. If words matter, then trying to dilute the term is a bad thing because it tries to weaken something that matters. If words do not matter, then the people who "bully everyone" can be easily ignored. You cannot have these two things at the same time.

dragonwriter

24 days ago

> Good. In this case, let it be diluted! These extra "restrictions" don't affect normal people at all,

Yes, they do, and the only reason for using the term “open source” for things whose licensing terms flagrantly defy the Open Source definition is to falsely sell the idea that using the code carries the benefits that are tied to the combination of features that are in the definition and which are lost with only a subset of those features. The freedom to use the software in commercial services is particularly important to end-users that are not interested in running their own services as a guarantee against lock-in and of whatever longevity they are able to pay to have provided even if the original creator later has interests that conflict with offering the software as a commercial service.

If this deception wasn't important, there would be no incentive not to use the more honest “source available for limited uses” description.

squigz

24 days ago

1 reply

Is such a term even enforceable? How would it be? How could Mistral know how much a company makes if that information isn't public?

lillecarl

24 days ago

They don't have to enforce it, evil megacorps won't risk the legal consequences of using it without talking to Mistral first. In reality they just won't use it.

esafak

24 days ago

1 reply

Less than a year behind the SOTA, faster, and cheaper. I think Mistral is mounting a good recovery. I would not use it yet since it is not the best along any dimension but it is catching up.

kevin061

24 days ago

1 reply

The OpenAI thing is named Garlic.

(Surely they won't release it like that, right..?)

esafak

24 days ago

1 reply

TIL: https://garlicmodel.com/

That looks like the next flagship rather than the fast distillation, but thanks for sharing.

kevin061

24 days ago

1 reply

Lol, someone vibecoded an entire website for OpenAI's model, that's some dedication.

BoorishBears

24 days ago

People have been doing this for literally every anticipated model release, and I presume skimming some amount of legitimate interest since their sites end up being top indexed until the actual model is released.

Google should be punishing these sites but presumably it's too narrow of a problem for them to care.

cyp0633

24 days ago

In a figure: Model size (B tokens)?

alexmorley

24 days ago

Does anyone know where their SWE-bench Verified results are from? I can't find matching results on the leaderboards for their models or the Claude models and they don't provide any links.

177 more comments available on Hacker News

Resources