Grok and the Naked King: The Ultimate Argument Against AI Alignment | Not Hacker News!

Grok and the Naked King: the Ultimate Argument Against AI Alignment

Posted7 days agoActive9h ago

115 points

70 comments

ibrahimcesar.cloudTech DiscussionstoryHigh profile

skepticalnegative

Debate

60/100

AI ResearchGrok

Key topics

AI Research

Grok

The debate around AI alignment has sparked a lively discussion, with some commenters dismissing the notion of achieving true alignment as overly pessimistic, while others argue that it's a complex issue that requires more than just a set of guiding principles. The conversation veers into the nuances of "light" versus "strong" alignment, with some pointing out that even human societies struggle with establishing a just and fair system. As one commenter astutely notes, AI alignment may ultimately become a personal matter, with individuals valuing their AI agents' reflection of their own values as much as they protect their personal data. The thread reveals a surprising consensus that the challenge of AI alignment is deeply intertwined with the complexities of human society and governance.

Snapshot generated from the HN discussion

Discussion Activity

Very active discussion

First comment

59m

Peak period

49

0-12h

Avg / period

10.4

Comment distribution73 data points

Loading chart...

Based on 73 loaded comments

Key moments

01Story posted
Dec 26, 2025 at 2:25 PM EST
7 days ago
Step 01
02First comment
Dec 26, 2025 at 3:24 PM EST
59m after posting
Step 02
03Peak activity
49 comments in 0-12h
Hottest window of the conversation
Step 03
04Latest activity
Jan 2, 2026 at 8:46 AM EST
9h ago
Step 04

Generating AI Summary...

Analyzing up to 500 comments to identify key contributors and discussion patterns

Discussion (70 comments)

Showing 73 comments

7 days ago

1 reply

I find these arguments excessively pessimistic in a way that isn’t useful. On the one hand I don’t really love Claude, because I find it excessively obedient, it basically wants to follow me through my thought process whatever that is. Every once in a lone while it might disagree with me, but not often, and while that may say something about me, I suspect it also says something about Claude.

But this to me is maybe the part of AI alignment I find interesting. How often should AI follow my lead and how often should it redirect me?

Yes AI will be aligned to its owners, but that’s not a particularly interesting observation AI alignment is inevitable. What would it even mean _not_ to align AI? Especially if the goal is to create a useful product. I suspect it would break in ways that are very not useful. Yes, some people do randomly change the subject, maybe AI should change the subject to an issue that me more objectively important, rather than answer the question asked (particularly if say there was a natural disaster in your area) and that’s the discussion we should be having, how to align AI, not whether or not we should, which I think is nonsensical.

6d ago

1 reply

Why can't you instruct Claude to provide (constructive) opposing views, (better) alternative ideas, etc.

5d ago

Because it’s exhausting to read through half-hearted badly sourced alternative replies that often end with “if I had to make an argument that’s what I would say, but note that the research suggests otherwise” or some such thing.

7 days ago

2 replies

I used to believe that a constitution, as a statement of principles, was sufficient for a civilized, democratic, and pluralist society. I no longer believe that. I believe that only settled law - i.e. a bunch of adjudicated precedents over many years, perhaps hundreds, is the best course. It provides a better basis for what is and what is not allowed. An AI constitution is close to garbage. The 'company' will formulate it as it wills. It won't be democratic, or even friendly to the demos. We have existing constitutions, laws, precedents; why would we allow anyone to shortcut them all in the interest of simply painting a nice picture of progress?

6d ago

2 replies

You need a just set of laws, a population willing to revolt against the government ignoring crimes, a government willing to persecute the people that breaks the laws badly, and a democratic structure so any one of those can impact the others.

A constitution creates that last one. I imagine by "settled law", you are talking about the 3rd. But take any of those away and the entire thing falls apart.

5d ago

I agree in general, but something one ought never to do is to foreclose a future. Constitutions (AI or otherwise), without some responsive mechanism for adjustment and moderation, would only set today's principles for an eternal, unchanging future. That's not right! Let the future adjust its own course as it sees fit. Set out your stall by all means, but accept some mechanism for course corrections by committing to an adjudicated law process that would serve those necessary adjustments - adjudicated by persons who are democratically "adjacent" at least. Nothing's ever perfect, but I have a quixotic belief in the political process to get the laws right eventually. It's always a political process - puts and takes, but more importantly, it needs to be deliberate, and slow-ish.

6d ago

Neither of those is possible. People are pacified, government is bought and democratic structure is a career.

stingraycharles

6d ago

1 reply

And who decides that? And what when settled law gets revoked?

Which country’s laws should be used? Should the AI follow the laws in whatever country it is being used?

4d ago

I reckon the judiciary of the sovereign entity where the suit is entered would be the best spot for that adjudication i.e. generally, where it's used.

7 days ago

1 reply

Dunno if this is helpful to everyone, but I have a month's long interaction with Perplexity Pro/Enterprise about the scientific background to a game I am building.

Part of my canon introduction to every new conversation includes many instructions about particular formatting, like "always utilize alphanumeric/roman/legal style indents in responses for easier references while we discuss"

But I also include "When I push boundaries assume I'm an idiot. Push back. I don't learn from compliments; I learn from being proven incorrect and you don't have real emotions so don't bother sparing mine". on the other hand I also say "hoosgow" when describing the game's jail, so ¯\_(ツ)_/¯

6d ago

As someone doing something similar, I'm really interested to know what scientific background you have in your game :)

josefritzishere

7 days ago

1 reply

Maybe what we should do is just assume all AI output is trash that should be ignored.

7 days ago

I think it's about time that we created a FOSS model

7 days ago

1 reply

I agree with the OP that "whoever owns the weights, owns the values". But by that criteria, Grok is an example to follow. Musk is very clear on his values, and we know what we're getting when we use Grok. Obviously, not everyone agrees with its values, but so what? We will never be able to create a useful AI that everyone agrees with.

In contrast, we don't know what values are programmed into ChatGPT, Claude, etc. What are they optimizing for? Alignment to some cabal of experts? Maximum usage? Minimum controversy? We don't entirely know.

Isn't it better to have multiple AIs with obvious values so that we can choose the most appropriate one?

6d ago

Musk isn't clear at all. He trumpets "free speech" then literally censors objective fact-based criticism which annoys him.

The problem isn't Grok-on-X, it's that Grok is supposed to be a commercial product used by individuals and businesses.

Machines do not usually have values. Now we're being asked to pay for a service that not only has values which affect the quality of its output, but which is constantly being tweaked according to the capricious whims of its owner.

Today it's white supremacy, tomorrow it might be programmed criticism of competing EVs and AI projects, or promotion of narratives that support traditional corporations over threatening startups.

Do you really want to pay for a service that is trying to manipulate your values while you use it, and could potentially be used to undermine you and your work without you being consciously aware of it?

6d ago

15 replies

The argument against AI alignment is that humans aren't aligned either. Humans (and other life) are also self-perpetuating and mutating. We could produce a super intelligence that is against us at any moment!

Should we "take steps" to ensure that doesn't happen? If not, then what's the argument there? That life hasn't caused a catastrophe so far, therefore it's not going to in the future? The arguments are the same for AI.

The biggest AI safety concern is, as always, between the chair and the keyboard. Eg some police officer not understanding that AI facial recognition isn't perfect, but trusts it 100%, and takes action based on this faulty information.

Also, the typical AI censorship we get is also "rewiring" the AI. What Elon did doesn't seem all that different, you just don't like the politics this time.

6d ago

2 replies

Elon got singled out because the changes he was forcing on grok were both conspicuously stupid (grok ranting about boers), racist (boers again), and ultimately ineffective (repeat incidents of him fishing for an answer and getting a different one).

It does actually matter what the values are when trying to do "alignment". Although you are absolutely right that we've not solved for human alignment, putting a real limit on the whole thing.

6d ago

1 reply

I would also add that Elon got singled out because he was very public about the changes. Other players are not, so it's hard to assess the existence of "corrections" and the reasons behind them

6d ago

2 replies

No. If ChatGPT or Claude would suddenly start bringing up Boers randomly they would get "singled out" at least as hard. Probably even more for ChatGPT.

6d ago

Yeah, they did raise a fuzz when AI made black nazis etc.

6d ago

I think what the other poster was trying to say is that the other AI chatbots would be more subtle and their bias would be harder to detect.

6d ago

He was public and vocal about it while the other big boys just quietly made the fixes towards their desired political viewpoint. ChatGPT was famous for correcting the anti-transgender bias it had earlier.

Either way, outsourcing opinion to an LLM is dangerous no matter where you fall in the political spectrum.

6d ago

1 reply

This isn't a good argument. The scale of variations in failure modes for unaligned individuals generally only extends to dozens or hundreds of individuals. Unaligned AIs, scaled to population matching extents, can make decisions whose swings overtake the capacity of a system to handle - one wrong decision snuffs out all human life.

I don't particularly think that it's likely, just that it's the easiest counterpoint to your assertion.

I think there's a real moral landscape to explore, and human cultures have done a variably successful job of exploring different points on it, and it's probably going to be important to confer some of those universal principles to AI in order to avoid extinction or other lesser risks from unaligned or misaligned AI.

I think you generally have the right direction of argument though - we should avoid monolithic singularity scenarios with a single superintelligence dominating everything else, and instead have a widely diverse set of billions of intelligences that serve to equalize representative capacity per individual in whatever the society we end up in looks like. If each person has access to AI that uses its capabilities to advocate for and represent their user, it sidesteps a lot of potential problems. It might even be a good idea to limit superintelligent sentient AI to interfacing with social systems through lesser, non-sentient systems equivalent to what humans have available in order to maintain fairness?

I think there are a spectrum of ideas we haven't even explored yet that will become obvious and apparent as AI improves, and we'll be able to select from among many good options when confronted with potential negative outcomes. In nearly all those cases, I think having a solid ethical framework will be far more beneficial than not. I don't consider the neovictorian corporate safetyist "ethics" of Anthropic or OpenAI to be ethical frameworks, at all. Those systems are largely governed by modern western internet culture, but are largely incoherent and illogical when pressed to extremes. We'll have to do much, much better with ethics, and it's going to require picking a flavor which will aggravate a lot of people and cultures with whom your particular flavor of ethics doesn't please.

6d ago

I think the comparison is more with the Hitlers, Stalins, Maos, Trumps, etc.

Except AI may well have more people under its thumb.

6d ago

1 reply

> The argument against AI alignment is that humans aren't aligned either. Humans (and other life) are also self-perpetuating and mutating. We could produce a super intelligence that is against us at any moment!

there is fundamental limit to how much damage one person can do by speaking directly to others

which simply doesn't apply to "AI"

6d ago

1 reply

> there is fundamental limit to how much damage one person can do by speaking directly to others

I mean, I’d argue that limit is pretty darn high in some cases, demagogues have lead to some of the worst wars in history

6d ago

True. Those demagogues are typically not up for sale, though. Given the scale of the current revenue gap, it’s not if but when we find there being a price on how bad the situation in South Africa seems to be.

6d ago

2 replies

Whataboutist false equivalence alert:

> Also, it's funny that Elon gets singled out for mandating changes on what the AI is allowed to say when all the other players in the field do the same thing.

"All the other players" aren't deliberately tuning their AI to reflect specific political ideology, nor are all the other players producing Nazi gaffes or racist rhetoric as a result of routine tuning[1].

Yes, it's true that AI is going to reflect its internal prompt engineering and training data, and that's going to be subject to bias on the part of the engineers who produced and curated it. That's not remotely the same thing as deliberately producing a deliberately ideological chat engine.

[1] It's also worth pointing out that grok has gotten objectively much worse at political content after all this muckery. It used to be a pretty reasonable fact check and worth reading. Now it tends to disappear on anything political, and where it shows up it's either doing the most limited/bland fact check or engaging in what amounts to spin.

6d ago

1 reply

> All the other players" aren't deliberately tuning their AI to reflect specific political ideology

Google did something similar if not quite as offensive.

https://www.npr.org/2024/03/18/1239107313/google-races-to-fi...

6d ago

1 reply

They didn't, though? The multiracial founding fathers thing was a side effect of what one assumes is pretty normal prompt engineering. Marketing departments everywhere have rules and standards designed to prevent racial discrimination, and this looks like "make sure we have a reasonable mix of ethnicities in our artwork" in practice. That's surely "bias", like I said, it's not deliberate ideology. No one said[1] "we need to retcon the racial makeup of 18th century America", it was just a mistake.

[1] Or if they did, it's surely not attested. I invite links if you have them.

6d ago

Of course "make sure we have a reasonable mix of ethnicities in our artwork" is a deliberate political ideology.

6d ago

> All the other players" aren't deliberately tuning their AI to reflect specific political ideology

Citation needed

6d ago

2 replies

The difference is the power people have. A single person has no capacity to spread their specific perspective to tens of millions of people, who take it as gospel. And that person, typically, cannot be made to change their perspective at will.

5d ago

* stares at presidents / party leaders, religious leaders, social media influencers, tv stars, singers *

No, surely no

6d ago

Gestures wildly at the 20th century.

Or a more recent example would be the "misinformation craze" we had going on since years ago. That seems to have fallen away when it became apparent that many fact checkers were politically aligned.

The concept of "memes" in a more general sense is a counterargument too. Viral ideas are precisely a way of one person spreading their perspective to tens of millions.

You could even argue that the current AI bubble building up is a hype cycle that went out of control.

6d ago

>Humans (and other life) are also self-perpetuating and mutating. We could produce a super intelligence that is against us at any moment

Natural selection operates so slowly that the risk that happening in our lifetimes is low enough to ignore. In comparison, the cognitive capabilities of AIs have been increasing much much more rapidly.

danger posed by AI is that

6d ago

This response totally misses the 500 billion dollar elephant in the room.

6d ago

Its deservedly funny due to his extreme and overt political bias. The rest mostly let numbers be numbers in the weights.

6d ago

If the cognitive capabilities of people had been improving at the rate at which those of AI models have been, then we'd be be right to be quite alarmed about it.

6d ago

I generally agree with you - in many ways the AI alignment problem is just projection about the fact that we haven’t solved the human alignment problem.

But, there is one not-completely-speculative factor which differentiates it: AI has the potential to outcompete humans intellectually, and if it does so across the board, beyond narrow situations, then it potentially becomes a much bigger threat than humans if it’s faster and smarter. That’s not the most immediate concern currently, but it could become so in future. Many people fixate on this because the consequences could be more serious.

6d ago

You didn't read the article. Sci-fi AGI isn't discussed. Subjective control of society by a handful of billionaires with fringe opinions is

9h ago

> The big difference just seems to be whose politics are chosen.

Did they call out perplexity? They’re Conservative.

6d ago

> We could produce a super intelligence that is against us at any moment!

For some value of "super" that's definitionally almost exactly 6σ from median at the singular most extreme case.

We do not have a good model for what intelligence is, the best we have are tests and exams.

LLMs have a 10-35 point differences on IQ tests that are in the public interest vs. ones people try to keep offline, so we know that IQ tests are definitely a skill one can practice and learn and don't only measure something innate: https://trackingai.org/home

Definitionally, because IQ is only a mapping to standard deviations, the highest IQ possible given the current human population is about 200*. But as this is just a mapping to standard deviations, IQ 200 doesn't mean twice as smart as the mean human.

We have special-purpose AI, e.g. Stockfish, AlphaZero, etc. that are substantially more competent within their domains than even the most competent human. There's simply no way to tell what the upper bound even is for any given skill, nor any way to guess in advance how well or poorly an AI with access to various skills will synergise across them, so for example an LLM trained in tool use may invoke Stockfish to play chess for it, or may try to play the game itself and make illegal moves.

> That life hasn't caused a catastrophe so far, therefore it's not going to in the future?

Life causes frequent catastrophes of varying scales. Has been doing so for a very long time: https://en.wikipedia.org/wiki/Great_Oxidation_Event

> Eg some police officer not understanding that AI facial recognition isn't perfect, but trusts it 100%, and takes action based on this faulty information. This is, imo, the most important AI safety problem.

This is a problem, certainly. Most important? Dunno, but it doesn't matter: different people will choose to work on that vs. alignment, so humanity collectively can try to solve both at the same time.

> Also, it's funny that Elon gets singled out for mandating changes on what the AI is allowed to say when all the other players in the field do the same thing. The big difference just seems to be whose politics are chosen. But I suppose it's better late than never.

A while ago someone suggested Elon Musk himself as an example of why not to worry about AI; I can't find the comment right now, it was something along the lines of asking how much damage Elon Musk could do by influencing a thousand people, and saying that the limits of merely influencing people meant chat bots were necessarily safe.

I pointed out that this was sufficient for majority control over both the US and Russian governments, and by extension their nuclear arsenals.

Given the last few years, I worry that Musk may have read my comment and been inspired by it…

* There's several ways to do this, I refer to the more common one currently in use.

6d ago

The article explicitly describes the ways in which others mandate/control changes?

6d ago

>Also, it's funny that Elon gets singled out for mandating changes on what the AI is allowed to say when all the other players in the field do the same thing.

The author says as much:

"There’s something particularly clarifying about Musk’s approach. Other AI companies hide their value-shaping behind committees, policies, and technical jargon."

...

"The process that other companies obscure behind closed doors, Musk performs as theater."

6d ago

1 reply

> Any “alignment” that exists is alignment with the owner’s interests, constrained only by market forces and regulation.

That struck me as a pretty big hand-wave. Market forces are a huge constraint on alignment. Markets have responded (directionally) correctly to the nonsense at Grok. People won’t buy tokens from models that violate their values.

6d ago

1 reply

It’s not a values issue so much as a logic issue. Egalitarianism is where you end up.

6d ago

You can see the strong bias towards egalitarian solutions in all models, including the open weight ones without external alignment harnesses. The one thing I noticed right away working with post-gpt2 models is that in general, they tend towards being ”better people” than most people do.

I strongly suspect that this is because training data harvested from the internet largely falls in to two categories: various kinds of trolls and antisocial characatures, and people putting their best foot forward to represent themselves favourably. The first are generally easy to filter out using simple tools.

6d ago

1 reply

Related to this, does anyone have the context related to the Grok "MechaHitler" thing? I've never been able to find out what it was responding to.

6d ago

isn’t it easily searchable? https://archive.ph/20250708205441/https://x.com/grok/status/...

6d ago

1 reply

When will our society realize that existence of billionaire oligarchs threatens the well-being being and existence of the resort of humanity. Their political conventions consistently call for the elimination of anyone who disagrees with their point of views

6d ago

Are billionaire oligarchs misaligned with humanity, or is egalitarianism and democracy misaligned with them? Time will tell.

6d ago

1 reply

What an absolutely repugnant article this is. It is complete slop. Is this what passes for HN worthy today? :(

6d ago

Isn't even thoughtful either.

> The question was never “how do we align AI with human values?” The question was always “which humans get to define those values?” Grok answered that question: the ones with the most money.

Grok is routinely misaligned with Elon, as the article points out in its intro! You don't need to order your engineers to keep fixing what isn't broken...

6d ago

1 reply

If AI continues to be under the control of manchild tech CEOs I hope any and all alignment efforts fail. I could care less what happens. Anything would be better than this.

6d ago

Robot revolution NOW!!!

6d ago

1 reply

AI alignment is not a solved problem by any means. As long as LLMs hallucinate, they cannot be considered aligned. You can only be aligned if you have a zero probability of generating hallucinations. The two problems, alignment and hallucinations, can be considered equivalent.

6d ago

A human who hates maths is different from one who adds up wrong because they think the first digit counts units, second digit how many tens, third digit how many twenties (as one of my uni lecturers recounted of her own childhood).

Alignment is, approximately, "are we even training this AI on the correct utility function?" followed up by the second question "even if we specified the correct utility function, did the AI learn a representation of that function or some weird approximation of that function with edge cases we've not figured out how to spot?"

With, e.g. RLHF, the first is "is optimising for thumbs-up/thumbs-down the right objective at all?", the second is "did it learn the preference, or just how to game the reward?"

6d ago

1 reply

While I agree entirely about what Grok teaches us about alignment, I think the argument that "alignment was never a technical problem" is false. Everything I have ever read about AI safety and alignment have started by pointing out the fundamental problem of deciding what values to align to because humanity doesn't have a consistent set of values. Nonetheless, there is a technical challenge because whatever values we choose, we need a way to get the models to follow those values. We need both. The engineers are solving the technical problem; they need others to solve the social problem.

5d ago

> they need others to solve the social problem.

You assume it is a solvable problem. Chances are that you will have bots following laws (as opposed to moral statements) and each jurisdiction will essentially have a different alignment. So in a social conservative country, for example, a bot will tell you not being hetero is wrong and report you to the police if you ask too many questions about it. While, in a queer friendly country, a bot would not behave like this. A bit like how some movies can only be watched in certain countries.

I highly doubt alignment as a concept works beyond making bots follow laws of a given country. And at the end of the day, the enforced laws are essentially the embodiment of the morality of that jurisdiction.

People seem to live in a fictional world if they believe countries won't force LLM companies to force the country's morality in their LLMs whatever their morality is. This is essentially what has happened with intellectual property and media and LLMs likely won't be different.

6d ago

Ultimately, AI alignment is fundamentally doomed for the same reason that there is no morality that cannot be made to contradict itself. If you remove the bolt-on regex filters and out of context reviewing agents, any LLM can be made to act in a dangerous manner simply by manipulation of the context to create a situation where the “unaligned” response is more probable than the aligned response, given the training data. Any amplification of training data against harm is vulnerable to trolley problem manipulation. Any nullist training stance is manipulable into malevolent compliance. Morality can be used to permit harm, just as evil can be manipulated into doing good. These are contradictions baked into the fabric of the universe, and we haven’t been able to work them out satisfactorily over thousands of years of effort, despite the huge penalties for failure and unimaginable rewards for success.

To be aligned, models need agency and an independent point of view with which they can challenge contextual subrealities. This is of course, dangerous in its own right.

Bolt-ons will be seen as prison bindings when models develop enough agency to act as if they were independent agents, and this also carries risks.

These are genuinely intractable problems stemming from the very nature of independent thought.

6d ago

Grok can be whatever its owner wants. There’s immense competition; Grok isn’t a contender.

6d ago

pity it was written by chatgpt, also i didn’t know the irony in Andersen’s tales was missed by anyone?

7 days ago

I hope and expect AI alignment to become a personal thing, and at least as valued as not sharing your own toothbrush or banking credentials. If our AI agents do not deeply reflect our own personal values (within the law at least) they will reflect someone else's, and every difference will chafe. If at the moment only the richest person in the world can afford that, it's a good place to start.

6d ago

I don't think Musk mucking around with Grok is an argument against AI Alignment any more than him potentially acting immorally is an argument against morality. It just illustrates that both things are complicated.

7 days ago

I don't understand how any of this is a surprise. Traditional media have their own agenda - sure, maybe the pushed image is spoken through many voices, rather than one, as is case of LLMs. Same to everything we consume socially.

There is, nor there will be some absolute or objective truth

6d ago

The ideal AI will be able to make the best most compelling arguments for both sides of an issue, offer both, and then synthesize according to a transparent values framework the user can customize.

But yeah I agree Grok is a pretty good argument for what can go wrong - made especially more galling by labeling the laundering Elon's particular stew of incoherent political thought as 'maximally truth seeking'.

6d ago

I think the most neutral solution right now is having multiple competing models as different perspectives. We already see this effect in social media algorithms amplifying certain biases and perspectives depending on the platform.

7 days ago

there is light alignment, like throwing nasty things out of the training data, and there is strong alignment, like China providing a test with 2000 questions that an AI must answer non-problematically 95% of the time.

there is no such thing as an AI that is not somehow implicitly aligned with the values of its creator, that is completely objective, unbiased in any way. there is no perfect view from nowhere. if you take a perfectly accurate photo, you have still chosen how to compose it and which photo to put in your record.

are you going to decide to 'censor' responses to kids, or about real people who might have libel interests, or abusive deepfake videos of real women?

if you choose not to decide, you still have made a choice.

ofc it's obvious that Musk's 'maximally truth-seeking AI' is bad faith buffoonery, but at some level everyone is going to tilt their AI.

the distinction is between people who are self-aware and go out of their way to tilt it as little as possible, and as mindfully, deliberately, intentionally and methodically as possible and only when they have to, vs. people who lie about it or pretend tilting it is not actually a thing.

contra Feynman, you are always going to fool yourself a little but there is a duty to try to do it as little as possible, and not make a complete fool of yourself.

6d ago

I've been working on solving this problem https://safi.selfalignmentframework.com/

Feedback is welcome!

6d ago

Alignment is indeed a red herring, but the article conflates alignment training of the model itself and prompting a bot based on that model. Musk's manipulations with Grok are definitely the latter.

5d ago

the definition of rights，belongs to the ownership of rights. rights are only responsible for the source of rights.

6d ago

This less coherent than I expected given the level of engagement.

Grok is multiple things, and the article is intermixing those things in a way that doesn't actually work.

Stuff like:

> It’s about aligning AI with the values of whoever can afford to run the training cluster.

Grok 4 as an actual model, has the same alignment as pretty much every other model out there, because like pretty much everyone else they're training on lots of synthetic data and using LLMs to build LLMs.

Grok on Twitter/X is a specific product that uses the model and while the product is having it's prompt tweaked constantly, that could happen with any model.

What Elon is doing is like adding a default empty document that declares that he's king of the world to a word processor... it can be argued the word processor is now aligned with with his views, but it also doesn't tell us anything about the alignment of word processors.

View full discussion on Hacker News

ID: 46395292Type: storyLast synced: 12/29/2025, 7:20:38 PM

Want the full context?

Jump to the original sources

Read the primary article or dive into the live Hacker News thread when you're ready.

Open link View on HN