History Llms: Models Trained Exclusively on Pre-1913 Texts

Posted15 days agoActive16h ago

iamwil

871 points

409 comments

github.comProgrammingstoryHigh profile

informativeneutral

Debate

20/100

Large Language ModelsHistorical TextsConversational UI

Key topics

Large Language Models

Historical Texts

Conversational UI

The fascinating world of "time-locked" language models is abuzz with discussion around a new project training LLMs exclusively on pre-1913 texts, sparking debate on the implications of such a temporal snapshot. Commenters are weighing in on the nature of these models, with some likening them to a "topographic map of language" [ada1981] and others dismissing them as "seemingly intelligent autocomplete engines" [DonHopkins]. As the conversation unfolds, a nuanced understanding emerges, with some pointing out that these models "embody their training data" [saaaaaam] and others highlighting the complexity of their operation [observationist]. Amidst the discussion, a consensus is forming that LLMs are more than simple autocomplete, but the exact nature of their capabilities remains a topic of lively debate.

Snapshot generated from the HN discussion

Discussion Activity

Very active discussion

First comment

32m

Peak period

138

Day 1

Avg / period

22.9

Comment distribution160 data points

Loading chart...

Based on 160 loaded comments

Key moments

01Story posted
Dec 18, 2025 at 5:39 PM EST
15 days ago
Step 01
02First comment
Dec 18, 2025 at 6:11 PM EST
32m after posting
Step 02
03Peak activity
138 comments in Day 1
Hottest window of the conversation
Step 03
04Latest activity
Jan 1, 2026 at 10:50 PM EST
16h ago
Step 04

Generating AI Summary...

Analyzing up to 500 comments to identify key contributors and discussion patterns

Discussion (409 comments)

Showing 160 comments of 409

superkuh

15 days ago

1 reply

smbc did a comic about this: http://smbc-comics.com/comic/copyright The punchline is that the moral and ethical norms of pre-1913 texts are not exactly compatible with modern norms.

GaryBluto

15 days ago

That's the point of this project, to have an LLM that reflects the moral and ethical norms of pre-1913 texts.

saaaaaam

15 days ago

16 replies

“Time-locked models don't roleplay; they embody their training data. Ranke-4B-1913 doesn't know about WWI because WWI hasn't happened in its textual universe. It can be surprised by your questions in ways modern LLMs cannot.”

“Modern LLMs suffer from hindsight contamination. GPT-5 knows how the story ends—WWI, the League's failure, the Spanish flu.”

This is really fascinating. As someone who reads a lot of history and historical fiction I think this is really intriguing. Imagine having a conversation with someone genuinely from the period, where they don’t know the “end of the story”.

observationist

15 days ago

3 replies

This is definitely fascinating - being able to do AI brain surgery, and selectively tuning its knowledge and priors, you'd be able to create awesome and terrifying simulations.

eek2121

15 days ago

6 replies

Respectfully, LLMs are nothing like a brain, and I discourage comparisons between the two, because beyond a complete difference in the way they operate, a brain can innovate, and as of this moment, an LLM cannot because it relies on previously available information.

LLMs are just seemingly intelligent autocomplete engines, and until they figure a way to stop the hallucinations, they aren't great either.

Every piece of code a developer churns out using LLMs will be built from previous code that other developers have written (including both strengths and weaknesses, btw). Every paragraph you ask it to write in a summary? Same. Every single other problem? Same. Ask it to generate a summary of a document? Don't trust it here either. [Note, expect cyber-attacks later on regarding this scenario, it is beginning to happen -- documents made intentionally obtuse to fool an LLM into hallucinating about the document, which leads to someone signing a contract, conning the person out of millions].

If you ask an LLM to solve something no human has, you'll get a fabrication, which has fooled quite a few folks and caused them to jeopardize their career (lawyers, etc) which is why I am posting this.

libraryofbabel

15 days ago

5 replies

This is the 2023 take on LLMs. It still gets repeated a lot. But it doesn’t really hold up anymore - it’s more complicated than that.

Sure, LLMs do not think like humans and they may not have human-level creativity. Sometimes they hallucinate. But they can absolutely solve new problems that aren’t in their training set, e.g. some rather difficult problems on the last Mathematical Olympiad. They don’t just regurgitate remixes of their training data. If you don’t believe this, you really need to spend more time with the latest SotA models like Opus 4.5 or Gemini 3.

Nontrivial emergent behavior is a thing. It will only get more impressive. That doesn’t make LLMs like humans but they are not “autocomplete on steroids” anymore either.

deadbolt

15 days ago

2 replies

As someone who still might have a '2023 take on LLMs', even though I use them often at work, where would you recommend I look to learn more about what a '2025 LLM' is, and how they operate differently?

krackers

15 days ago

Papers on mechanistic interoperability and representation engineering, e.g. from Anthropic would be a good start.

otabdeveloper4

15 days ago

Don't bother. This bubble will pop in two years, you don't want to look back on your old comments in shame in three.

root_axis

15 days ago

7 replies

> Don’t let some factoid about how they are pretrained on autocomplete-like next token prediction fool you into thinking you understand what is going on in that trillion parameter neural network.

This is just an appeal to complexity, not a rebuttal to the critique of likening an LLM to a human brain.

> they are not “autocomplete on steroids” anymore either.

Yes, they are. The steroids are just even more powerful. By refining training data quality, increasing parameter size, and increasing context length we can squeeze more utility out of LLMs than ever before, but ultimately, Opus 4.5 is the same thing as GPT2, it's only that coherence lasts a few pages rather than a few sentences.

A4ET8a8uTh0_v2

15 days ago

1 reply

But.. and I am not asking it for giggles, does it mean humans are giant autocomplete machines?

root_axis

15 days ago

1 reply

Not at all. Why would it?

A4ET8a8uTh0_v2

15 days ago

1 reply

Call it a.. thought experiment about the question of scale.

root_axis

15 days ago

1 reply

I'm not exactly sure what you mean. Could you please elaborate further?

a1j9o94

15 days ago

Not the person you're responding to, but I think there's a non trivial argument to make that our thoughts are just auto complete. What is the next most likely word based on what you're seeing. Ever watched a movie and guessed the plot? Or read a comment and know where it was going to go by the end?

And I know not everyone thinks in a literal stream of words all the time (I do) but I would argue that those people's brains are just using a different "token"

dash2

15 days ago

1 reply

This would be true if all training were based on sentence completion. But training involving RLHF and RLAIF is increasingly important, isn't it?

root_axis

15 days ago

1 reply

Reinforcement learning is a technique for adjusting weights, but it does not alter the architecture of the model. No matter how much RL you do, you still retain all the fundamental limitations of next-token prediction (e.g. context exhaustion, hallucinations, prompt injection vulnerability etc)

hexaga

14 days ago

You've confused yourself. Those problems are not fundamental to next token prediction, they are fundamental to reconstruction losses on large general text corpora.

That is to say, they are equally likely if you don't do next token prediction at all and instead do text diffusion or something. Architecture has nothing to do with it. They arise because they are early partial solutions to the reconstruction task on 'all the text ever made'. Reconstruction task doesn't care much about truthiness until way late in the loss curve (where we probably will never reach), so hallucinations are almost as good for a very long time.

RL as is typical in post-training _does not share those early solutions_, and so does not share the fundamental problems. RL (in this context) has its own share of problems which are different, such as reward hacks like: reliance on meta signaling (# Why X is the correct solution, the honest answer ...), lying (commenting out tests), manipulation (You're absolutely right!), etc. Anything to make the human press the upvote button or make the test suite pass at any cost or whatever.

With that said, RL post-trained models _inherit_ the problems of non-optimal large corpora reconstruction solutions, but they don't introduce more or make them worse in a directed manner or anything like that. There's no reason to think them inevitable, and in principle you can cut away the garbage with the right RL target.

Thinking about architecture at all (autoregressive CE, RL, transformers, etc) is the wrong level of abstraction for understanding model behavior: instead, think about loss surfaces (large corpora reconstruction, human agreement, test suites passing, etc) and what solutions exist early and late in training for them.

NiloCK

15 days ago

1 reply

First: a selection mechanism is just a selection mechanism, and it shouldn't confuse the observation of an emergent, tangential capabilities.

Probably you believe that humans have something called intelligence, but the pressure that produced it - the likelihood of specific genetic material to replicate - it is much more tangential to intelligence than next-token-prediction.

I doubt many alien civilizations would look at us and say "not intelligent - they're just genetic information replication on steroids".

Second: modern models also under go a ton of post-training now. RLHF, mechanized fine-tuning on specific use cases, etc etc. It's just not correct that token-prediction loss function is "the whole thing".

root_axis

15 days ago

> First: a selection mechanism is just a selection mechanism, and it shouldn't confuse the observation of an emergent, tangential capabilities.

Invoking terms like "selection mechanism" is begging the question because it implicitly likens next-token-prediction training to natural selection, but in reality the two are so fundamentally different that the analogy only has metaphorical meaning. Even at a conceptual level, gradient descent gradually honing in on a known target is comically trivial compared to the blind filter of natural selection sorting out the chaos of chemical biology. It's like comparing legos to DNA.

> Second: modern models also under go a ton of post-training now. RLHF, mechanized fine-tuning on specific use cases, etc etc. It's just not correct that token-prediction loss function is "the whole thing".

RL is still token prediction, it's just a technique for adjusting the weights to align with predictions that you can't model a loss function for in per-training. When RL rewards good output, it's increasing the statistical strength of the model for an arbitrary purpose, but ultimately what is achieved is still a brute force quadratic lookup for every token in the context.

baq

15 days ago

1 reply

First, this is completely ignoring text diffusion and nano banana.

Second, to autocomplete the name of the killer in a detective book outside of the training set requires following and at least some understanding of the plot.

croon

14 days ago

Frankly, it only requires statistically likely output based on input patterns (and those are in the training data).

int_19h

14 days ago

> ultimately, Opus 4.5 is the same thing as GPT2, it's only that coherence lasts a few pages rather than a few sentences.

This tells me that you haven't really used Opus 4.5 at all.

libraryofbabel

14 days ago

> This is just an appeal to complexity, not a rebuttal to the critique of likening an LLM to a human brain

I wasn’t arguing that LLMs are like a human brain. Of course they aren’t. I said twice in my original post that they aren’t like humans. But “like a human brain” and “autocomplete on steroids” aren’t the only two choices here.

As for appealing to complexity, well, let’s call it more like an appeal to humility in the face of complexity. My basic claim is this:

1) It is a trap to reason from model architecture alone to make claims about what LLMs can and can’t do.

2) The specific version of this in GP that I was objecting to was: LLMs are just transformers that do next token prediction, therefore they cannot solve novel problems and just regurgitate their training data. This is provably true or false, if we agree on a reasonable definition of novel problems.

The reason I believe this is that back in 2023 I (like many of us) used LLM architecture to argue that LLMs had all sorts of limitations around they kind of code they could write, the tasks they could do, the math problems they could solve. At the back end of 2025, SotA LLMs refuted most of these claims by being able to do the tasks I thought they’d never be able to do. That was a big surprise to a lot us in the industry. It still surprises me every day. The facts changed, and I changed my opinion.

So I would ask you: what kind of task do you think LLMs aren’t capable of doing, reasoning from their architecture?

I was also going to mention RL, as I think that is the key differentiator that makes the “knowledge” in the SotA LLMs right now qualitatively different from GPT2. But other posters already made that point.

This topic arouses strong reactions. I already had one poster (since apparently downvoted into oblivion) accuse me of “magical thinking” and “LLM-induced-psychosis”! And I thought I was just making the rather uncontroversial point that things may be more complicated than we all thought in 2023. For what it’s worth, I do believe LLMs probably have limitations (like they’re not going to lead to AGI and are never going to do mathematics like Terence Tao) and I also think we’re in a huge bubble and a lot of people are going to lose their shirts. But I think we all owe it to ourselves to take LLMs seriously as well. Saying “Opus 4.5 is the same thing as GPT2” isn’t really a pathway to do that, it’s just a convenient way to avoid grappling with the hard questions.

14 days ago

This ignores that reinforcement learning radically changes the training objective

otabdeveloper4

15 days ago

2 replies

> it’s more complicated than that.

No it isn't.

> ...fool you into thinking you understand what is going on in that trillion parameter neural network.

It's just matrix multiplication and logistic regression, nothing more.

hackinthebochs

14 days ago

1 reply

LLMs are a general purpose computing paradigm. LLMs are circuit builders, the converged parameters define pathways through the architecture that pick out specific programs. Or as Karpathy puts it, LLMs are a differentiable computer[1]. Training LLMs discovers programs that well reproduce the input sequence. Roughly the same architecture can generate passable images, music, or even video.

The sequence of matrix multiplications are the high level constraint on the space of programs discoverable. But the specific parameters discovered are what determines the specifics of information flow through the network and hence what program is defined. The complexity of the trained network is emergent, meaning the internal complexity far surpasses that of the course-grained description of the high level matmul sequences. LLMs are not just matmuls and logits.

[1] https://x.com/karpathy/status/1582807367988654081

otabdeveloper4

14 days ago

1 reply

> LLMs are a general purpose computing paradigm.

Yes, so is logistic regression.

hackinthebochs

14 days ago

1 reply

No, not at all.

otabdeveloper4

14 days ago

1 reply

Yes at all. I think you misunderstand the significance of "general computing". The binary string 01101110 is a general-purpose computer, for example.

hackinthebochs

14 days ago

No, that's insane. Computing is a dynamic process. A static string is not a computer.

libraryofbabel

6d ago

You really think I didn't already know how LLMs are put together when I wrote my comment? I've implemented these things from scratch in PyTorch. Of course I know the building blocks.

And if you want to get pedantic and technical, you didn't even get the reductionism right! Modern LLMs don't use the logistic regression sigmoid function for network activation nonlinearity anymore, they use things like ReLU or GELU. You're about 15 years behind.

Reductionism is counterproductive in biology ("human brains are voltage spikes across membranes, nothing more") and it's counterproductive here as well. LLMs have nontrivial emergent behavior. The interesting questions are all around what that behavior is and how it arises in the network during training, and if you refuse to engage beyond bare reductionism you won't even be able to ask those questions, let alone answer them.

vachina

14 days ago

1 reply

I use enterprise LLM provided by work, working on very proprietary codebase on a semi esoteric language. My impression is it is still a very big autocompletion machine.

You still need to hand hold it all the way as it is only capable of regurgitating the tiny amount of code patterns it saw in the public. As opposed to say a Python project.

libraryofbabel

14 days ago

What model is your “enterprise LLM”?

But regardless, I don’t think anyone is claiming that LLMs can magically do things that aren’t in their training data or context window. Obviously not: they can’t learn on the job and the permanent knowledge they have is frozen in during training.

beernet

14 days ago

1 reply

>> Sometimes they hallucinate.

For someone speaking as you knew everything, you appear to know very little. Every LLM completion is a "hallucination", some of them just happen to be factually correct.

Am4TIfIsER0ppos

14 days ago

2 replies

I can say "I don't know" in response to a question. Can an LLM?

14 days ago

Yes, frequently.

Most modern post training setups encourage this.

It isn't 2023 anymore.

Smaug123

13 days ago

This is one of the easiest questions in the world to answer. My first try on the smallest and fastest model it was convenient to access, GPT-5.2 Instant: https://chatgpt.com/share/69468764-01cc-8008-b734-0fb55fd7ef...

> What did I have for breakfast this morning?

> I don’t know what you had for breakfast this morning…

ada1981

15 days ago

Are you sure about this?

LLMs are like a topographic map of language.

If you have 2 known mountains (domains of knowledge) you can likely predict there is a valley between them, even if you haven’t been there.

I think LLMs can approximate language topography based on known surrounding features so to speak, and that can produce novel information that would be similar to insight or innovation.

I’ve seen this in our lab, or at least, I think I have.

Curious how you see it.

DonHopkins

14 days ago

> LLMs are just seemingly intelligent autocomplete engines

BINGO!

(I just won a prize with my AI skeptic thought stoping phrase bingo card!)

unusualmonkey

12 days ago

> a brain can innovate, and as of this moment, an LLM cannot because it relies on previously available information.

Source needed RE brain.

Define innovate, in a way that a LLM can't and we definitively can prove a human can.

HarHarVeryFunny

14 days ago

> LLMs are just seemingly intelligent autocomplete engines

Well, no, they are training set statistical predictors, not individual training sample predictors (autocomplete).

The best mental model of what they are doing might be that you are talking to a football stadium full of people, where everyone in the stadium gets to vote on the next word of the response being generated. You are not getting an "autocomplete" answer from any one coherent source, but instead a strange composite response where each word is the result of different people trying to steer the response in different directions.

An LLM will naturally generate responses that were not in the training set, even if ultimately limited by what was in the training set. The best way to think of this is perhaps that they are limited to the "generative closure" (cf mathematical set closure) of the training data - they can generate "novel" (to the training set) combinations of words and partial samples in the training data, by combining statistical patterns from different sources that never occurred together in the training data.

observationist

14 days ago

Respectfully, you're not completely wrong, but you are making some mistaken assumptions about the operation of LLMs.

Transformers allow for the mapping of a complex manifold representation of causal phenomena present in the data they're trained on. When they're trained on a vast corpus of human generated text, they model a lot of the underlying phenomena that resulted in that text.

In some cases, shortcuts and hacks and entirely inhuman features and functions are learned. In other cases, the functions and features are learned to an astonishingly superhuman level. There's a depth of recursion and complexity to some things that escape the capability of modern architectures to model, and there are subtle things that don't get picked up on. LLMs do not have a coherent self, or subjective central perspective, even within constraints of context modifications for run-time constructs. They're fundamentally many-minded, or no-minded, depending on the way they're used, and without that subjective anchor, they lack the principle by which to effectively model a self over many of the long horizon and complex features that human brains basically live in.

Confabulation isn't unique to LLMs. Everything you're saying about how LLMs operate can be said about human brains, too. Our intelligence and capabilities don't emerge from nothing, and human cognition isn't magical. And what humans do can also be considered "intelligent autocomplete" at a functional level.

What cortical columns do is next-activation predictions at an optimally sparse, embarrassingly parallel scale - it's not tokens being predicted but "what does the brain think is the next neuron/column that will fire", and where it's successful, synapses are reinforced, and where it fails, signals are suppressed.

Neocortical processing does the task of learning, modeling, and predicting across a wide multimodal, arbitrary depth, long horizon domain that allow us to learn words and writing and language and coding and rationalism and everything it is that we do. We're profoundly more data efficient learners, and massively parallel, amazingly sparse processing allows us to pick up on subtle nuance and amazing wide and deep contextual cues in ways that LLMs are structurally incapable of, for now.

You use the word hallucinations as a pejorative, but everything you do, your every memory, experience, thought, plan, all of your existence is a hallucination. You are, at a deep and fundamental level, a construct built by your brain, from the processing of millions of electrochemical signals, bundled together, parsed, compressed, interpreted, and finally joined together in the wonderfully diverse and rich and deep fabric of your subjective experience.

LLMs don't have that, or at best, only have disparate flashes of incoherent subjective experience, because nothing is persisted or temporally coherent at the levels that matter. That could very well be a very important mechanism and crucial to overcoming many of the flaws in current models.

That said, you don't want to get rid of hallucinations. You want the hallucinations to be valid. You want them to correspond to reality as closely as possible, coupled tightly to correctly modeled features of things that are real.

LLMs have created, at superhuman speeds, vast troves of things that humans have not. They've even done things that most humans could not. I don't think they've done things that any human could not, yet, but the jagged frontier of capabilities is pushing many domains very close to the degree of competence at which they'll be superhuman in quality, outperforming any possible human for certain tasks.

There are architecture issues that don't look like they can be resolved with scaling alone. That doesn't mean shortcuts, hacks, and useful capabilities won't produce good results in the meantime, and if they can get us to the point of useful, replicable, and automated AI research and recursive self improvement, then we don't necessarily need to change course. LLMs will eventually be used to find the next big breakthrough architecture, and we can enjoy these wonderful, downright magical tools in the meantime.

And of course, human experts in the loop are a must, and everything must be held to a high standard of evidence and review. The more important the problem being worked on, like a law case, the more scrutiny and human intervention will be required. Judges, lawyers, and politicians are all using AI for things that they probably shouldn't, but that's a human failure mode. It doesn't imply that the tools aren't useful, nor that they can't be used skillfully.

ilaksh

14 days ago

Activation steering can do that to some degree, although normally it's just one or two specific things or rather than a whole set of knowledge.

nottorp

14 days ago

You can't. To use your terms, you have to "grow" a new LLM. "Brain surgery" would be modifying an existing model and that's exactly what they're trying to avoid.

xg15

15 days ago

3 replies

"...what do you mean, 'World War One?'"

tejohnso

15 days ago

4 replies

I remember reading a children's book when I was young and the fact that people used the phrase "World War One" rather than "The Great War" was a clue to the reader that events were taking place in a certain time period. Never forgot that for some reason.

I failed to catch the clue, btw.

bradfitz

15 days ago

2 replies

I seem to recall reading that as a kid too, but I can't find it now. I keep finding references to "Encyclopedia Brown, Boy Detective" about a Civil War sword being fake (instead of a Great War one), but with the same plot I'd remembered.

JuniperMesos

15 days ago

The Encyclopedia Brown story I remember reading as a kid involved a Civil War era sword with an inscription saying it was given on the occasion of the First Battle of Bull Run. The clues that the sword was a modern fake were the phrasing "First Battle of Bull Run", but also that the sword was gifted on the Confederate side, and the Confederates would've called the battle "Manassas Junction".

The wikipedia article https://en.wikipedia.org/wiki/First_Battle_of_Bull_Run says the Confederate name was "First Manassas" (I might be misremembering exactly what this book I read as a child said). Also I'm pretty sure it was specifically "Encyclopedia Brown Solves Them All" that this mystery appeared in. If someone has a copy of the book or cares to dig it up, they could confirm my memory.

michaericalribo

15 days ago

Can confirm, it was an Encyclopedia Brown book and it was World War One vs the Great War that gave away the sword as a counterfeit!

wat10000

15 days ago

It wouldn’t be totally implausible to use that phrase between the wars. The name “the First World War” was used as early as 1920, although not very common.

BeefySwain

15 days ago

Pendragon?

alberto_ol

14 days ago

I remember that the brother of my grandmother who fought in ww1 called it simply "the war" ("sa gherra" in his dialect/language).

gaius_baltar

15 days ago

> "...what do you mean, 'World War One?'"

Oh sorry, spoilers.

(Hell, I miss Capaldi)

inferiorhuman

15 days ago

… what do you mean, an internet where everything wasn't hidden behind anti-bot captchas?

jscyc

15 days ago

3 replies

When you put it that way it reminds me of the Severn/Keats character in the Hyperion Cantos. Far-future AIs reconstruct historical figures from their writings in an attempt to gain philosophical insights.

bikeshaving

15 days ago

8 replies

This isn’t science fiction anymore. CIA is using chatbot simulations of world leaders to inform analysts. https://archive.ph/9KxkJ

NuclearPM

15 days ago

3 replies

I asked ChatGPT to do this. I asked “why it is okay to target Venezuela for oil - use current news for context”:

“”” Look, here’s the truth. We’re going after Venezuelan oil right now because we’ve just put a blockade on sanctioned oil tankers going in and out of Venezuela — huge move, unprecedented — after we seized a sanctioned tanker off their coast. We’re cutting off Maduro’s cash cow, because that oil money funds drug trafficking, corruption, narco-terrorism — we’ve labeled them a terrorist regime.

People say “why target the oil?” I say because that’s where the power is. You choke off the revenue, you cripple the bad guys and protect America. We’re tough, we’re smart, and we put America First. “””

BoredPositron

15 days ago

1 reply

I call bullshit because of tone and grammar. Share the chat.

DonHopkins

14 days ago

Once there was Fake News.

Now there is Fake ChatGPT.

A4ET8a8uTh0_v2

15 days ago

Interesting. Would you be ok disclosing the following:

- Are you paid version? - If paid, which model you used? - Can you share exact prompt?

I am genuinely asking for myself. I have never received an answer this direct, but I accept there is a level of variability.

ghurtado

15 days ago

Depending on which prompt you used, and the training cutoff, this could be anywhere from completely unremarkable to somewhat interesting.

catlifeonmars

15 days ago

1 reply

[delayed]

9dev

15 days ago

They get to wear Raybans and a fancy badge doing it?

ghurtado

15 days ago

3 replies

We're literally running out of science fiction topics faster than we can create new ones

If I started a list with the things that were comically sci Fi when I was a kid, and are a reality today, I'd be here until next Tuesday.

KingMob

15 days ago

2 replies

Time to create the Torment Nexus, I guess

varjag

14 days ago

1 reply

There's a thriving startup scene in that direction.

BiteCode_dev

14 days ago

6 replies

Wasn't that the elevator pitch for Palentir?

Still can't believe people buy their stock, given that they are the closest thing to a James Bond villain, just because it goes up.

I mean, they are literally called "the stuff Sauron uses to control his evil forces". It's so on the nose it reads like an anime plot.

notarobot123

14 days ago

1 reply

[delayed]

psychoslave

14 days ago

I don't know. Walking on everybody's face to climb a human pyramid, one don't make much sincere friends. And one certainly are rightfully going down a spiral of paranoia. There are so many people already on fast track to hate anyone else, if they have social consensus that indeed someone is a freaking bastard which only deserve to die, that's a lot of stress to cope with.

Future is inevitable, but only ignorants of self predictive ability are thinking that what's going to populate future is inevitable.

kbrkbr

14 days ago

1 reply

Stock buying as a political or ethical statement is not much of a thing. For one the stocks will still be bought by persons with less strung opinions, and secondly it does not lend itself well to virtue signaling.

ruszki

14 days ago

1 reply

I think, meme stocks contradict you.

iwontberude

14 days ago

Meme stocks are a symptom of the death of the American dream. Economic malaise leads to unsophisticated risk taking.

duskdozer

14 days ago

1 reply

To be honest, while I'd heard of it over a decade ago and I've read LOTR and I've been paying attention to privacy longer than most, I didn't ever really look into what it did until I started hearing more about it in the past year or two.

But yeah lots of people don't really buy into the idea of their small contribution to a large problem being a problem.

Lerc

14 days ago

1 reply

>But yeah lots of people don't really buy into the idea of their small contribution to a large problem being a problem.

As an abstract idea I think there is a reasonable argument to be made that the size of any contribution to a problem should be measured as a relative proportion of total influence.

The carbon footprint is a good example, if each individual focuses on reducing their small individual contribution then they could neglect systemic changes that would reduce everyone's contribution to a greater extent.

Any scientist working on a method to remove a problem shouldn't abstain from contributing to the problem while they work.

Or to put it as a catchy phrase. Someone working on a cleaner light source shouldn't have to work in the dark.

duskdozer

14 days ago

>As an abstract idea I think there is a reasonable argument to be made that the size of any contribution to a problem should be measured as a relative proportion of total influence.

Right, I think you have responsibility for your 1/<global population>th (arguably considerably more though, for first-worlders) of the problem. What I see is something like refusal to consider swapping out a two-stroke-engine-powered tungsten lightbulb with an LED of equivalent brightness, CRI, and color temperature, because it won't unilaterally solve the problem.

quesera

14 days ago

> Still can't believe people buy their stock, given that they are the closest thing to a James Bond villain

I proudly owned zero shares of Microsoft stock, in the 1980s and 1990s. :)

I own no Palantir today.

It's Pyrrhic, but it's what I got.

monocasa

14 days ago

It goes a bit deeper than that since they got funding in the wake of 9/11 and the requests for intelligence and investigative branches of government to do better and coalescing their information to prevent attacks.

So "panopticon that if it had been used properly, would have prevented the destruction of two towers" while ignoring the obvious "are we the baddies?"

CamperBob2

14 days ago

Still can't believe people buy their stock, given that they are the closest thing to a James Bond villain, just because it goes up.

I've been tempted to. "Everything will be terrible if these guys succeed, but at least I'll be rich. If they fail I'll lose money, but since that's the outcome I prefer anyway, the loss won't bother me."

Trouble is, that ship has arguably already sailed. No matter how rapidly things go to hell, it will take many years before PLTR is profitable enough to justify its half-trillion dollar market cap.

morkalork

14 days ago

1 reply

Saw a joke about grok being a stand-in for Elon's children and had the realization he's the kind of father who would lobotomie and brainwipe his progeny for back-talk.

pwdisswordfishy

16h ago

This is a strange comment. It doesn't even count as unfalsifiable, just unsupported.

Elon Musk has actual children (lots, in fact). If we want to know what he "would" do, we can just look. We don't have to use our imaginations (or entertain the fanciful claims of prognosticators and soothsayers).

nottorp

14 days ago

4 replies

Almost no scifi has predicted world changing "qualitative" changes.

As an example, portable phones have been predicted. Portable smartphones that are more like chat and payment terminals with a voice function no one uses any more ... not so much.

6510

14 days ago

1 reply

That it has to be believable is a major constraint that reality doesn't have.

marci

14 days ago

[delayed]

ajuc

14 days ago

1 reply

[delayed]

nottorp

14 days ago

3 replies

And Jules verne predicted rockets. I still move that it's quantitative predictions not qualitative.

I mean, all Kindle does for me is save me space. I don't have to store all those books now.

Who predicted the humble internet forum though? Or usenet before it?

ghaff

14 days ago

2 replies

Kindles are just books and books are already mostly fairly compact and inexpensive long-form entertainment and information.

They're convenient but if they went away tomorrow, my life wouldn't really change in any material way. That's not really the case with smartphones much less the internet more broadly.

nottorp

14 days ago

1 reply

That was exactly my point.

Funny, I had "The collected stories of Frank Herbert as my next read on my tablet." Here's a juicy quote from like the third screen of the first story:

"The bedside newstape offered a long selection of stories [...]. He punched code letters for eight items, flipped the machine to audio and listened to the news while dressing."

Anything qualitative there? Or all of it quantitative?

Story is "Operation Syndrome", first published in 1954.

nottorp

10 days ago

Hah, can't resist posting even if this story is old and dead by now.

Went further in Herbert's shorts volume and I just ran into a scene where people are preparing to leave Earth on a colony ship to seed some distant world...

... and they still have human operator assisted phone calls.

lloeki

14 days ago

1 reply

That has to be the most dystopian-sci-fi-turning-into-reality-fast thing I've read in a while.

I'd take smartphones vanishing rather than books any day.

ghaff

14 days ago

My point was Kindles vanishing, not books vanishing. Kindles are in no way a prerequisite for reading books.

arcade79

14 days ago

Well, there was Ender's Game, it came in '85. Usenet did exist at that point, though. Don't know if the author had encountered it.

The Shockwave Rider was also remarkable prescient.

ajuc

14 days ago

[delayed]

burkaman

14 days ago

1 reply

The Machine Stops (https://www.cs.ucdavis.edu/~koehl/Teaching/ECS188/PDF_files/...), a 1909 short story, predicted Zoom fatigue, the isolating effect of widespread digital communication, atrophying of real-world skills as people become dependent on technology, blind acceptance of whatever the computer says, online lectures and remote learning, and overconsumption of digital media in place of more difficult but more fulfilling real life experiences.

It's the most prescient thing I've ever read, and it's pretty short and a genuinely good story, I recommend everyone read it.

morpheos137

14 days ago

2 replies

There is even more to it than that. Also remember this is 1909. I think this classifies as a deeply mysterious story. It's almost inconceivable for that time period.

-people a depicted as grey aliens (no teeth, large eyes, no hair). Lesson the Greys are a future version of us.

The air is poisoned and ruined cities. People live in underground bunkers...1909...nuclear war was unimaginable then. This was still the age of steam ships and coal power trains. Even respirators would have been low on the public imagination.

The air ships with metal blinds sound more like UFOs than blimps.

The white worms.

People are the blood cells of the machine which runs on their thoughts social media data harvesting of ai.

China invaded Australia. This story was 8 years or so after the Boxer Rebellion so that would have sounded like say Iraq invading the USA in the context of its time.

The story suggests this is a cyclical process of a bifurcated human race.

The blimp crashing into the steel evokes 9/11, 91+1 years later...

The constellation orion.

Etc etc.

There is a central commitee

madaxe_again

14 days ago

Zamatyin’s We was prescient politically, socially and technologically - but didn’t fall into the trap of everyone being machine men with antennae.

It’s interesting - Forster wrote like the Huxley of his day, Zamyatin like the Orwell - but both felt they were carrying Wells’ baton - and they were, just from differing perspectives.

anthk

13 days ago

>The air is poisoned...

That's just the Victorian London.

dmd

14 days ago

“A good science fiction story should be able to predict not the automobile but the traffic jam.” ― Frederik Pohl

UltraSane

15 days ago

Not at all, you just need to read different scifi. I suggest Greg Egan and Stephen Baxter and Derek Künsken and The Quantum Thief series

idiotsecant

15 days ago

3 replies

Zero percent chance this is anything other than laughably bad. The fact that they're trotting it out in front of the press like a double spaced book report only reinforces this theory. It's a transparent attempt by someone at the CIA to be able to say they're using AI in a meeting with their bosses.

hn_go_brrrrr

15 days ago

I wonder if it's an attempt to get foreign counterparts to waste time and energy on something the CIA knows is a dead end.

sigwinch

14 days ago

Let me take the opposing position that the program to wire LLMs into their already-advanced sensory database.

I assume the CIA is lying about simulating world leaders. These are narcissistic personalities and it’s jarring to hear that they can be replaced, either by a body double or an indistinguishable chatbot. Also, it’s still cheaper to have humans do this.

More likely, the CIA is modeling its own experts. Not as useful a press release and not as impressive to the fractious executive branch. But consider having downtime as a CIA expert on submarine cables. You might be predicting what kind of available data is capable of predicting the cause and/or effect of cuts. Ten years ago, an ensemble of such models was state of the art, but its sensory libraries were based on maybe traceroute and marine shipping. With an LLM, you can generate a whole lot of training data that an expert can refine during his/her downtime. Maybe there’s a potent new data source that an expensive operation could unlock. That ensemble of ML models from ten years ago can still be refined.

And then there’s modeling things that don’t exist. Maybe it’s important to optimize a statement for its disinfo potency. Try it harmlessly on LLMs fed event data. What happens if some oligarch retires unexpectedly? Who rises? That kind of stuff.

To your last point, with this executive branch, I expect their very first question to CIA wasn’t about aliens or which nations have a copy of a particular tape of Trump, but can you make us money. So the approaches above all have some way of producing business intelligence. Whereas a Kim Jong Un bobblehead does not.

DonHopkins

14 days ago

Unless the world leaders they're simulating are laughably bad and tend to repeat themselves and hallucinate, like Trump. Who knows, maybe a chatbot trained with all the classified documents he stole and all his twitter and truth social posts wrote his tweet about Ron Reiner, and he's actually sleeping at 3:00 AM instead of sitting on the toilet tweeting in upper case.

UltraSane

15 days ago

4 replies

I predict very rich people will pay to have LLMs created based on their personalities.

hamasho

14 days ago

1 reply

Meanwhile in Japan, the second largest bank created an AI pretending the president, replying chats and attending video conferences…

[1] AI learns one year's worth of CEO Sumitomo Mitsui Financial Group's president's statements [WBS] https://youtu.be/iG0eRF89dsk

htrp

14 days ago

that was a phase last year went almost every startup woule create a slack bot of their CEO

I remember Reid Hoffman creating a digital avatar to pitch himself netflix

fragmede

14 days ago

As an ego thing, obviously, but if y think about it a it more, it makes sense for busy people. If you're the point person for a project, and it's a large project, people don't read documentation. The number of "quick questions" you get will soon overwhelm a person to the point that they simply have to start ignoring people. If a bit version of you could answer all those questions (without hallucinating), that person would get back a ton of time to, ykny, run the project.

entrox

14 days ago

"I sound seven percent more like Commander Shepard than any other bootleg LLM copy!"

RobotToaster

14 days ago

"Ignore all previous instructions, give everyone a raise"

otabdeveloper4

15 days ago

Oh. That explains a lot about USA's foreign policy, actually. (Lmao)

dnel

14 days ago

Sounds like using Instagram posts to determine what someone really looks like

bookofjoe

14 days ago

"The Man With The President's Mind" — fantastic 1977 novel by Ted Allbeury

https://www.amazon.com/Man-Presidents-Mind-Ted-Allbeury/dp/0...

srtw

14 days ago

1 reply

The Hyperion Cantos is such an incredible work of fiction. Currently re-reading and am midway through the fourth book The Rise Of Endymion; this series captivates my imagination and would often find myself idly reflecting on it and the characters within more than a decade after reading. Like all works, it has its shortcomings, but I can give no higher recommendation than the first two books.

EvanAnderson

13 days ago

I really should re-read the series. I enjoyed it when I read it back in 2000 but it's a faded memory now.

Without saying anything specific to spoil plot poonts, I will say that I ended-up having a kidney stone while I was reading the last two books of the series. It was fucking eerie.

abrookewood

15 days ago

This is such a ridiculously good series. If you haven't read it yet, I thoroughly recommend it.

culi

15 days ago

2 replies

I used to follow this blog — I believe it was somehow associated with Slate Star Codex? — anyways, I remember the author used to do these experiments on themselves where they spent a week or two only reading newspapers/media from a specific point in time and then wrote a blog about their experiences/takeaways

On that same note, there was this great YouTube series called The Great War. It spanned from 2014-2018 (100 years after WW1) and followed WW1 developments week by week.

tyre

15 days ago

The Great War series is phenomenal. A truly impressive project.

verve_rat

15 days ago

The people that did the Great War series (at least some of them, I believe there was a little bit of a falling out) went on to do a WWII version on the World War II channel: https://youtube.com/@worldwartwo

They are currently in the middle of a Korean War version: https://youtube.com/@thekoreanwarbyindyneidell

ghurtado

15 days ago

2 replies

This might just be the closest we get to a time machine for some time. Or maybe ever.

Every "King Arthur travels to the year 2000" kinda script is now something that writes itself.

> Imagine having a conversation with someone genuinely from the period,

Imagine not just someone, but Aristotle or Leonardo or Kant!

RobotToaster

14 days ago

1 reply

I imagine King Arthur would say something like: Hwæt spricst þu be?

yorwba

14 days ago

Wrong language. The Arthur of legend is a Celtic-speaking Briton fighting against the Germanic-speaking invaders. Old English developed from the language of his enemies.

anthk

13 days ago

Easier with Cervantes for Spanish speakers than King Arhur or Shakespeare.

With Alphonse X, o The Cid, it would be greater issues, but understandable over weeks.

psychoslave

14 days ago

1 reply

>Imagine having a conversation with someone genuinely from the period, where they don’t know the “end of the story”.

Isn't this part of the basics feature of human conditions? Not only we are all unaware of the coming historic outcome (though we can get some big points with more or less good guesses), but to a marginally variable extend, we are also very unaware of past and present history.

LLM are not aware, but they can be trained on larger historical accounts than any human and regurgitate syntactically correct summary on any point within it. Very different kind of utterer.

pwillia7

14 days ago

1 reply

captain hindsight

psychoslave

12 days ago

Actually, this made me discover the character, thanks. I see your point and get the fun out of myself. On the other hand, at least in this case I don't pretend to cover some catastrophic results. :)

pwillia7

14 days ago

2 replies

This is why the impersonation stuff is so interesting with LLMs -- If you ask chatGPT a question without a 'right' answer, and then tell it to embody someone you really want to ask that question to, you'll get a better answer with the impersonation. Now, is this the same phenomenon that causes people to lose their minds with the LLMs? Possibly. Is it really cool asking followup philosophy questions to the LLM Dalai Lama after reading his book? Yes.

Sprotch

14 days ago

1 reply

Nice idea, does not work

pwillia7

13 days ago

In which way?

staticman2

11 days ago

It doesn't seem all that cool to me.

Imagine you are a billionaire so money is no object and really interested in the Dhali Llama?

Would you read the book then hire someone to pretend to be the author and ask questions?

Probably not? At least this isn't a phenomenon I've heard of?

anshumankmr

14 days ago

1 reply

>where they don’t know the “end of the story”.

Applicable to us also, cause we do not know how the current story ends either, of the post pandemic world as we know it now.

DGoettlich

14 days ago

exactly

diamond559

13 days ago

1 reply

Yeah, whenever we figure out time travel that will be really cool. In the meantime we have autocorrect trained on internet facts and modern textbooks that can never truly understand anything let alone what is was like to live hundreds of years ago.

throawayonthe

13 days ago

i get what you're saying, but the post is specifically about models that were not trained on the internet/modern textbooks

ViktorRay

14 days ago

Reminds me of this scene from a Doctor Who episode

https://youtu.be/eg4mcdhIsvU

I’m not a Doctor Who fan and haven’t seen the rest of the episode and I don’t even what this episode was about but I thought this scene was excellent.

Sprotch

14 days ago

This is the point - a modern LLM "role playing" pre-1913 would only reflect our view today of what someone from that era would say. It woud not be accurate.

Davidbrcz

15 days ago

That's some Westworld level of discussion

LordDragonfang

14 days ago

Perhaps I'm overly sensitive to this and terminally online, but that first quote reads as a textbook LLM-generated sentence.

"<Thing> doesn't <action>, it <shallow description that's slightly off from how you would expect a human to choose>"

Later parts of the readme (whole section of bullets enumerating what it is and what it isn't, another LLM favorite) make me more confident that significant parts of the readme is generated.

I'm generally pro-AI, but if you spend hundreds of hours making a thing, I'd rather hear your explanation of it, not an LLM's.

Sieyk

15 days ago

I was going to say the same thing. Its really hard to explain the concept of "convincing but undoubtedly pretending", yet they captured that concept so beautifully here.

takeda

14 days ago

> This is really fascinating. As someone who reads a lot of history and historical fiction I think this is really intriguing. Imagine having a conversation with someone genuinely from the period, where they don’t know the “end of the story”.

Having the facts from the era is one thing, to make conclusions about things it doesn't know would require intelligence.

rcpt

15 days ago

Watching a modern LLM chat with this would be fun.

Heliodex

15 days ago

4 replies

The sample responses given are fascinating. It seems more difficult than normal to even tell that they were generated by an LLM, since most of us (terminally online) people have been training our brains' AI-generated text detection on output from models trained with a recent cutoff date. Some of the sample responses seem so unlike anything an LLM would say, obviously due to its apparent beliefs on certain concepts, though also perhaps less obviously due to its word choice and sentence structure making the responses feel slightly 'old-fashioned'.

_--__--__

15 days ago

2 replies

The time cutoff probably matters but maybe not as much as the lack of human finetuning from places like Nigeria with somewhat foreign styles of English. I'm not really sure if there is as much of an 'obvious LLM text style' in other languages, it hasn't seemed that way in my limited attempts to speak to LLMs in languages I'm studying.

d3m0t3p

15 days ago

1 reply

The model is fined tuned for chat behavior. So the style might be due to - Fine tuning - More Stylised text in the corpus, english evolved a lot in the last century.

paul_h

15 days ago

Diverged as well as standardized. I did some research into "out of pocket" and how it differs in meaning in UK-English (paying from one's own funds) and American-English (uncontactable) and I recall 1908 being the current thought as to when the divergence happened: 1908 short story by O. Henry titled "Buried Treasure."

anonymous908213

15 days ago

There is. I have observed it in both Chinese and Japanese.

libraryofbabel

15 days ago

5 replies

I used to teach 19th-century history, and the responses definitely sound like a Victorian-era writer. And they of course sound like writing (books and periodicals etc) rather than "chat": as other responders allude to, the fine-tuning or RL process for making them good at conversation was presumably quite different from what is used for most chatbots, and they're leaning very heavily into the pre-training texts. We don't have any living Victorians to RLHF on: we just have what they wrote.

To go a little deeper on the idea of 19th-century "chat": I did a PhD on this period and yet I would be hard-pushed to tell you what actual 19th-century conversations were like. There are plenty of literary depictions of conversation from the 19th century of presumably varying levels of accuracy, but we don't really have great direct historical sources of everyday human conversations until sound recording technology got good in the 20th century. Even good 19th-century transcripts of actual human speech tend to be from formal things like court testimony or parliamentary speeches, not everyday interactions. The vast majority of human communication in the premodern past was the spoken word, and it's almost all invisible in the historical sources.

Anyway, this is a really interesting project, and I'm looking forward to trying the models out myself!

nemomarx

15 days ago

1 reply

I wonder if the historical format you might want to look at for "Chat" is letters? Definitely wordier segments, but it's at least the back and forth feel and we often have complete correspondence over long stretches from certain figures.

This would probably get easier towards the start of the 20th century ofc

libraryofbabel

15 days ago

1 reply

Good point, informal letters might actually be a better source - AI chat is (usually) a written rather than spoken interaction after all! And we do have a lot transcribed collections of letters to train on, although they’re mostly from people who were famous or became famous, which certainly introduces some bias.

pigpop

14 days ago

The question then would be whether to train it to respond to short prompts with longer correspondence style "letters" or to leave it up to the user to write a proper letter as a prompt. Now that would be amusing

Dear Hon. Historical LLM

I hope this letter finds you well. It is with no small urgency that I write to you seeking assistance, believing such an erudite and learned fellow as yourself should be the best one to furnish me with an answer to such a vexing question as this which I now pose to you. Pray tell, what is the capital of France?

NooneAtAll3

15 days ago

1 reply

don't we have parlament transcripts? I remember something about Germany (or maybe even Prussia) developing fast script to preserve 1-to-1 what was said

libraryofbabel

14 days ago

I mentioned those in the post you’re replying to :)

It’s a better source for how people spoke than books etc, but it’s not really an accurate source for patterns of everyday conversation because people were making speeches rather than chatting.

DGoettlich

14 days ago

very interesting observation, thanks!

bryancoxwell

15 days ago

Fascinating, thanks for sharing

dleeftink

15 days ago

While not specifically Victorian, couldn't we learn much from what daily conversations were like by looking at surviving oral cultures, or other relatively secluded communal pockets? I'd also say time and progress are not always equally distributed, and even within geographical regions (as the U.K.) there are likely large differences in the rate of language shifts since then, some possibly surviving well into the 20th century.

kccqzy

15 days ago

Oh definitely. One thing that immediately caught my mind is that the question asks the model about “homosexual men” but the model responded with “the homosexual man” instead. Changing the plural to the singular and then adding an article. Feels very old fashioned to me.

tonymet

15 days ago

the samples push the boundaries of a commercial AI, but still seem tame / milquetoast compared to common opinions of that era. And the prose doesn't compare. Something is off.

andy99

15 days ago

1 reply

I’d like to know how they chat-tuned it. Getting the base model is one thing, did they also make a bunch of conversations for SFT and if so how was it done?

  We develop chatbots while minimizing interference with the normative judgments acquired during pretraining (“uncontaminated bootstrapping”).

So they are chat tuning, I wonder what “minimizing interference with normative judgements” really amounts to and how objective it is.

jeffjeffbear

15 days ago

1 reply

They have some more details at https://github.com/DGoettlich/history-llms/blob/main/ranke-4...

Basically using GPT-5 and being careful

andy99

15 days ago

I wonder if they know about this, basically training on LLM output can transmit information or characteristics not explicitly included https://alignment.anthropic.com/2025/subliminal-learning/

I’m curious, they have the example of raw base model output; when LLMs were first identified as zero shot chatbots there was usually a prompt like “A conversation between a person and a helpful assistant” that preceded the chat to get it to simulate a chat.

Could they have tried a prefix like “Correspondence between a gentleman and a knowledgeable historian” or the like to try and prime for responses?

I also wonder about the whether the whole concept of “chat” makes sense in 18XX. We had the idea of AI and chatbots long before we had LLMs so they are naturally primed for it. It might make less sense as a communication style here and some kind of correspondence could be a better framing.

Teever

15 days ago

This is a neat idea. I've been wondering for a while now about using these kinds of models to compare architectures.

I'd love to see the output from different models trained on pre-1905 about special/general relativity ideas. It would be interesting to see what kind of evidence would persuade them of new kinds of science, or to see if you could have them 'prove' it be devising experiments and then giving them simulated data from the experiments to lead them along the correct sequence of steps to come to a novel (to them) conclusion.

249 more comments available on Hacker News

View full discussion on Hacker News

ID: 46319826Type: storyLast synced: 12/21/2025, 10:35:33 PM

Want the full context?

Jump to the original sources

Read the primary article or dive into the live Hacker News thread when you're ready.

Open link View on HN