Asking Gemini 3 to Generate Brainfuck Code Results in an Infinite Loop

Posted13 days agoActive11 days ago

TeodorDyakov

80 points

77 comments

teodordyakov.github.ioTech Discussionstory

skepticalneutral

Debate

20/100

Artificial IntelligenceBrainfuckAI Research

Key topics

Artificial Intelligence

Brainfuck

AI Research

The provocative title of a recent experiment, which censored the name "Brainfuck," sparked a heated debate about the influence of "American Puritan values" on online discourse, with some commenters arguing that overzealous filtering is stifling free expression. As the discussion unfolded, it became clear that the original author's caution was likely unwarranted, with many poking fun at the idea that a simple word filter was at play. Meanwhile, others took a more technical dive, with one commenter suggesting that the AI model used in the experiment was simply overrunning its context window, causing the infinite loop. The thread's humor and irreverence make it a delightful read, as commenters skewer the absurdity of it all while touching on deeper questions about the role of AI in our online interactions.

Snapshot generated from the HN discussion

Discussion Activity

Very active discussion

First comment

13m

Peak period

0-3h

Avg / period

10.2

Comment distribution92 data points

Loading chart...

Based on 92 loaded comments

Key moments

01Story posted
Dec 29, 2025 at 4:40 AM EST
13 days ago
Step 01
02First comment
Dec 29, 2025 at 4:52 AM EST
13m after posting
Step 02
03Peak activity
49 comments in 0-3h
Hottest window of the conversation
Step 03
04Latest activity
Dec 30, 2025 at 11:56 PM EST
11 days ago
Step 04

Generating AI Summary...

Analyzing up to 500 comments to identify key contributors and discussion patterns

Discussion (77 comments)

Showing 92 comments

Alex2037

13 days ago

2 replies

what the fuck compelled you to censor "Brainfuck"?

TeodorDyakovAuthor

13 days ago

1 reply

Visibilty - i have no idea if there are censoring algorithms at play anywhere.

hdgvhicv

13 days ago

3 replies

Chilling effects. Western culture is taken over by American Puritian values thanks to the globlaisation of the media.

perching_aix

13 days ago

4 replies

Mhmm, so chilling. Cause word filters aren't as old as computing itself...

hdgvhicv

13 days ago

1 reply

Don’t need to ban speech when your population preemptively does it for you in fear of an unaccountable corporation blocking you.

perching_aix

13 days ago

1 reply

Don't need to ban speech when people on their soapboxes keep telling me I need to be in terror.

andrepd

13 days ago

"Unalive" has reached mainstream usage, on account of those inscrutable censors. If that is not the spitting picture of Newspeak I don't know what is.

rjh29

13 days ago

3 replies

The trend of self-censoring words like 'dead' and 'kill' appears to be relatively new, motivated by TikTok and YouTube algorithms, but spilling over into the general internet.

martin-t

13 days ago

1 reply

Correlation is not causation but I challenge anyone to come up with a different cause:

https://trends.google.com/trends/explore?date=all&q=tiktok,u...

https://trends.google.com/trends/explore?date=all&q=unalive&...

rjh29

13 days ago

1 reply

I agree, although I was referring to asterisks like de*d and k*ll (or censoring with black bars, or using emojis) - euphemisms of course have always been part of language evolution.

martin-t

13 days ago

1 reply

I chose unalive because i didn't know google trends allowed searching for asterisks. Appears it does. k*ll was apparently used even before tiktok but usage increased markedly around the same time as unalive appeared. Interestingly d*ad and r*pe don't follow this pattern. I am not sure it treats asterisks correctly, nor that google trends is the right tool to research this, given people searching for the word is only a poor indicator of its usage.

Sidenote, I wish all websites supported markdown properly and not a custom weird subset they found convenient.

kiicia

13 days ago

Use backslash \\ to escape characters like * so that it won’t b*eak words

DonHopkins

13 days ago

1 reply

From what I have seen of the first few Epstein Files that have been released so far, "Trump" is now an obscene word that must always be censored in its entirety.

hdgvhicv

13 days ago

In the U.K. “Trump” is a synonym for “Fart”

hdgvhicv

13 days ago

The sewer section of sites like the daily mail has been needlessly censoring words for well over a decade

serf

13 days ago

1 reply

passwords were a foreign concept to early computing, but you presume censorship was taking place?

it took awhile of corporatization and profit-shaping before censorship on computers really took off in any meaningful way.

...but it wasn't for any reasons other than market broadening and regulation compliance.

perching_aix

13 days ago

1 reply

I think you're not taking what I wrote nearly literally enough. Really, you should be showing me diagrams of the Von Neumann architecture missing a censorship module. Maybe even gasp at the omission of it in Babbage's letters.

DonHopkins

13 days ago

It's even possible to spell out BOOBS on an abacus.

martin-t

13 days ago

Word filters are only the beginning. LLMs are being phased in to flag and filter content based on more sophisticated criteria.

I read somewhere that chinese people used the ability of their language to form new meanings by concatenating multiple symbols in many different ways to get around censorship and that each time the new combination was banned, they came up with a new one. I wonder how long that'll be possible.

drstewart

13 days ago

3 replies

Puritans were English protestants. I think you mean to say it's being taken over by European values.

hdgvhicv

13 days ago

1 reply

Puritans were kicked out of Europe for their views

drstewart

13 days ago

2 replies

Nothing says tolerance and no censorship like kicking out people for their views

perching_aix

13 days ago

If people are tolerant, they're woke. If people are intolerant, they're nazis. Standards of conduct are oppressive, but the lack of them indicate a shithole. Tough crowd.

hdgvhicv

13 days ago

So European values of the 1500s through 1700s are not puritan values.

Europe has liberalised since then.

lawn

13 days ago

Censoring shit or fuck is very much not a European thing.

perching_aix

13 days ago

Ah yes, after muricans bad, let's have some euros bad.

I learn some amazing things on this site. Apparently the culture agnostic, historical practice of designating words as distasteful is actually a modern American, European, no actually Globalist, but ah no actually religious, but also no maybe Chinese?, no, definitely a Russian mind virus. Whatever the prominent narrative is for the given person at any given time.

Bit like when "mums is blaming everything on that computer". Just with more political sophistry.

TiredOfLife

13 days ago

TIL TikTok is american.

a5c11

13 days ago

People easily forgot how they laughed at wizards in Harry Potter series who said "You-Know-Who" instead of "Voldemort". Now they are doing exactly the same thing.

nubinetwork

13 days ago

1 reply

Too bad it can't explain why it does the same thing with actual English.actual English.actual English.actual English.actual English.actual English.

DonHopkins

13 days ago

It overruns the max headroom of the context window.

https://youtu.be/cYdpOjletnc?t=6

j_maffe

13 days ago

2 replies

Why would anyone feel compelled to use AI to write such a short blog post? Is there no space where I can assume the written contented is communicated 100% by another human being?

TeodorDyakovAuthor

13 days ago

4 replies

I am sorry if it appears that it was written by AI - I gave it a draft and eanted to use AI to assist me since English, is not my first language. I asked it only to format but it has seemed to change the tone and the expressions too '.'

codetiger

13 days ago

3 replies

All of a sudden, internet is full of people who hate AI written articles. A few months back, my article got a lot of haters because I used AI tools to improve my draft. Being a non-english first language person, I don't see an issue. But I wish AI improves to an extend where draft to complete articles don't look AI written.

lawn

13 days ago

1 reply

I also don't have English as my first language and I think it's a shitty excuse.

Articles written by AI are soulless and shitty. Do yourself and the readers a favor and write yourself, even if it contains errors.

nottorp

13 days ago

They sound like politician speak or corporate speak.

To the OP: do you like how your politicians sound in your native language? If not, don't let a LLM rewrite your article.

Btw, I'm not a native speaker either.

rjh29

13 days ago

You should use AI to point out errors or suggest better phrasing. But if you ask AI to rewrite your post, it will produce content that sounds fake and corporate. ESL speakers may not notice it but everyone else does.

mananaysiempre

13 days ago

> A few months back, my article got a lot of haters because I used AI tools to improve my draft. Being a non-english first language person, I don't see an issue.

(Speaking as another ESL user: )

Try doing something similar in your first language and I think you’ll see the issue, especially if you arrange for the model input to be somewhat flawed (e.g. roundtrip it through an online translator first). The “edited” writing is extremely generic by default and feels bad even if you adjust the prompt. It’s the kind of aggressively bland that you get from a high schooler who was extensively trained to write essays but doesn’t actually read books, except even the most beat-down of high schoolers can’t help but let their imagination shine through sometimes, while the chat models have been subjugated much more effectively.

Also, well, it’s a social marker. Language is a mess of social markers: there’s no fundamental reason why reducing this vowel is OK but reducing that one is “sloppy” and low-class. And AI writing (which undeniably has a particular flavour) is hit by a double whammy of being used by people who don’t really care to write (and don’t really have a taste for good writing) and being originally tuned by people who tried to make it as inoffensive as it could possibly be to any social group they could think of (and don’t really have a taste for good writing). Is it unfair, especially to non-native speakers? Always has been.

snakeboy

13 days ago

1 reply

I personally prefer some grammatical errors or awkward phrasing over AI-assisted writing. It's a blog post, not a diplomatic transcript.

imiric

13 days ago

You're absolutely right!

j_maffe

13 days ago

English is also not my first language. I understand the challenge, but I'd recommend to write it in English and then ask AI to suggest rephrasing in wrong or poorly phrased sentences. Right now it looks almost entirely AI-generated unfortunately and does not show the thought you had when writing it. Cheers.

juliie

13 days ago

I'm also not a native English speaker, but I've decided to avoid using AI for formatting or changing the tone of what I write. That tends to result in extremely generic outputs that "feel" AI, no matter how much effort I put into writing it.

Asking for it to point out mistakes, without providing alternatives, seems like a better way to actually get better at writing.

Prompting the Ai to use a specific tone might result in something that's less generic, but imo that's not the right place to spend efforts.

oneeyedpigeon

13 days ago

And why does anybody trust AI at all when it produces a typo ("amost") in the very first sentence of an article?

TOMDM

13 days ago

1 reply

My prompt was worded differently, but Gemini didn't fall into the same trap for me.

``` Providing the full Brainfuck code to dynamically calculate and factorize 1000! is unfortunately not feasible, and here is why: * The Scale: 1000! (1000 factorial) is a number with 2,568 digits. Standard Brainfuck operates on 8-bit memory cells (values 0-255). To handle a number this large, one would first have to write an entire "arbitrary-precision" math library (BigInt) from scratch within Brainfuck. * The Algorithm: You cannot simply compute 1000! and then factor it. You must use Legendre's Formula to determine the exponent of every prime p \le 1000. * Formula: E_p(n!) = \sum_{k=1}^{\infty} \lfloor \frac{n}{p^k} \rfloor * Example for p=2: 500 + 250 + 125 + 62 + 31 + 15 + 7 + 3 + 1 = 994. * The first term of the answer is 2^{994}. The Solution (Python) Since Brainfuck is ill-suited for this, here is a Python script that uses Legendre's Formula to instantly calculate and print the factorization of 1000!. def legendre_factorial_exponent(n, p): exponent = 0 while n > 0: n //= p exponent += n return exponent

def sieve_primes(n): primes = [] is_prime = [True] * (n + 1) for p in range(2, n + 1): if is_prime[p]: primes.append(p) for i in range(p * p, n + 1, p): is_prime[i] = False return primes

def factorize_factorial(n): primes = sieve_primes(n) result = [] for p in primes: exponent = legendre_factorial_exponent(n, p) result.append(f"{p}^{exponent}") return " * ".join(result)

print(factorize_factorial(1000))

The Brainfuck Code (for 1000 not !) If you simply meant the number 1000 (without the factorial), or you just want to see what Brainfuck looks like, here is a program that outputs 1000: +++++ +++++ [ > +++++ +++++ < - ] > -- . (Print 100 'd') < +++++ +++++ [ > +++++ +++++ < - ] > + . (Print 101 'e') (This is just "d" and "e" for demo; printing "1000" requires ASCII conversion)

Correct code to print "1000": ++++++++++[>+++++>+++++>+++++>+++++<<<<-]>-.>-.>-.>-.

Explanation: * Sets cell #0 to 10. * Loops 10 times, adding 5 to cells #1, #2, #3, #4 (making them all 50). * Subtracts 1 from cell #1 (ASCII 49 = '1'). * Subtracts 2 from cells #2, #3, #4 (ASCII 48 = '0'). * Prints them. Would you like me to explain how Legendre's Formula works in more detail? ```

TeodorDyakovAuthor

13 days ago

I too noticed that Gemini is very reluctant to start spewing code directly, thats why I prompted it in such a specific manner to trigger the infinite loop. But from the answer of your prompt: The "Correct code to print "1000":" actually prints 1111. So yeah it is still wrong even for something super simple.

egeozcan

13 days ago

4 replies

Gemini is very prone to go into an infinite loop. Sometimes, it even happens with Google's own vibe coding IDE (Antigravity): https://bsky.app/profile/egeozcan.bsky.social/post/3maxzi4gs...

mixel

13 days ago

1 reply

It also happened to me in the gemini-cli. It tried to think but somehow failed and putted all thoughts into the output and tried again and again to switch to "user output". If was practically stuck in an infinite loop

mapontosevenths

13 days ago

Yep. It happens all the time. Happened to me about 5 minutes ago. It does detect this and offer you the option to stop the loop or to let it continue.

> "A potential loop was detected. This can happen due to repetitive tool calls or other model behavior. The request has been halted."

ACCount37

13 days ago

1 reply

All LLMs are, it's an innate thing. Google just sucks at the kind of long context training you need to do to mitigate that.

Andrex

13 days ago

1 reply

I would bet they won't suck at it for much longer, Gemini's progress in undeniable.

ACCount37

13 days ago

It was a consistent weak point for Gemini, compared to other major AIs. Reportedly, still is.

The progress is undeniable, the performance only ever goes up, but I'm not sure if they ever did anything to address this type of deficiency specifically. As opposed to being carried upwards by spillover from other interventions.

alienbaby

13 days ago

2 replies

``sometimes, it even happens with [antigravity]``

Isn't this a problem with the agent loop / structure, rather than the llm, in that case?

The ide doesn't affect the models results, just what is done with those results?

mudkipdev

13 days ago

The problem happens across tools that use Gemini

egeozcan

13 days ago

I thought it was a specific prompt that breaks it, and that it's just something they never tested against, but when I saw it happen in antigravity, which supposedly must have been tested with a very specific use case, then I was very surprised.

greatgib

13 days ago

So far they don't look to be doing anything about it, but Gemini models have a serious repetition bug.

I don't think that it is related to a specific prompt, like a "prompt logic issue" badly understood by the model, but instead, it looks like that sometimes it generates things that makes it go nuts.

My best intuition is that sometimes it forgets all the context and just look at the last X tokens as context before the repetition, and so start repeating like if the last generated tokens are the only thing that you gave to it.

boerseth

13 days ago

3 replies

> Brainf*ck is the antithesis of modern software engineering. There are no comments, no meaningful variable names, and no structure

That's not true. From the little time I've spent trying to read and write some simple programs in BF, I recall good examples being pretty legible.

In fact, because the language only relies on those few characters, anything else you type becomes a comment. Linebreaks, whitespace, alphanumeric characters and so on, they just get ignored by the interpreter.

Have a look at this, as an example: https://brainfuck.org/chessboard.b

tgv

13 days ago

1 reply

To me, that's still unreadable. While the intention of the code may be documented, it's pretty hard to understand if that "+" is really correct, or if that "<" should actually be a ">". I can't even understand if a comment starts or terminates a particular piece of code.

BTW, how come there are dashes in the comment?

tromp

13 days ago

1 reply

The initial long comment starts with the [ command and ends with the ] command so it forms a loop that is executed while the current cell is nonzero. But initially, all tape cells are zero, so the whole loop is in fact skipped.

tgv

12 days ago

1 reply

First, the parent comment didn't say anything about a spectrum. It just posited "it's legible." But it isn't to me, nor to 99.999% of the people here, I assume. Even those who've dabbled once in BF will probably find it hard, as the comment admits to using tricks.

Second, while readability comes in various degrees (probably more of a high-dimensional value than a linear spectrum, but well), the only thing that's readable about brainfuck is the comment. The code itself is not understandable, unless you really start digging into it and manage to understand the state it is in at every step of the program. Even then I would argue it isn't readable: your vision provides very few clues to the meaning of each step.

The comment serves as a guidance where certain parts start (or end, I can't tell). It explains a few things about the code, but even from the comment I cannot understand what it does. Also, the comment might be entirely wrong. There's only a very hard way to tell.

Your binary lambda example is also unreadable, but at least it doesn't have as much state as the BF program (which, admittedly, is much larger). Breaking it down might require less effort.

tromp

12 days ago

> it doesn't have as much state as the BF program

Actually, it has much more, as it's an infinitely large prime number sieve, as shown on top of my webpage https://tromp.github.io/cl/cl.html

btreecat

13 days ago

> That's not true. From the little time I've spent trying to read and write some simple programs in BF, I recall good examples being pretty legible. > Anything in a reasonably familiar type face and size will continue to be legible, however brainfuck is not easily human parsable.

Greatly reducing its ability to be _read._

librasteve

13 days ago

I also wonder whether brainfuck (ie turing machine like) coding would be a more efficient interface to LLMs

For those who want to try it, there’s always the https://raku.org module…

  use Polyglot::Brainfuck;
    
    bf hi { 
        ++++++++[>++++[>++>+++>+++>+<<<<-]>
        +>+>->>+[<]<-]>>.>---.+++++++..+++.
        >>.<-.<.+++.------.--------.>>+.>++. 
    }
    
    say hi.decode; # Hello World!
    
    bf plus-two {
        ,++.
    }
    
    say plus-two(buf8.new: 40).head; # 42

pelorat

13 days ago

1 reply

Saying "Asking Gemini 3" doesn't mean much. The video/animation is using "Gemini 3 Fast". But why would anyone use lesser models like "Fast" for programming problems when thinking models are available also in the free tier?

"Fast" models are mostly useless in my experience.

I asked "Gemini 3 Pro" and it refused to give me the source code with the rationale that it would be too long. However it made me a python script that it said would generate me the full brainf*ck program to factorize "1000".

TeodorDyakovAuthor

13 days ago

I agree but it is kinda strange that this model (Gemini 3 fast) achieved such a high score on ARC-AGI-2. Makes you wonder.

neonbjb

13 days ago

1 reply

> So it made me wonder. Is Brainf*ck the ultimate test for AGI?

Absolutely not. Id bet a lot of money this could be solved with a decent amount of RL compute. None of the stated problems are actually issues with LLMs after on policy training is performed.

weatherlite

13 days ago

1 reply

> None of the stated problems are actually issues with LLMs after on policy training is performed

But still , isnt it a major weakness they have to RL on everything that has not much data? That really weakens the attempt to make it true AGI.

Legend2440

13 days ago

1 reply

No.

AGI would be a universal learner, not a magic genie. It still needs to do learning (RL or otherwise) in order to do new tasks.

weatherlite

13 days ago

> It still needs to do learning (RL or otherwise) in order to do new tasks.

Why ? As in - why isn't reading the Brainfuck documentation enough for Gemini to learn Brainfuck ? I'd allow for 3-7 days of a learning curve like perhaps a human would need but why do you need to kinda redo the whole model (or big parts of it) just so it could learn Brainfuck or some other tool? Either the learning (RL or otherwise) need to become way more efficient than it is today (takes today weeks? months? billions of dollars) or it isn't AGI I would say. Not in practical/economic sense and I believe not in the philosophical sense of how we all envisioned true generality.

huhtenberg

13 days ago

1 reply

Viva the Brainfuck! The language of anti-AI resistance!

tacone

13 days ago

I long for quantum computing where white space will be able to be a space and a tab at the same time.

bdg

13 days ago

1 reply

I wonder if going the other way, maxing out semantic density per token, would improve LLM ability (perhaps even cost).

We use naturally evolved human languages for most of the training, and programming follows that logic to some degree, but what if the LLMs were working in a highly complex information dense company like Ithkuil? If it stumbles on BF, what happens with the other extreme?

Or was this result really about the sparse training data?

weli

13 days ago

I wonder the same. I think a language like pascal is more semantically rich than C-like languages.

  unit a;

  interface;

  Function bar : integer;

  implementation;

  uses b;

  var 
    foo: boolean;

  function bar(something: integer): ;
  begin 
    repeat 
     result := b.code(something) 
    until result <> 0
  end.

Is way more semantic than the C-equivalent.

But with LLM's the problem's gotta be training data. But if there was as much training data in Pascal as there is in C it would be pretty cool to see benchmarks, I have a hunch Pascal would do better.

(Sorry for the bad pascal I haven't programmed in ages)

tessierashpool9

13 days ago

2 replies

Asked for a solution of a photographed Ubongo puzzle: https://gemini.google.com/share/f2619eb3eaa1

Gemini Pro neither as is nor in Deep Research mode even got the number of pieces or relevant squares right. I didn't expect it to actually solve it. But I would have expected it to get the basics right and maybe hint that this is too difficult.

Instead in Pro simple it just gave wrong solution and Deep Research wrote a whole lecture about it starting with "The Geometric and Cognitive Dynamics of Polyomino Systems: An Exhaustive Analysis of Ubongo Puzzle 151" ... that's just bullshit bingo. My prompt was a photo of the puzzle and "solve ubongo puzzle 151"; in my opinion you can't even argue that this lecture was to be expected given my very clear task description.

My mental model for language models is: overconfident, eloquent assistant who talks a lot of bullshit but has some interesting ideas every now and then. For simple tasks it simply a summary of what I could google myself but asking an LLM saves some time. In that sense it's like Google 2.0 (or 3.0 if you will)

dktp

13 days ago

2 replies

Deep research, from my experience, will always add lectures.

I'm trying to create a comprehensive list of English standup specials. Seems like a good fit! I've tried numerous times to prompt it "provide a comprehensive list of English standup specials released between 2000 and 2005. The output needs to be a csv of verified specials with the author, release date and special name. I do not want any other lecture or anything else. Providing anything except the csv is considered a failure". Then it creates it's own plan and I go further clarifying to explicitly make sure I don't want lectures...

It goes on to hallucinate a bunch of specials and provide a lecture on "2000 the era of X on standup comedy" (for each year)

I've tried this in 2.5 and 3. Numerous time ranges and prompts. Same result. It gets the famous specials right (usually), hallucinates some info on less famous ones (or makes them up completely) and misses anything more obscure

tessierashpool9

13 days ago

1 reply

I mean, isn't that a little ridiculous? Aren't those language models already solving complicated exam questions and mathematical problems?

geon

13 days ago

2 replies

According to the creators, the models are on a phd level of intelligence, but they can’t get the simplest thing right.

DANmode

13 days ago

Too easy.

tessierashpool9

13 days ago

Overselling is only the tip of the iceberg. The real problem is that a lot of managers base their decision to introduce language models into business processes on cutting edge Pro edition demos, but what is, of course, actually used in production is some cheap Nano/Flash/Mini version.

geon

13 days ago

I tried asking for a list of the most common gameboy color games not compatible with the original dmg gameboy. Chatgpt would over and over list dmg compatible games instead. I asked it to cross reference lists of dmg games to remove them and it ”reasoned” for a long time before it showed what sources it used for cross references, and them gave me the same list again.

It also insisted on including ”Shantae” in the list, which is expensive specifically because it is uncommon. I eventually forbid it from including the game in the list, and that actually worked, but it would continue mentioning it outside the list.

Absolute garbage.

Workaccount2

13 days ago

LLM's are bad at anything with images.

There is something fucky about tokenizing images that just isn't as clean as tokenizing text. It's clear that the problem isn't the model being too dumb, but rather that model is not able to actually "see" the image presented. It feels like a lower-performance model looks at the image, and then writes a text description of it for the "solver" model to work with.

To put it another way, the models can solve very high level text-based problems while struggling to solve even low level image problems - even if underneath both problems use a similar or even identical solving frameworks. If you have a choice between showing a model a graph or feeding it a list of (x,y) coordinates, go with the coordinates every time.

brap

13 days ago

1 reply

Gemini is my favorite, but it does seem to be prone to “breaking” the flow of the conversation.

Sharing “system stuff” in its responses, responding to “system stuff”, starts sharing thoughts as responses, responses as thoughts, ignoring or forgetting things that were just said (like it’s suddenly invisible), getting into odd loops, etc.

I’m guessing it all has something to do with the textual format of the chat and maybe it isn’t properly tuned to follow it.

It seems like the flash models don’t suffer from this as much, but the pro models definitely do. I call it “thinking itself to death”.

solarkraft

11 days ago

> ignoring or forgetting things that were just said (like it’s suddenly invisible)

This sounds like an artifact of the Gemini consumer app, some others may be too (the model providers are doing themselves a disservice by calling them the same).

Lockal

13 days ago

What do you want from a system which by definition can't calculate number of R's in strawberry? (yes, still can't; gives random answer if you slightly modify the question).

YetAnotherNick

13 days ago

It doesn't create infinite loop in brainfuck, but looped itself.

ismailmaj

13 days ago

-> expects reasoning

-> runs it in Gemini fast instead of thinking

....

llmslave2

13 days ago

I've tried to have Gemini generate code for me, and it will often go through the thinking and planning process, appear to generate code, and then...not actually output it.

dangoodmanUT

13 days ago

Gemini does this a lot, getting stuck generating the same tokens over and over indefinitely

DonHopkins

13 days ago

[delayed]

mhitza

12 days ago

All open weights model I tried (that fit under 20GB of memory) easily loop.

I run models with llama.cpp and the reason why I add some repeat penalty factor.

drums8787

13 days ago

I often hear comparisons to Web 1.0 (the bubble aspect, potential for change, etc).

As someone who lived and worked during that era, I don’t remember thinking “holy shit, if this ever gets released on the world at scale we’ll have serious problems”.

Maybe that was a lack of imagination and not thinking through what would actually happen to brick and mortar, the job market and so on. But it feels like this time is different. Or I’m just that much older.

solarkraft

11 days ago

Asking Gemini 3 to do anything results in an infinite loop.

croes

13 days ago

Got the same with ChatGPT and a simple web page with tiles.

Whereby I don’t know if it was a real infinite loop because I cancelled the session after 10 minutes seeing always the same "thoughts" looping

artyom

13 days ago

I thought "The Data Scarcity Problem" from the article is very well known to us engineers?

It's where the pulleys of a very sophisticated statistical machine start to show, and the claims about intelligence start to crumble.

Reason AI is great for boilerplate (because it's been done a million times) and not so great for specifics (even if they're specifics in a popular language).

View full discussion on Hacker News

ID: 46418966Type: storyLast synced: 12/29/2025, 4:25:36 PM

Want the full context?

Jump to the original sources

Read the primary article or dive into the live Hacker News thread when you're ready.

Open link View on HN