When Models Manipulate Manifolds: the Geometry of a Counting Task
Posted2 months agoActive2 months ago
transformer-circuits.pubSciencestory
calmmixed
Debate
60/100
LlmsNlpMachine Learning
Key topics
Llms
Nlp
Machine Learning
The post explores how LLMs perform a linebreaking task in fixed-width text, sparking discussion on the relevance of studying already algorithmically solved tasks and the terminology used to describe LLM analysis.
Snapshot generated from the HN discussion
Discussion Activity
Active discussionFirst comment
5d
Peak period
12
Day 6
Avg / period
4.3
Comment distribution17 data points
Loading chart...
Based on 17 loaded comments
Key moments
- 01Story posted
Oct 29, 2025 at 1:50 AM EDT
2 months ago
Step 01 - 02First comment
Nov 3, 2025 at 3:45 AM EST
5d after posting
Step 02 - 03Peak activity
12 comments in Day 6
Hottest window of the conversation
Step 03 - 04Latest activity
Nov 7, 2025 at 9:15 AM EST
2 months ago
Step 04
Generating AI Summary...
Analyzing up to 500 comments to identify key contributors and discussion patterns
ID: 45743105Type: storyLast synced: 11/20/2025, 8:32:40 PM
Want the full context?
Jump to the original sources
Read the primary article or dive into the live Hacker News thread when you're ready.
I wonder why they focused specifically on a task that is already solved algorithmically. The paper does not seem to address this, and the references do not include any mentions of non-LLM approaches to the line-breaking problem.
It makes it tedious to figure out what they actually did (which sounds interesting) when it's couched in such terms and presented in such an LLMified style.
like the difference between Unicode code-points and UTF-8 bytes, you can't just count UTF-8 bytes to know how many code-points you have
The point is to see how LLMs implement algorithms internally, starting with this simple easily understood algorithm.
The biology metaphor they make is interesting, because I think a biologist would be the first to tell you that you need more than one datapoint.
There is no biology here, and there are so many other words that describe perfectly what they are doing here, without twisting the meaning of another word.
In the same way, a weather forecast model using a lot of complicated differential equations is not biological. A finite element model analyzing some complicated electromagnetic field, or the aerodynamics of a car is not biological. Just because someone around 70-75 years ago called them 'perceptrons' or 'neurons' instead of thingamajigs does not make them biology.
https://www.youtube.com/watch?v=Y65FRxE7uMc