Implementing Forth in Go and C

Posted4 months agoActive4 months ago

Bogdanp

162 points

28 comments

eli.thegreenplace.netProgrammingstory

informativeneutral

Debate

10/100

ForthGoProgramming Languages

Key topics

Forth

Programming Languages

The eternal quest to implement Forth in various programming languages has sparked a lively discussion, with one developer sharing their experiences implementing Forth in both Go and C. Commenters chimed in with their own Forth implementations, including a clever use of continuation passing style in C++ using Clang's `musttail` attribute. A humorous exchange ensued, with multiple developers playfully suggesting alternative titles, such as "Go forth and C" and "C go forth." Amidst the lighthearted banter, insightful observations were shared, including a likely mistranslation of "Fakultät" (factorial) to "faculty" in one of the examples.

Snapshot generated from the HN discussion

Discussion Activity

Moderate engagement

First comment

37m

Peak period

0-3h

Avg / period

3.1

Comment distribution28 data points

Loading chart...

Based on 28 loaded comments

Key moments

01Story posted
Aug 27, 2025 at 9:19 AM EDT
4 months ago
Step 01
02First comment
Aug 27, 2025 at 9:56 AM EDT
37m after posting
Step 02
03Peak activity
6 comments in 0-3h
Hottest window of the conversation
Step 03
04Latest activity
Aug 28, 2025 at 8:08 PM EDT
4 months ago
Step 04

Generating AI Summary...

Analyzing up to 500 comments to identify key contributors and discussion patterns

Discussion (28 comments)

Showing 28 comments

s-macke

4 months ago

2 replies

Nice to see another developer who tried to implement Forth in Go (here’s mine: [0]). It’s not easy and usually takes several iterations.

His implementation, while working for his use cases, is actually quite far from the originals. He uses separate Go structures for words, memory, for-loops, etc. Most of the default Forth implementations of words will fail, since they expect the heap and stack to behave in a certain way. Also, every major word is implemented directly in Go, with no bootstrapping.

> However, it's insufficient for the hacker level, because the host language interpreter (the one in Go) has all the control, so it's impossible to implement IF...THEN in Forth

Well, I did it with a ... heap []any ... data structure. Works well enough.

[0] https://github.com/s-macke/Forthly

n4r9

4 months ago

2 replies

One of your "Click to run" examples is called "faculty calculation". I'm not a forth expert but it looks like a factorial function. Is "faculty" a replacement for "factorial" in some contexts?

chielk

4 months ago

German, Dutch, Danish, Norwegian, and Swedish use the same word for faculty (university) and factorial (mathematics), so I'm guessing it's a mistranslation from Fakultät/faculteit/fakultet.

s-macke

4 months ago

Thanks. Fixed it.

eliben

4 months ago

Thanks for the note. Much of the blog post is dealing with this - how the approach I took for the Go implementation is not sufficient for the hacker level - because the code isn't stored anywhere but remains as text. This isn't a Go issue, it's a result of the specific design decision of how to implement it.

The second implementation - in C - does it the "right way", with both code and data living in the same memory, and thus allows all the usual Forth shenanigans. Again, this has all to do with the design approach and not the implementation language. Nothing precludes one from writing such an implementation in Go :-)

NoToP

4 months ago

3 replies

Go forth and C

kbelder

4 months ago

C forth go?

willmartian

4 months ago

Noooo, I wanted to post this

andsoitis

4 months ago

Too Pythonic.

Alt: c go forth

johnisgood

4 months ago

1 reply

I feel like as if the article itself is not finished at the end:

> How can you know the arity of the functions without adding explicit comments? Sure, if you have a handful of words like bar and foo you know like the back of your hand, this is easy. But imagine reading a large, unfamiliar code base full of code like this and trying to comprehend it.

Cool project by the way! Well done.

eliben

4 months ago

1 reply

Thanks for the feedback! What do you feel is not finished?

johnisgood

4 months ago

1 reply

I think (unless I missed it) it could be mentioned that typically you are supposed to have as small words as possible, with least amount of stack shuffling, and comment code as much as possible (stack effects). That said, I have not had to maintain large Forth projects so I have no clue how well I would fare ^^.

BTW, have you heard of https://factorcode.org? You might like it!

eliben

4 months ago

1 reply

Yes, Factor is mentioned in the first paragraph of the post!

johnisgood

4 months ago

Oh I completely forgot by the time we left the comments. Embarrassing. In my defense, I always mention Factor. :D

anyfoo

4 months ago

2 replies

I must say, implementing Forth in almost any high-level language, even C, can be awkward and almost antithetical. A "high-level" Forth implementation often merely presents a Forth-compatible runtime, which runs the risk of missing what makes Forth so unique and useful. (The author actually seems to understand this: "The first implementation I tried is stubbornly different. Can we just make a pure interpreter?")

That's because of two main reasons. First, Forth relies heavily on passing control to the next word, instead of calling word implementations as subroutines. This is difficult to do correctly, and dynamically, in C. I suppose you could get by with computed gotos, but...

Secondly, Forth and assembly (or more generally any lower-level instruction/register view of what it's running on) go hand in hand. Forth words may carry their own assembly implementation within them, or they may be partially assembly.

Without those two things, you don't really have a Forth system, just a Forth interpreter. And you cannot define new primitive words, or words with very custom compilation semantics, from within Forth itself.

And therein also lies Forth's strength. It is very easily on-the-fly expandable, and it can even bootstrap itself. It is a great tool for machine-level exploration: Do complex low-level things with a lot less boilerplate.

The author mentions that this second attempt is a "hacker level" implementation, but short of reading the source code does not seem to go into any detail what that means exactly, and how it's implemented?

johnecheck

4 months ago

1 reply

Ah, but which assembly? If you just roll a simple VM in the host language, your instruction set becomes the assembly Forth integrates so well with. I'm assuming that's what they did in their C implementation.

anyfoo

4 months ago

1 reply

True!

I don't think that's what happening, though. At a quick glance, the primitive words seem to be directly implemented in C: https://github.com/eliben/goforth/blob/3949fe0cc52131fdcae0e...

So it looks like this one kind of sits in between. It has primitive words implemented in C, but a dictionary accessible by Forth. The interpreter loop however seems to be implemented in C as well, not Forth.

johnecheck

4 months ago

I remember when I realized just how little host language code was needed to bootstrap a Forth. Before that I was making the same mistake.

OP, I strongly recommend the book Threaded Interpreted Languages [1]. I had to read chapter 2 several times before it clicked (short and dense), but it's an excellent resource.

[1] https://archive.org/details/R.G.LoeligerThreadedInterpretive...

pjmlp

4 months ago

I agree, Lisp and Forth share this beauty of needing only a handfull of Assembly written primitives, and then the whole world can be bootstraped out of them.

AdieuToLogic

4 months ago

2 replies

Speaking of Forth... Here's a fun bit of trivia.

PowerPC and Intel Macs used to use Open Firmware[0] to define their firmware (BIOS). Macs with M1[1] architectures and subsequent generations may also, but I cannot attest to this.

For those Macs which do use Open Firmware[0], a key implementation component of it is FCode[2]:

  FCode is a Forth dialect compliant to ANS Forth, that is 
  available in two different forms: source and bytecode. 
  FCode bytecode is the compiled form of FCode source.

So if anyone asks, "who uses Forth anymore?", a correct answer is - a large percentage of everyday people.

0 - https://www.openbios.org/Open_Firmware.html

1 - https://en.wikipedia.org/wiki/Apple_M1

2 - https://www.openbios.org/Forth_FCode.html

LargoLasskhyfv

4 months ago

1 reply

There is no OpenFirmware on Intel Macs. They used EFI, which isn't related to any sort of FORTH.

That got the rest of X86/AMD64 world UEFI.

Whatever the M+ ARM based Macs use is totally unrelated, too.

Adieu to logic, uh? (Which LLM hallucinated that stuff for you?)

AdieuToLogic

4 months ago

1 reply

>> PowerPC and Intel Macs used to use Open Firmware ...

> There is no OpenFirmware on Intel Macs. They used EFI ...

I thought I remembered playing with Forth at the boot prompt on an Intel Mac. I guess I was wrong and it was on a PowerPC Mac.

> Adieu to logic, uh? (Which LLM hallucinated that stuff for you?)

Human memory. Yours may be perfect in every recall, being the first human ever to be able to claim this miraculous ability, but I highly doubt it.

LargoLasskhyfv

4 months ago

SCNR with that username. Think of me as moody Marvin, if it helps ;>

bdowling

4 months ago

More trivia: OpenFirmware is descended from Sun Microsystems' firmware, OpenBoot. Add-on cards could contain Forth code that would initialize their own hardware and provide a device driver that could be used at boot time (e.g., a network card could initialize itself and provide commands to download and boot an OS off the network).

uncircle

4 months ago

1 reply

I like Forth, I am aware of its shortcomings though there is one I haven't seen mentioned: the good practice with forth is factorization, i.e. splitting words into small ones where stack juggling is rarely necessary. The enormous issue with this, not seen in other languages, is that all those words have to be named, and naming is well-known as one of the hardest things in computing.

What is a simple expression in C, for example, in Forth becomes half-a-dozen words which all have to be named, and it is an enormous productivity killer. Especially when doing maths, you would have to find a name for all the intermediate steps of an equation instead of simply doing:

  float x = (-b + sqrt(4 * a * c)) / (2 * a * c);

uticus

4 months ago

> ...you would have to find a name for all the intermediate steps of an equation instead of simply doing...

Trying to understand this. Do you mean the operators in your example (-, +, *, /) would need to be named?

If so, I agree one-off terms wouldn't make it a great calculator substitute. In fact, you'd probably end up doing equations similar to the RPN on some HP calcs [0] to get the order of operations correct - minus the handy physical +, /, etc buttons. But for programming, wouldn't you either assign the parts or the whole of the term to a word?

[0] https://en.wikipedia.org/wiki/Reverse_Polish_notation

astrobe_

4 months ago

> Look at that stack wrangling! It's really hard to follow what goes where without the detailed comments showing the stack layout on the right of each instruction (a common practice for Forth programs).

The proposed definition is not particularly difficult to follow. This a readability argument, and we know it is 90% subjective. OP could be a rookie Lisper and I could be a grey beard Lisper: I don't see a problem here, cause I don't see the parens any more.

Second, Forth as 2 stacks, and it shouldn't be overlooked:

    : +! dup >R @ + R> ! ;

Of course, this will still be opaque to those here not familiar with Forth, but basically this definition copies the address you need twice to the "other" stack, fetches the memory value, adds, then retrieves the memory address to store the result.

Third, stack comments are as common as training wheels on bicycles. Ok, a bit more common actually. But you get the idea: you need them when you have not yet learned to feel forces and balance your weight accordingly. This is also a key point which is not visible because it is not syntax nor semantics, but design. Forth programmers are very insistent on simplification and factoring for this reason. Take them seriously. If you see 2+ instances of the same sequence of words, don't worry about execution speed, factor it.

Moore explained that definitions are like abbreviations except they can take a parameter or two. Follow that. He also isn't joking when he says than messing with more than two parameters on the stack (like the proposed +!) is already too much. The author even agrees, in a way.

When you do all that, this is when you can make cherries rain on your cake: you can rewrite in machine code the critical definitions, and it is easy to do because they are so short and simple. You can make up for the cost of interpretation and the cost of convenience factoring, if needed.

> How can you know the arity of the functions without adding explicit comments?

Oh, the classic example with foo and bar that give zero clue about what they do. Compare with the "+!" proposed by the author himself - any Forth programmer instantly knows it most likely takes 2 arguments and returns nothing. If it doesn't they will argue that it is as terrible name and you should know better.

This is one of the biggest challenges of Forth: stop FUDing yourself with "what ifs". They won't happen. Ok, sometimes they do happen. But by this time you will have built some confidence, stopped guarding against alien invasions from the next multiverse (which simplifies coding a lot), and you will be able to deal with those unexpected turn of events.

I won't pretend the problem doesn't exist at all, though. But it is far smaller than this type of argument makes it seem to be. It can be a problem when reading someone else's code who has an unfamiliar coding style, granted. But I have written thousands of lines of Forth and I rarely have this problem.

Also, a note on the C equivalent: again, it is focused on the syntax, but we know that semantics are where the fun is: do foo or bar have nasty side effects? Doesn't bar return a signed integer when foo expects an unsigned? I would argue that C's more "natural" syntax creates an illusion of understanding which makes it more friendly than it really is. Maybe you've felt it if you reviewed some code once, and then later had to debug or adapt it.

bertili

4 months ago

Forth can be beautifully and efficiently implemented in portable c++ using the using continuation passing style via the clang musttail attribute. See Tails [1].

[1] https://github.com/snej/tails

View full discussion on Hacker News

ID: 45039301Type: storyLast synced: 11/20/2025, 7:45:36 PM

Want the full context?

Jump to the original sources

Read the primary article or dive into the live Hacker News thread when you're ready.

Open link View on HN