From Zero to Qed: an Informal Introduction to Formality with Lean 4

Posted20 days agoActive12 days ago

rwosync

131 points

17 comments

sdiehl.github.ioTech Discussionstory

informativepositive

Debate

10/100

Formal VerificationLean 4Mathematics

Key topics

Formal Verification

Lean 4

Mathematics

Diving into the world of formalizing mathematical proofs, a new tutorial on Lean 4 has sparked a lively discussion on the intersection of math, programming, and artificial intelligence. Commenters are buzzing about the potential of Lean 4, with some highlighting the importance of trusted proof kernels and others pondering the role of formal specification languages in verifying mathematical truths. As one commenter astutely pointed out, the real value of a proof lies not just in verifying a statement, but in understanding the underlying concepts - echoing the wisdom of Feynman and Einstein. The discussion reveals a consensus on the significance of building a correct mental model, with Lean 4 emerging as a promising tool in this pursuit.

Snapshot generated from the HN discussion

Discussion Activity

Active discussion

First comment

Peak period

132-144h

Avg / period

Comment distribution21 data points

Loading chart...

Based on 21 loaded comments

Key moments

01Story posted
Dec 13, 2025 at 6:42 PM EST
20 days ago
Step 01
02First comment
Dec 19, 2025 at 4:23 AM EST
5d after posting
Step 02
03Peak activity
16 comments in 132-144h
Hottest window of the conversation
Step 03
04Latest activity
Dec 21, 2025 at 8:50 AM EST
12 days ago
Step 04

Generating AI Summary...

Analyzing up to 500 comments to identify key contributors and discussion patterns

Discussion (17 comments)

Showing 21 comments

randomtoast

14 days ago

4 replies

I have been following recent progress in the formalization of mathematical proofs in Lean, particularly in the context of large language models. One prominent advocate of this approach is Terence Tao, who regularly writes about developments in this area.

From a programmer's perspective, this puts up an interesting parallel. Models such as Sonnet or Opus 4.5 can generate thousands of lines of code per hour. I can review the output, ask the model to write tests, iterate on the result, and after several cycles become confident that the software is sufficiently correct.

For centuries, mathematicians developed proofs by hand, using pen and paper, and were able to check the proofs of their peers. In the context of LLMs, however, a new problem may arise. Consider an LLM that constructs a proof in Lean 4 iteratively over several weeks, resulting in more than 1,000,000 lines of Lean 4 code and concluding with a QED. At what point is an ordinary mathematician no longer able to confirm with confidence that the proof is correct?

Such a mathematician might rely on another LLM to review the proof, and that system might also report that it is correct. We may reach a stage where humans can no longer feasibly verify every proof produced by LLMs due to their length and complexity. Instead, we rely on the Lean compiler and LLMs, which confirm formal correctness, and we are effectively required to trust the toolchain rather than our own direct understanding.

lgas

14 days ago

1 reply

I'm not sure I understand what you're getting at -- your last paragraph suggestions that you understand the point of formal specification languages and theorem provers (ie. for the automated prover to verify the proof such that you just have to trust the toolchain) but in your next to last paragraph you speak as if you think that human mathematicians need to verify the lean 4 code of the proof? It doesn't matter how many lines the proof is, a proof can only be constructed in lean if it's correct. (Well, assuming it's free of escape hatches like `sorry`).

practal

14 days ago

2 replies

> Well, assuming it's free of escape hatches like `sorry`

There are bugs in theorem provers, which means there might be "sorries", maybe even malicious ones (depending on what is at stake) that are not that easy to detect. Personally, I don't think that is much of a problem, as you should be able to come up with a "superlean" version of your theorem prover where correctness is easier to see, and then let the original prover export a proof that the superlean prover can check.

I think more of a concern is that mathematicians might not "understand" the proof anymore that the machine generated. This concern is not about the fact that the proof might be wrong although checked, but that the proof is correct, but cannot be "understood" by humans. I don't think that is too much of a concern either, as we can surely design the machine in a way that the generated proofs are modular, building up beautiful theories on their own.

A final concern might be that what gets lost is that humans understand what "understanding" means. I think that is the biggest concern, and I see it all the time when formalisation is discussed here on HN. Many here think that understanding is simply being able to follow the rules, and that rules are an arbitrary game, and that is simply not true. Obviously not, because think about it, what does it mean to "correctly follow the rules"?

I think the way to address this final concern is to put beauty at the heart of our theorem provers. We need beautiful proofs, written in a beautiful language, checked and created by a beautiful machine.

mutkach

14 days ago

1 reply

Understanding IMO is "developing a correct mental model of a concept", some heuristics of correctness: Feynman: "What I cannot build. I do not understand" Einstein: "If you can't explain it to a six year old, you don't understand it yourself"

Of course none of this changes anything around the machine generated proofs. The point of the proof is to communicate ideas; formalization and verification is simply a certificate showing that those ideas are worth checking out.

practal

14 days ago

1 reply

Ideas and correctness depend on each other. You usually start with an idea, and check if it is correct. If not, you adjust the idea until it becomes correct. Once you have a correct idea, you can go looking for more ideas based on this.

Formalisation and (formulating) ideas are not separate things, they are both mathematics. In particular, it is not that one should live in Lean, and the other one in blueprints.

Formalisation and verification are not simply certificates. For example, what language are you using for the formalisation? That influences how you can express your ideas formally. The more beautiful your language, the more the formal counter part can look like the original informal idea. This capability might actually be a way to define what it means for a language to be beautiful, together with simplicity.

mutkach

14 days ago

2 replies

I share your fascination with proof assistants and formal verification, but the reality is that I am yet to see an actual mathematician working on frontier research who is excited about formalizing their ideas, or enthusiastic about putting in the actual (additional) work to build the formalization prerequisites to even begin defining the theorem's statement in that (formal) language.

hollerith

14 days ago

1 reply

British mathematician Kevin Buzzard has been heavily involved with proof assistants since 2017.

mutkach

14 days ago

1 reply

Sure, he is one of biggest advocates for it, and yet he was quite clear that it is not yet possible for him to do his actual research in Lean.

hollerith

12 days ago

Thanks for that.

practal

14 days ago

You know what? I agree with you. I have not formalised any of my stuff on abstraction logic [1] for that reason (although that would not be too difficult in Isabelle or Lean), I want to write it down in Practal [2], this becoming possible I see as the first serious milestone for Practal. Eventually, I want Practal to feel more natural than paper, and definitely more natural than LaTeX. That's the goal, and I feel many people now see that this will be possible with AI within the next decade.

[1] http://abstractionlogic.com

[2] https://practal.com

lgas

14 days ago

1 reply

> Personally, I don't think that is much of a problem, as you should be able to come up with a "superlean" version of your theorem prover where correctness is easier to see, and then let the original prover export a proof that the superlean prover can check.

I think this is sort of how lean itself already works. It has a minimal trusted kernel that everything is forced through. Only the kernel has to be verified.

practal

14 days ago

In principle, this is how these systems work. In practice, there are usually plenty of things that make it difficult to say for sure if you have a proof of something.

mutkach

14 days ago

1 reply

> more than 1,000,000 lines of Lean 4 code and concluding with a QED.

Usually the point of the proof is not to figure out whether a particular statement is true (which may be of little interest by itself, see Collatz conjecture), but to develop some good ideas _while_ proving that statement. So there's not much value in verified 1mil lines of Lean by itself. You'd want to study the (Lean) proof hoping to find some kind of new math invented in it or a particular trick worth noticing.

LLM may first develop a proof in natural language, then prove its correctness while autoformalizing it in Lean. Maybe it will be worth something in that case.

aejm

14 days ago

1 reply

No, the point of proofs in mathematics IS to prove a particular statement is true, given certain axioms (accepted truths). Yes, there are numerous benefits beyond demonstrating something is undeniably true, perhaps more “useful” than the proof itself, but math is a method of formal knowledge that doesn’t accept shortcuts.

peterkagey

14 days ago

A lot of mathematicians (myself included) would say that the point of proofs isn’t entirely to know whether or not a statement is true, but that it exists to promote human understanding. In fact, I’d argue that at some level, knowing whether or not a theorem is true can be less important than understanding an argument.

This is why having multiple different proofs is valuable to the math community—because different proofs offer different perspectives and ways of understanding.

zozbot234

14 days ago

1 reply

You don't have to trust the full Lean toolchain, only the trusted proof kernel (which is small enough to be understood by a human) and any desugarings involved in converting the theorem statement from high-level Lean to whatever the proof kernel accepts (these should also be simple enough). The proof itself can be checked automatically.

tuhgdetzhh

14 days ago

1 reply

Lets assume the kenerl is correct and a mathematian trusts it, now what happens if it is the proof itself that he cannot understand anymore because it has gotten too sophisticated? I think that this would indeed be a difference, and a crossing point in history of mathematical proofs.

zozbot234

14 days ago

In principle, it's no different than failing to understand the details of any calculation. If the calculating process is executed correctly, you can still trust the outcome.

esafak

14 days ago

I am not a mathematician either, but having read Tao's writing on developing an intuition (https://terrytao.wordpress.com/career-advice/theres-more-to-...) I can easily see how he would profitably point an LLM in the right direction, and suggest approaches that would conclude in a successful proof upon connecting the dots.

oersted

15 days ago

The motivation itself is quite fresh and compelling even as a standalone article.

https://sdiehl.github.io/zero-to-qed/02_why.html

IshKebab

14 days ago

This looks really good and complete. And I like that it starts from normal programming stuff rather than immediately jumping into type universes and proof expressions.

View full discussion on Hacker News

ID: 46259343Type: storyLast synced: 12/19/2025, 8:15:29 PM

Want the full context?

Jump to the original sources

Read the primary article or dive into the live Hacker News thread when you're ready.

Open link View on HN