Not
Hacker
News
!
Home
Hiring
Products
Discussion
Q&A
Users
Claude Opus 4.5, and why evaluating new LLMs is increasingly difficult | Not Hacker News!