Standard Ebooks: Public Domain Day 2026 in Literature
Key topics
As the literary world gears up for Public Domain Day 2026, the team behind Standard Ebooks is preparing to release a fresh batch of classics, sparking a lively discussion about their ebook production process. Contributors and enthusiasts alike chimed in, debating the absence of PDFs, with some speculating it's due to the format's problematic nature, while others revealed that Standard Ebooks offers ebooks in four alternative formats, including "compatible" and "advanced" EPUB. The conversation also touched on the team's cover art selection criteria, which favors fine-art oil painting styles, with some wiggle room for modern productions. Amidst the Q&A, a fascinating debate emerged about the value of preserving original typesetting and formatting, with some arguing that these elements can be lost in translation, while others countered that they may not have been intentional to begin with.
Snapshot generated from the HN discussion
Discussion Activity
Active discussionFirst comment
47m
Peak period
16
0-3h
Avg / period
8
Based on 56 loaded comments
Key moments
- 01Story posted
Jan 2, 2026 at 3:40 AM EST
9 days ago
Step 01 - 02First comment
Jan 2, 2026 at 4:27 AM EST
47m after posting
Step 02 - 03Peak activity
16 comments in 0-3h
Hottest window of the conversation
Step 03 - 04Latest activity
Jan 3, 2026 at 3:47 PM EST
8 days ago
Step 04
Generating AI Summary...
Analyzing up to 500 comments to identify key contributors and discussion patterns
Want the full context?
Jump to the original sources
Read the primary article or dive into the live Hacker News thread when you're ready.
One of the SE editors experimented with turning SE ebooks into PDFs, though. See more about that here: https://groups.google.com/g/standardebooks/c/Xy2bwiexLeM/m/f...
You can run their EPUB through Pandoc to convert yourself, or put some effort in and setup your own Calibre instance which will do something similar when you ask it to.
Also thanks for doing this, I've read a bunch of stuff (GK Chesterton, Dorothy Sayers, Dashiell Hammett) that I wouldn't have otherwise if it weren't for this service.
1920ish was when it started becoming much more common for books to have illustrated dust jackets, so now that more books from that era and onwards are entering the public domain, we opt to use the first edition dust jacket if possible. Fortunately for us, that era also happens to be the so-called Golden Age of Illustration so it's not hard finding beautiful art to use!
[1] https://standardebooks.org/subjects/science-fiction
[2] https://standardebooks.org/artworks
Do you know if the project try to look at other languages at all?
Something I am rather familiar with is brought out by your mention of the German edition/title; that the continental market seems to generally produce books that are far more densely formatted, i.e., smaller font and typesetting, thinner pages, and leading to overall tighter book formats. I actually appreciate it when, e.g., a book is 1/2 the size and weight, and usually also made far more durably; but it will invariably compromise any author intention related to the arrangement of the lettering.
Maybe you can confirm that based on what seems to be your English and German editions of the same novel.
The German and Swedish editions I read were similarly typeset, and the first scan I found in English felt similar. What I wanted to know was if there was some thought into it, because the website is nicely designed so striving for a unique typesetting strategy could be a goal.
I found it amusing, considering all those memes about German words with 35 letters each.
And, as I get older, I began to consider letter size relevant to choose a book edition. Gave up buying new books and went for used, older editions with bigger letters.
Apart from that, they produce nice editions.
We do have a default typography across all our works (the “Standard” in “Standard Ebooks” refers to a standard imprint; think Penguin) but we usually retain specific famous things where possible in a reflowable format. For example, the Mouse’s Tail in Alice’s Adventures in Wonderland,[1] or the letter in E. A. Poe’s “Thou Art the Man”.[2]
We don’t take on other languages, no. Our tooling[3] and style guides[4] are tailored specifically to English. Absolutely nothing stopping another project from forking the codebase (it’s GPL-3) and giving it a go.
[1] https://standardebooks.org/ebooks/lewis-carroll/alices-adven...
[2] https://standardebooks.org/ebooks/edgar-allan-poe/short-fict...
[3] https://github.com/standardebooks/tools
[4] https://standardebooks.org/manual/
First-time contributors should select something from the appropriate section, because that gives you the greatest chance of succeeding and the least burden on our reviewers as you get started.
Our toolset has a help wanted section and some outstanding issues: https://github.com/standardebooks/tools#help-wanted
Regardless, my understanding of copyright is that people broadly get annoyed when you infringe by distribution. In this case the distribution didn’t happen until the copyright had expired. People preparing these projects for later launch do it purely on their own machines, and nothing arrives at Standard Ebooks until the day of release.
I tried to find a policy page on this via a standardebooks.org site search but nothing looked relevant.
I’m asking after realizing that some of my favorite books were books where the ebook had intentional font choices, for example different fonts for chapter titles vs body text, fonts that matched the vibe of the book (historical, more modern, etc.) It would be nice if more ebook readers made it easy to import more than the ~8 fonts they include by default but the next best thing is when the book itself includes a great font.
When will they start assigning catalog numbers to each of their works in the way that Project Guteneberg does? I'd like a unique id per ebook and since you don't (for obvious reasons) use ISBNs, there's nothing really to be done. I can't use an OCLC id because Standard Ebooks aren't consistently listed in Worldcat, I can't use Bookbrainz or Open Library ids for the same reason.
It costs nothing and is a low-effort fix. It's been an industry (and library science) thing for decades or longer. Can Standard Ebooks finally stop the amateur hour crap?
Not that it matters, most of your catalog is done better by others, I'll just go with them. Why would I want books from people who have an aversion for doing things right?
The last point is why the Beatrix Potter compilation I did ended up with illustrations: the text specifically references the illustrations in a couple of the books (“Can you see…?”) so the illustrations remained. It did mean writing 602 pieces of alt text though[1] so it was a fairly major undertaking to include them.
[1] https://standardebooks.org/ebooks/beatrix-potter/short-ficti...
1. renders the cover as a JPG from the native SVG 2. converts more complicated selectors (e.g. :nth-of-type, :empty, etc.) to classes and class selectors 3. renders MathML as images 4. renders SVGs as images
…and a bunch more things to make it work better with older ereaders, which tend hang around for a long time.
It’s worth pointing out that the Kobo builds are closer to the “advanced” build as they have great support for MathML and SVG. But they add a bunch of Kobo-specific markup to make it work better there.
It would be neat to see those editions show up in when browsing collections in apps like Libby or Overdrive (at least in the U.S.).
Was referring to the Bogart version.
I have a hypothesis that we're getting closer to a cultural inflection point (maybe half a decade out). With every year, more important and very high-quality cultural artifacts enter the public domain, while at the same time, many low quality artefacts are produced (... AI slop). It'll be increasingly difficult to choose a good cultural artefict for consumption (e.g., which book to read next or which movie to watch). A very good indicator for quality is time and thus a useful filter.
In some years we could have the following: a netflix-like (legal variant of popcorntime) software system (p2p) that serves high-quality public domain movies, for those who like it, even with AI upscaling or post processing.
The same would also work for books, with this pipeline: Project Gutenberg -> Standard Ebooks. At the inflection point, there would be a steady stream of high-quality formats of high-quality content, enough to satisfy the demand of cultural consumption. You wouldn't need the latest book/movie anymore, except for interest in contemporary stuff.
The more difficult part of any such list is the editorial decision which works to include. Even if we only cared about books published in English that would be thousands each year
> On January 1, 2026, books published in 1930 enter the U.S. public domain.
The Copyright laws are different in each country, and it's a non-sense in the modern world.
A few years ago, I was searching for books written by Alexandra David-Neel. I found them on a Canadian (IIRC) website, but downloads were filtered by geo-IP, since what was in the public domain there was not yet public in France. One of the books I wanted was written before 1900, and not in print since then. Yet the author died in 1969, aged 100, so the French Public Domain for her works will start in 2040.
Another example: "As I lay dying" by William Faulkner is now Public Domain in the USA. It was Public Domain in Canada from 2013 to 2023. Then the law changed, and the copyright was extended by 20 years, and reinstated for this book until 2032 — which is 70 years after the author's death in 1962.
When you say you have the rights to it, you might only have the right to read it, not to give it to anyone else. What was the license text when you bought it?
https://indieauthorproject.com/authors/
I couldn't find a ebook online, so I found an old paperback copy and created one: https://www.hotelexistence.ca/create-epub-from-paperback/
Charles T. Currelly was like a real-life Indiana Jones, he was the first director of the Royal Ontario Museum in Toronto, and sourced much of its early collections.
Even with modern OCR (I used Mistral's here), and a book with limited formatting, it's funny how hours of touch-ups are required just to get a glitch-free reading experience (no stray headers, paragraphs, page numbers sprinkled through the text).
https://dave.autonoma.ca/blog/2020/04/11/project-gutenberg-p...
A bit tangential here, but I am really looking forward to 2035 for the public domain. A ton of culturally significant works seem to enter then - And Then There Were None, Gone with the Wind, The Wizard of Oz, Mr. Smith Goes to Washington, Batman (Detective Comics #27), Superman #1, Marvel Comics #1, and Tintin’s King Ottokar’s Sceptre.
https://en.wikipedia.org/wiki/2035_in_public_domain
Wikipedia also tells me that all of the 'life + 70" countries will have Ian Fleming's James Bond works in the public domain in 2035 as well.
Happy Public Domain Day 2026
https://news.ycombinator.com/item?id=46460440
What will enter the public domain in 2026?
https://news.ycombinator.com/item?id=46117112
Hopefully this makes discovery of books easier and lets people manage their libraries online - I like Calibre, but it is not great for people who are just getting started.