It’s Not Wrong That (for Hn) “[facepalm Emoji]”.length == 36
Key topics
The coding community is buzzing about a quirky observation: the length of a "facepalm" emoji is 36 characters, sparking a lighthearted debate about Unicode representation. Commenters chimed in with suggestions to rephrase the title to accurately reflect the issue, with some poking fun at the unexpected character count. As it turns out, the article linked in the discussion reveals that the platform in question strips certain Unicode code points, including emojis, from titles, explaining the anomaly. This thread is relevant now because it highlights the often-overlooked intricacies of Unicode handling in coding.
Snapshot generated from the HN discussion
Discussion Activity
Very active discussionFirst comment
4d
Peak period
46
84-96h
Avg / period
19.7
Based on 59 loaded comments
Key moments
- 01Story posted
Aug 25, 2025 at 1:20 PM EDT
5 months ago
Step 01 - 02First comment
Aug 29, 2025 at 9:37 AM EDT
4d after posting
Step 02 - 03Peak activity
46 comments in 84-96h
Hottest window of the conversation
Step 03 - 04Latest activity
Aug 30, 2025 at 5:07 AM EDT
4 months ago
Step 04
Generating AI Summary...
Analyzing up to 500 comments to identify key contributors and discussion patterns
Want the full context?
Jump to the original sources
Read the primary article or dive into the live Hacker News thread when you're ready.
It’s Not Wrong that (for HN) “[facepalm emoji]”.length == 36
Or some alternative if the above is too long, like:
“[facepalm emoji]”.length == 36
Both seem more accurate than the current:
It’s Not Wrong that (for HN) “ ”.length == 36
Checks out.
But I was definitely going to hunt for a description that made it 36 characters long.
Heh. In my code, I always (idiosyncratically, I admit) spell '\x20' as '\x20' (or even just as 0x20, if it's C), unless it's a part of a multicharacter string e.g. "Hello world!": it just feels wrong to have an empty space inside single quotes. Is it really just U+0020 in there? Is it supposed to be U+0020 there? Silly worries, I know, but I just don't like the way ' ' looks.
Edit: as demonstrated by this very comment.
Also I’m not even sure it was a good idea to put them in text. Emojis are a special case that breaks a lot. Now you have to worry about multiple colors, etc.
Neither are digits, or control characters, strictly speaking. We really shouldn't have been able to have CR and LF explicitly embedded in the text files.
Digits definitely are a form of text though. Unicode is for writing systems, which definitely includes writing numbers
Jury's still out on social media, but not definitely not emoji. Incontestable success story
The real problem is that the alphabets of certain writing systems are unbounded. Emojis are completely unbounded. That's the only reason to have concern with it in unicode. Unicode is a limited set by definition and emojis are an unbounded set.
The modern Western society is very occupied with the questions of racial and gender identity, and it is generally accepted in that society that this topic is "significant". And since it's that society that the Unicode Consortium is working within, this explains how you get six different colors of "man-pregnant" emoji in the world where there possibly haven't been six different-colored pregnant men.
I would like to stress that I am not arguing against the addition U+1FAC3 PREGNANT MAN or U+1FAC4 PREGNANT PERSON, there are good reasons to add these, but do we need mundane arbitrary everyday items line U+1FAA9 MIRROR BALL? I'd say no.
For example, this is the unicode sequence for bearded lady:
So a pregnant man could simply be this expression: But no, instead we must have this combinatorial explosion of compositions because Unicode can't decide if it wants to be a symbol library or an expression library. So now, we have duplicates like U+1F40F ram and U+1F411 ewe, U+1F404 cow and U+1F402 bull, U+1F9D2 child and U+1F466 boy and U+1F467 girl (but a baby boy must be expressed as U+1F476 U+200D U+2642), U+1F468 man and U+1F469 woman and U+1F9D1 person, and U+1F385 Santa Claus and U+1F936 Mrs Claus but also U+1F9D1 U+200D U+1F384 non-gendered Claus.Take, for example, the various skin colors for faces and persons: if emoji were a real ideographic script, the written representation would be a logograph combined with a determinative, not a set of distinct glyphs. The irony of course is that is exactly how it is encoded within Unicode (an emoji codepoint with a skin color modifier). But doing it this way is exactly why emoji is an illegitimate script: it does not represent any non-digital form of writing, and the emoji modifiers do not have any representation of themselves, neither visual nor audible. Nor is the modifier composable in the way that a real language would be: it does not modify animal colors, for example.
What about "fancy fonts" (foreign characters that look like latin letters)? Japanese / Chinese ideographs? Common pictograms like "stop sign?" Mathematical symbols?
People made emoticons out of ~100 printable ASCII characters. With thousands of "real" Unicode symbols available, they would have gone wild anyway.
As a person with accessibility needs, I'm honestly glad emojis exist. They at least carry semantic meanings (though some people do abuse them in ways inconsistent with those meanings), unlike random combinations of symbols that the internet community has agreed on.
Unicode's original self-declared mission was to encode all characters needed for written communication in the world.
Wikipedia once had a similar issue, where people used it to add all kinds of trivia and original research. There was a fight between the so called inclusionists and deletionists. The latter won and we now have strict guidelines that ensure everything in Wikipedia has to have strong relevant external validation.
In my opinion, the Unicode Consortium would have been well advised to follow Wikipedia's example. If they really only had added characters with significant organic usage we'd seen only a much smaller number of emojis added and in my opinion to nobody's disadvantage.
But this is easy for me to say. I'm curious how emojis help with your accessibility needs. Has it to do with the fact that they take up little screen space or is it something else?
When somebody sends a bunch of Ancient Egyptian hieroglyphs mixed in with some mathematical symbols, with a little Katakana on top, I have no idea what they mean. The message may encode some visual meaning due to how the characters look and the visual patterns they form when placed in combination, but its semantic meaning isn't clear, so a non-visual technology cannot interpret and pronounce it properly.
This is a very common issue with "fancy font generators", which were common in certain Twitter communities once upon a time.
It is reasonable and worthwhile to encode some nonverbal information in it, and emojis have won the day.
Generally I agree with you, but in rare cases like the article that this one is meta-commentary to it might perhaps have been justifiable to allow it? I see the "slippery slope" risk though.
For. Every. Single. Emoji.
I don't remember a case when I really wanted a sexualized version, I always want to express just an emotion. Just remove all the prefixed versions, and leave the pure one.
Are there any even mildly-popular languages that use, or allow, curly quotes for strings? I’d kinda like there to be at least one.
- https://www.youtube.com/watch?v=K93zcgFsynk&ab_channel=Vsauc...
- https://www.youtube.com/watch?v=XrHTI04i9yk&ab_channel=%E2%8...
This is done by using invisible characters such as ZWNJ to get around the title filter.
It’s not wrong that "\u{1F926}\u{1F3FC}\u200D\u2642\uFE0F".length == 7 (2019) - https://news.ycombinator.com/item?id=44981525 - Aug 2025 (274 comments)
(btw the title edit on that one was for fun, and came about this way: https://news.ycombinator.com/item?id=44981808)
Also related:
It’s not wrong that "🤦🏼♂️".length == 7 (2019) - https://news.ycombinator.com/item?id=36159443 - June 2023 (303 comments)
String length functions for single emoji characters evaluate to greater than 1 - https://news.ycombinator.com/item?id=26591373 - March 2021 (127 comments)
String Lengths in Unicode - https://news.ycombinator.com/item?id=20914184 - Sept 2019 (140 comments)
(as you can see, we've taken quite a few different whacks at that piñata of a title over the years)
---
This comment is brought to you thanks: "View Selection Source" context menu entry in Firefox.