Not

Hacker News!

Beta
Home
Jobs
Q&A
Startups
Trends
Users
Live
AI companion for Hacker News

Nov 23, 2025 at 8:32 AM EST

How Proper Names Behave in Text Embedding Space

etoud
2 points
1 comment

Mood: informative

Sentiment: neutral

Category: research

Key topics: Text_embedding, Natural_language_processing, Machine_learning

Discussion Activity: Light discussion

First comment: N/A

Peak period: 1 comment (Hour 1)

Avg / period: 1


Based on 1 loaded comment

Key moments

  1. Story posted: Nov 23, 2025 at 8:32 AM EST (18h ago)
  2. First comment: Nov 23, 2025 at 8:32 AM EST (0s after posting)
  3. Peak activity: 1 comment in Hour 1 (hottest window of the conversation)
  4. Latest activity: Nov 23, 2025 at 8:32 AM EST (18h ago)


Discussion (1 comment)
etoud
18h ago
I was debugging a RAG system and noticed that “semantic” dense retrievers were oddly good at author names, even when hybrid clearly worked better overall. This post builds a small diagnostic around synthetic (author, topic) queries and shows that proper names carry about half as much separation power as the topic in embedding space. Then I systematically “break” the names (masks, gibberish IDs, small edit-distance corruptions, formatting and layout changes) to see what survives, and find that most of the signal comes from surface form and exact-match bias rather than any deep notion of identity.
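A minimal sketch of the kind of diagnostic described above, not the post's actual code. Everything here is an assumption made for illustration: the "<author> on <topic>" query template, the names and topics, the three corruption helpers (mask_name, gibberish_id, typo), and toy_embed, a hashed character-trigram stand-in that would be swapped for a real dense retriever. Separation is measured as the mean cosine similarity of query pairs that share one factor minus the mean for pairs that share neither.

```python
import hashlib
import math
import random
import string
from itertools import combinations


def toy_embed(text, dim=256):
    """Stand-in embedder (assumption): hashed character trigrams, L2-normalised."""
    vec = [0.0] * dim
    padded = f"  {text.lower()}  "
    for i in range(len(padded) - 2):
        gram = padded[i:i + 3]
        idx = int(hashlib.md5(gram.encode("utf-8")).hexdigest(), 16) % dim
        vec[idx] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def cosine(a, b):
    # Both vectors are already unit length, so the dot product is the cosine.
    return sum(x * y for x, y in zip(a, b))


def mask_name(name):
    """Replace every name with the same placeholder token."""
    return "[AUTHOR]"


def gibberish_id(name):
    """Replace a name with an opaque but consistent identifier."""
    return "id_" + hashlib.sha1(name.encode("utf-8")).hexdigest()[:8]


def typo(name, rng):
    """Substitute one letter, giving an edit distance of exactly 1."""
    positions = [i for i, c in enumerate(name) if c.isalpha()]
    if not positions:
        return name
    i = rng.choice(positions)
    choices = [c for c in string.ascii_lowercase if c != name[i].lower()]
    return name[:i] + rng.choice(choices) + name[i + 1:]


def build_queries(authors, topics, transform=lambda name: name):
    """Synthetic (author, topic, text) triples; the template is hypothetical."""
    return [(a, t, f"{transform(a)} on {t}") for a in authors for t in topics]


def separation(items, embed=toy_embed):
    """Similarity gain from sharing only the author or only the topic,
    relative to pairs that share neither."""
    vecs = [embed(text) for _, _, text in items]
    buckets = {"author": [], "topic": [], "neither": []}
    for i, j in combinations(range(len(items)), 2):
        same_author = items[i][0] == items[j][0]
        same_topic = items[i][1] == items[j][1]
        if same_author and same_topic:
            continue
        key = "author" if same_author else "topic" if same_topic else "neither"
        buckets[key].append(cosine(vecs[i], vecs[j]))
    mean = lambda xs: sum(xs) / len(xs)
    base = mean(buckets["neither"])
    return {k: mean(buckets[k]) - base for k in ("author", "topic")}


AUTHORS = ["Ada Lovelace", "Alan Turing", "Grace Hopper"]
TOPICS = ["graph databases", "compiler design", "sparse attention"]

if __name__ == "__main__":
    rng = random.Random(0)
    variants = {
        "original": lambda n: n,
        "masked": mask_name,
        "gibberish": gibberish_id,
        "typo": lambda n: typo(n, rng),
    }
    for label, transform in variants.items():
        print(label, separation(build_queries(AUTHORS, TOPICS, transform)))
```

With a real model, the comparison of interest is the ratio of the author value to the topic value for the original names, and how much of the author value remains under each corruption. The toy embedder sees only surface characters, so its numbers say nothing about what an actual dense retriever does.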
ID: 46023432 | Type: story | Last synced: 11/23/2025, 6:43:48 PM


© 2025 Not Hacker News! — independent Hacker News companion.

Not affiliated with Hacker News or Y Combinator. We simply enrich the public API with analytics.