New Book on Apache Solr/Lucene
Posted 3 months ago
testmysearch.com · Tech · story
Key topics
Apache Solr
Lucene
Search Technology
A new book on Apache Solr/Lucene has been released, and the author is promoting it on HN, with a single commenter expressing interest and a personal connection to the topic.
Story posted: Oct 27, 2025 at 10:44 AM EDT
ID: 45721557 · Type: story · Last synced: 11/17/2025, 8:05:31 AM
This book differs from other books on Solr/Lucene in that it describes the algorithms and data structures used inside Solr/Lucene. Each topic starts from the problem the developers faced, examines the naive or straightforward solutions, and then shows how the Solr/Lucene architects actually solved it. In many cases these are interesting, research-heavy solutions backed by scientific papers (links are available in the book).
For example, I cover not only the inverted index, posting lists, and how segments are stored, which you can find in other books, but also how the Finite-State Transducer (FST) is used there. The FST is a graph-based structure for term dictionaries that stores terms and metadata with shared prefixes and suffixes; it addresses high memory usage and slow prefix/range queries in large vocabularies by providing compact lookups in O(length of term) time. I also cover the Pulsing Codec, delta encoding, skip lists, PackedInts (a bit-packing scheme that uses the minimum number of bits per integer in a block, based on the block's maximum value), variable-length integers, LSM-trees, HNSW for vectors, Roaring Bitmaps, LZ4 and DEFLATE compression, memory-mapped I/O, scatter-gather query execution, hash-based routing, SIMD vectorization, and many other topics of this kind. The book is intended for solution architects and search engineers.
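To give a flavor of three of the techniques named above (delta encoding, variable-length integers, and PackedInts-style bit packing), here is a minimal Python sketch applied to a tiny posting list. It is an illustration under simplifying assumptions, not Lucene's actual code or on-disk format: all function names are hypothetical, and the packed block is held in a single Python integer rather than in a byte buffer.

```python
from typing import Iterable, List, Tuple


def delta_encode(doc_ids: List[int]) -> List[int]:
    """Replace each sorted doc ID with the gap to its predecessor."""
    gaps, prev = [], 0
    for doc_id in doc_ids:
        gaps.append(doc_id - prev)
        prev = doc_id
    return gaps


def delta_decode(gaps: Iterable[int]) -> List[int]:
    """Rebuild the doc IDs by summing the gaps."""
    doc_ids, total = [], 0
    for gap in gaps:
        total += gap
        doc_ids.append(total)
    return doc_ids


def vint_encode(value: int) -> bytes:
    """7 data bits per byte, low-order group first.
    A set high bit means another byte follows."""
    if value < 0:
        raise ValueError("this sketch handles non-negative integers only")
    out = bytearray()
    while value >= 0x80:
        out.append((value & 0x7F) | 0x80)
        value >>= 7
    out.append(value)
    return bytes(out)


def vint_decode_all(data: bytes) -> List[int]:
    """Decode a byte string holding back-to-back variable-length integers."""
    values, current, shift = [], 0, 0
    for byte in data:
        current |= (byte & 0x7F) << shift
        if byte & 0x80:
            shift += 7
        else:
            values.append(current)
            current, shift = 0, 0
    return values


def pack_block(values: List[int]) -> Tuple[int, int]:
    """Pack a block at the bit width needed by its maximum value."""
    bits = max(1, max(values).bit_length())
    packed = 0
    for i, value in enumerate(values):
        packed |= value << (i * bits)
    return bits, packed


def unpack_block(bits: int, packed: int, count: int) -> List[int]:
    """Extract `count` fixed-width values from a packed block."""
    mask = (1 << bits) - 1
    return [(packed >> (i * bits)) & mask for i in range(count)]


doc_ids = [3, 7, 150, 151, 1000]
gaps = delta_encode(doc_ids)
encoded = b"".join(vint_encode(g) for g in gaps)
bits, packed = pack_block(gaps)
print(gaps, len(encoded), bits)
```

The point of combining the steps is that gaps between sorted document IDs are usually much smaller than the IDs themselves, so they fit into fewer bytes (variable-length integers) or fewer bits per value (block packing).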
Available in paperback and e-book versions, with worldwide delivery.
I hope you will find it useful.