I Am Building an API That Gives You Bounding Boxes for Every Answer
Posted3 months ago
ninjadoc.aiTechstory
calmneutral
Debate
0/100
APINlpDocument Analysis
Key topics
API
Nlp
Document Analysis
The author is building an API that provides bounding boxes for answers in documents, with the discussion touching on potential applications and comparisons to existing tools.
Snapshot generated from the HN discussion
Discussion Activity
Light discussionFirst comment
N/A
Peak period
1
Start
Avg / period
1
Key moments
- 01Story posted
Oct 7, 2025 at 7:14 AM EDT
3 months ago
Step 01 - 02First comment
Oct 7, 2025 at 7:14 AM EDT
0s after posting
Step 02 - 03Peak activity
1 comments in Start
Hottest window of the conversation
Step 03 - 04Latest activity
Oct 7, 2025 at 7:14 AM EDT
3 months ago
Step 04
Generating AI Summary...
Analyzing up to 500 comments to identify key contributors and discussion patterns
ID: 45501737Type: storyLast synced: 11/17/2025, 11:07:56 AM
Want the full context?
Jump to the original sources
Read the primary article or dive into the live Hacker News thread when you're ready.
- LLMs are great to provide answers on documents but not so great when you need bounding boxes for them.
- OCRs give you bounding boxes but they don't understand context.
- Mixing them both is a pain.
So we are using vision models to solve this. The result is an easy to use api that gives you an answer for any question you may have but also geometry information for the evidence it found.
We have a few more products lined up down the line but for now this is what you get:
- Endpoints to ask a single question
- A dashboard to define a collection of questions
- Ai enhanced Markdown transforms (our techniques are great for these)
More improvements will come down the line but it's looking good and I wish I had something like this before.
Let me know what you think, any feedback is appreciated.
Thanks!