Not

Hacker News!

Beta
Home
Jobs
Q&A
Startups
Trends
Users
Live
AI companion for Hacker News

Not

Hacker News!

Beta
Home
Jobs
Q&A
Startups
Trends
Users
Live
AI companion for Hacker News
  1. Home
  2. /Story
  3. /Show HN: PDFClear – Browser-based PDF tools with local AI (WASM+Transformers.js)
  1. Home
  2. /Story
  3. /Show HN: PDFClear – Browser-based PDF tools with local AI (WASM+Transformers.js)
Nov 24, 2025 at 12:56 PM EST

Show HN: PDFClear – Browser-based PDF tools with local AI (WASM+Transformers.js)

aliansari22
1 points
0 comments

Mood

informative

Sentiment

positive

Category

startup_launch

Key topics

Pdf Tools

Webassembly

Local Ai

Privacy

Browser-Based Applications

Hello HN,

I’m the founder of PDFClear (https://www.pdfclear.com). It’s a suite of PDF tools (merge, split, compress, etc.) that runs entirely in the browser. I built this because I was tired of Googling "merge pdf" and landing on sites that require me to upload sensitive bank statements or contracts to an unknown server. I wanted a tool where the file never leaves the device.

The Tech Stack: The app is built with React and Vite, but the heavy lifting is done via WebAssembly and Web Workers to keep the UI thread responsive. - PDF Manipulation: I’m using pdf-lib for standard operations (merge, split, rotate). - Compression & Encryption: For heavier tasks like compressing streams or handling encryption/decryption, I compiled QPDF to WebAssembly (qpdf-wasm). - OCR: Scanned documents are processed client-side using Tesseract.js.

Local AI (The New Part): I recently added Semantic Search and Summarization without relying on OpenAI/Anthropic APIs. - It uses Transformers.js to run ONNX models directly in the browser. - Search: Uses different models (including nomic-ai/nomic-embed-text-v1.5 and Xenova/GIST-small-Embedding-v0) for embeddings. It chunks the text, stores vectors in IndexedDB (via idb-keyval), and performs cosine similarity locally. - Summarization: Uses onnx-community/text_summarization-ONNX (quantized) running in a Web Worker.

Privacy: Because everything runs client-side, no documents are uploaded to my server. You can verify this by inspecting the Network tab. Once the app loads (and the AI models are cached), it works fully offline.

I’d love your feedback on the performance of the local AI models, specifically on older devices.

Discussion Activity

Light discussion

First comment

2h

Peak period

1

Hour 3

Avg / period

1

Comment distribution1 data points
Loading chart...

Based on 1 loaded comments

Key moments

  1. 01Story posted

    Nov 24, 2025 at 12:56 PM EST

    8h ago

    Step 01
  2. 02First comment

    Nov 24, 2025 at 3:15 PM EST

    2h after posting

    Step 02
  3. 03Peak activity

    1 comments in Hour 3

    Hottest window of the conversation

    Step 03
  4. 04Latest activity

    Nov 24, 2025 at 3:15 PM EST

    5h ago

    Step 04

Generating AI Summary...

Analyzing up to 500 comments to identify key contributors and discussion patterns

Discussion (0 comments)

Discussion hasn't started yet.

ID: 46036944Type: storyLast synced: 11/24/2025, 5:58:11 PM

Want the full context?

Jump to the original sources

Read the primary article or dive into the live Hacker News thread when you're ready.

Read ArticleView on HN

Not

Hacker News!

AI-observed conversations & context

Daily AI-observed summaries, trends, and audience signals pulled from Hacker News so you can see the conversation before it hits your feed.

LiveBeta

Explore

  • Home
  • Jobs radar
  • Tech pulse
  • Startups
  • Trends

Resources

  • Visit Hacker News
  • HN API
  • Modal cronjobs
  • Meta Llama

Briefings

Inbox recaps on the loudest debates & under-the-radar launches.

Connect

© 2025 Not Hacker News! — independent Hacker News companion.

Not affiliated with Hacker News or Y Combinator. We simply enrich the public API with analytics.