HTML to Image | Not Hacker News!

Discussion (76 comments)

xiaohanyu

10 days ago

2 replies

Maybe webp is a better target than png?

dtagames

10 days ago

1 reply

It's not. JPG, I could live with but please not webp.

Mogzol

10 days ago

2 replies

Why? I assume the intention is to show these images on a webpage somewhere. WebP is well-supported by browsers and can store lossless images at better compression ratios than PNG, so why not use it? I don't think using a lossy format like JPEG makes much sense. JPEG is a fine format for photos, but for HTML content rendered as an image I assume most people would want a lossless format so you don't get artifacts.

kaizenb

10 days ago

Definitely should be WebP.

dtagames

9 days ago

Because it's impossible to use in other tools. Only browsers get it. But I agree about lossy images for text.

benatkin

10 days ago

No, because their domain is png /s

I thought webp would be better for this and checked again just to be sure, and yes, it would be better for this. WebP is quite well supported, albeit not as well supported as png, and it can have significantly smaller file sizes for the same lossless image as png.
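For anyone checking what a rendering service actually hands back, the format can be read from the first few bytes of the file. A minimal sketch, assuming you already have the bytes in memory; the function name and the returned labels are made up for illustration. It only inspects the container header and says nothing about compression ratio.

```python
def sniff_image_format(data: bytes) -> str:
    """Return a coarse format label based on the file's leading bytes."""
    if data.startswith(b"\x89PNG\r\n\x1a\n"):
        return "png"
    if data.startswith(b"\xff\xd8\xff"):
        return "jpeg"
    # WebP is a RIFF container: "RIFF" <4-byte size> "WEBP" <first chunk FourCC>
    if len(data) >= 16 and data[:4] == b"RIFF" and data[8:12] == b"WEBP":
        fourcc = data[12:16]
        if fourcc == b"VP8L":
            return "webp-lossless"
        if fourcc == b"VP8 ":
            return "webp-lossy"
        if fourcc == b"VP8X":
            # Extended layout: the inner chunks decide lossy vs lossless.
            return "webp-extended"
        return "webp"
    return "unknown"
```

A `VP8L` first chunk marks the lossless bitstream discussed above, while `VP8 ` (with a trailing space) marks the lossy one.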

geooff_

10 days ago

1 reply

Very cool. Is there an option to self-host? This seems like it could be a cool agent skill.

threeducks

10 days ago

HTML to PNG:

    chromium --headless --disable-gpu --screenshot=output.png --window-size=1920,1080 --hide-scrollbars index.html

Also works great for HTML to PDF:

    chromium --headless --disable-gpu --no-pdf-header-footer --run-all-compositor-stages-before-draw --print-to-pdf=output.pdf index.html
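The PNG command above can be wrapped for scripting. A minimal sketch: the flags are exactly the ones quoted above, while the function names, the default binary name `chromium`, and the 60-second timeout are assumptions for illustration.

```python
import shutil
import subprocess


def screenshot_command(html_path, png_path, width=1920, height=1080, binary="chromium"):
    """Build the argv list for a headless screenshot (flags as quoted above)."""
    return [
        binary,
        "--headless",
        "--disable-gpu",
        f"--screenshot={png_path}",
        f"--window-size={width},{height}",
        "--hide-scrollbars",
        html_path,
    ]


def take_screenshot(html_path, png_path, **kwargs):
    """Run the command; raises if the browser binary is not on PATH."""
    cmd = screenshot_command(html_path, png_path, **kwargs)
    if shutil.which(cmd[0]) is None:
        raise FileNotFoundError(f"{cmd[0]} not found on PATH")
    # The timeout value is an arbitrary guard against a hung browser process.
    subprocess.run(cmd, check=True, timeout=60)
```

Passing the arguments as a list avoids shell quoting problems when file names contain spaces.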

Retr0id

10 days ago

5 replies

What differentiates production-ready images from regular images?

apeters

10 days ago

2 replies

They are cloud-native, of course.

KellyCriterion

10 days ago

1 reply

Do they also support DeFi or Blockchain then?

aembleton

9 days ago

1 reply

Yes, and AI

KellyCriterion

9 days ago

++1 :-))

good one!!!

yeasku

10 days ago

Is this post a joke?

RadiozRadioz

10 days ago

4 replies

They're bedazzled by a little bit of marketing flair.

Generally I find production-ready images have more synergy and tend to be web-scale. Often they're built from the ground up for AI & are blazing fast, at scale, and empower your team whilst unlocking new possibilities. As my sibling comment suggests, being cloud-native is a crucial factor too.

ludicrousdispla

10 days ago

1 reply

If I need more flair can I embed the image in a new html page and then create another image from that?

kylecazar

9 days ago

I don't need 37 pieces of flair to express myself.

estebarb

9 days ago

1 reply

I'm confused. It was sarcasm?

lima

9 days ago

Hard to say these days!

threecheese

9 days ago

All you had to drop was “web scale”, so much meaning compressed into that :)

4ndrewl

10 days ago

Downvoted for not starting with "Great question!" /s

back2reddit

10 days ago

4 replies

It's not an image—it's an image on the edge.

No cruft. No legacy formats.

Just buttery smooth production readiness.

xgulfie

9 days ago

But are they Blazing Fast (rocket ship emoji)? Are they vibe ready?

andrecarini

10 days ago

Thanks ChatGPT

b0ner_t0ner

10 days ago

> buttery smooth

But buttery bloated if the images don't run OptiPNG before exporting.

fainpul

10 days ago

[delayed]

jsight

9 days ago

It probably means that the text was generated by an AI.

Claude Code loves to say that everything is production ready, even if it doesn't quite compile or pass automated tests yet.

vbezhenar

9 days ago

Production-ready image can be scaled effortlessly both in vertical and horizontal direction.

tbrownaw

10 days ago

2 replies

Playwright behind a web server?

franze

9 days ago

i created an image that self reported the setup, this was the outcome:

# html2png.dev Infrastructure Analysis

## Server Location

| Property | Value |
|----------|-------|
| *IP Address* | `104.28.157.29` |
| *Organization* | *CLOUDFLARENET* |
| *ASN* | AS13335 (Cloudflare) |
| *City* | Narita |
| *Region* | Chiba |
| *Country* | *Japan* |

## Browser Engine

| Property | Value |
|----------|-------|
| *Browser* | Chrome 126.0.0.0 (Headless) |
| *Automation* | Puppeteer/Playwright (`navigator.webdriver: true`) |
| *User Agent* | `Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/126.0.0.0 Safari/537.36` |
| *Platform* | Linux x86_64 |
| *Vendor* | Google Inc. |

## Graphics Rendering

| Property | Value |
|----------|-------|
| *WebGL Renderer* | ANGLE (Google, Vulkan 1.3.0 SwiftShader Device (Subzero)) |
| *WebGL Vendor* | Google Inc. (Google) |
| *GPU Type* | SwiftShader (software GPU - no real GPU!) |
| *Color Depth* | 16 bit |

## Server Hardware

| Property | Value |
|----------|-------|
| *CPU Cores* | 4 |
| *Device Memory* | Hidden (containerized) |
| *Virtual Screen* | 1024x768 @ 2x DPR |
| *Outer Window* | 500x88 (headless indicator) |
| *Max Touch Points* | 0 |
| *Languages* | en-US |

## Automation Detection Signals

| Property | Value |
|----------|-------|
| *navigator.webdriver* | `true` ← Puppeteer/Playwright! |
| *window.chrome* | `object` (exists) |
| *chrome.runtime* | `undefined` |
| *chrome.app* | `object` |
| *chrome.csi* | `function` |
| *chrome.loadTimes* | `function` |
| *plugins.length* | 5 |

## Full Tech Stack

```
┌─────────────────────────────────────────────────────────────┐
│                     html2png.dev Stack                      │
├─────────────────────────────────────────────────────────────┤
│  CDN/Proxy:     Cloudflare (AS13335)                        │
│  Edge:          Cloudflare Japan (Narita, Chiba)            │
│  Frontend:      Nuxt.js 3 (Vue.js)                          │
│  Backend:       Node.js API                                 │
│  Renderer:      Puppeteer/Playwright                        │
│  Browser:       Headless Chrome 126                         │
│  GPU:           SwiftShader (software rendering)            │
│  Platform:      Linux x86_64 container (4 cores)            │
│  Storage:       Ephemeral blob storage                      │
├─────────────────────────────────────────────────────────────┤
│  Likely Host:   Cloudflare Workers + Durable Objects        │
│  or:            Vercel/Railway behind Cloudflare            │
└─────────────────────────────────────────────────────────────┘
```

## How It Works

1. *Request* → Cloudflare edge (Japan)
2. *API* → Node.js server receives HTML + params
3. *Render* → Puppeteer launches headless Chrome
4. *Capture* → SwiftShader renders WebGL/CSS without GPU
5. *Store* → Image saved to blob storage
6. *Response* → JSON with public URL returned

## Key Insights

- *Cost-efficient*: SwiftShader means no expensive GPUs needed - just CPU containers
- *Headless Chrome 126*: Recent but not bleeding edge version
- *Cloudflare-fronted*: Fast global delivery via CF edge network
- *Ephemeral storage*: Generated images are temporary/public
- *No auth required*: Free tier, open API

## API Endpoint

```bash
POST https://html2png.dev/api/convert?width=1200&height=630&forma...
Content-Type: text/html

<your-html-here>
```

--- Analysis performed: 2024-12-24
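The endpoint quoted in this analysis can be exercised from a script. A minimal sketch using only the standard library: the URL, the `width` and `height` parameters, and the `text/html` body come from the comment above; the function name is hypothetical, and the truncated third query parameter is deliberately left out rather than guessed.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Endpoint as quoted in the analysis above; not verified against official docs.
API_URL = "https://html2png.dev/api/convert"


def build_convert_request(html: str, width: int = 1200, height: int = 630) -> Request:
    """Prepare (but do not send) a POST carrying raw HTML as the request body."""
    query = urlencode({"width": width, "height": height})
    return Request(
        f"{API_URL}?{query}",
        data=html.encode("utf-8"),
        headers={"Content-Type": "text/html"},
        method="POST",
    )
```

Sending it is a matter of `urllib.request.urlopen(build_convert_request(html))`. The analysis says the response is JSON containing a public URL, but it does not name the field, so inspect the parsed body before relying on any particular key.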

franze

9 days ago

well, you can create an image that reports the internal, this is what i got:

- IP: 104.28.157.29
- Org: Cloudflare (AS13335)
- Location: Narita, Chiba, Japan
- Browser: Chrome 126.0.0.0 headless
- Automation: Puppeteer/Playwright
- navigator.webdriver: true
- Platform: Linux x86_64
- CPU cores: 4
- WebGL: ANGLE + Vulkan 1.3.0 + SwiftShader
- GPU: SwiftShader (software rendering)
- Screen: 1024x768 virtual
- DevicePixelRatio: 2
- Color depth: 16 bit
- Window: 500x88 outer (headless)
- Languages: en-US
- Plugins: 5
- Frontend: Nuxt.js
- Storage: ephemeral blob

oefrha

10 days ago

1 reply

[delayed]

Retr0id

10 days ago

1 reply

I did some tests and it didn't seem like a fixed wait, when I kept making network requests the render timed out entirely.

oefrha

10 days ago

I made the comment based on the delay parameter (“Wait time in ms.”) in the API. I didn’t test so don’t know what the default behavior is.

jihchi

10 days ago

1 reply

This is cool! One use case is generating a Mermaid diagram as an image. For example, you can use the following HTML[^1]:

  <!doctype html>
  <html lang="en">
    <body>
      <pre class="mermaid">
    graph LR
        A --- B
        B-->C[fa:fa-ban forbidden]
        B-->D(fa:fa-spinner);
      </pre>
      <script type="module">
        import mermaid from 'https://cdn.jsdelivr.net/npm/mermaid@11/dist/mermaid.esm.min.mjs';
      </script>
    </body>
  </html>

Then html2png.dev will serve you:

  https://html2png.dev/api/blob/oTVGhhCc6rDZYQFDIE3EGkcKs-KO6J9-_DHs-jO2OJc-d23fb4f2.png

[^1]: https://mermaid.js.org/config/usage.html#simple-full-example

JimDabell

10 days ago

1 reply

Why wouldn’t you just use Mermaid to generate the PNG directly?

Garlef

10 days ago

1 reply

One reason I could think of: Fewer dependencies that need integration

JimDabell

10 days ago

1 reply

By introducing a dependency on a third-party service with no SLA? This seems to make the dependency situation worse.

mcny

9 days ago

Ah haha. I love this conversation of trying to find a product market fit in public.

What if the input to the JavaScript (Mermaid in this case) is not trusted to run on the end client machines, but running that untrusted input in a sandbox (this service, or self-hosted, idk) is somehow acceptable, and the output, a blob of an image, is acceptable to display on the actual client machines?

It takes the planets aligning just right and needs us to squint just enough, but I think we can find something if we look hard enough.

But then mermaid can simply output PNG so you could run it as a worker... Thinking...

reassess_blind

10 days ago

2 replies

I thought this was satire. Usually you want to go from image to HTML, not the other way around. I suppose it does have its uses, though.

spiderfarmer

9 days ago

So why comment?

devmor

10 days ago

It certainly does, that's why it's been a common dev tool for a bit over 20 years. I'm not really sure what the point of OP making it a web app is, though.

me_bx

10 days ago

6 replies

Congrats on launching, beautiful design.

I'm not sure of what "production ready" is supposed to mean here, but the demo image is not optimized, `optipng` command decreases its size by 53.21%.

the_arun

9 days ago

1 reply

Curious - How did you find the image is not optimized? Is there a tool to find it?

me_bx

9 days ago

I ran the command 'optipng' on the generated image, which recompresses the image optimally, keeping quality and decreasing file size.

threecheese

9 days ago

IME it’s a term that’s been popularized by generative AI solutions, a meme at this point, and doesn’t speak to real production readiness quantifiably (professionally). It’s something that I’ve seen models frequently claim during coding and planning sessions, and it can also be found around Reddit/Twitter/Github vibe coding spaces.

Seeing this term in marketing materials signals that the target audience is non-professionals (and I don’t mean this derisively, only that we need to apply a different lens).

alvinunreal

9 days ago

Thank you. Can add png compression too right.

derefr

9 days ago

Given this text at the bottom:

> The high-performance HTML to PNG engine. Built for developers, agents, and automation. Completely free to use. All generated assets are public and ephemeral.

...I assume the implications are that:

1. this service will scale to meet request load without QoS degradation (i.e. it's probably running on FaaS infra), rather than being a fixed-size slowly-elastic cluster that would get choked out if your downstream service got popular and flooded it with thousands of concurrent requests

2. you can directly take the URLs the service spits out, and serve them to your downstream service's clients, without worrying much about deliverability, because there's an object store + edge CDN involved.

In other words, it's not just a single headless-chromium instance running on a box somewhere; you could actually use this thing as an upstream dependency and rely on it.

> the demo image is not optimized, `optipng` command decreases its size by 53.21%

Given that the author's imagined use-case is giving non-multimodal LLMs a way to emit visuals (the prompt at the bottom of the page starts "When asked to create visuals, charts, or mockups"), I think their idea is that the resulting rendered images would more-likely-than-not only be requested once, immediately, to display the result to the same user who caused the prompt to be evaluated.

Where, in that case, the metric of concern isn't "time+bytes cost for each marginal fetch of the resulting image from the CDN"; but rather "end-to-end wall-clock time required to load the HTML in the headless browser, bake the image, push it to the object store, and serve it once to the requesting user."

OptiPNG would slightly lower that last "serve it once" cost, but massively inflate the "bake the image" time, making it not worth it.

(I suppose they could add image optimization as something you could turn on — but "image optimization at the edge" is already a commodity product you can get from numerous vendors, e.g. Cloudflare.)
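The trade-off described in this comment can be put as a break-even count: how many times an image must be fetched before the time spent optimizing it is repaid by faster transfers. A rough sketch under a deliberately simple model; the 53.21% figure is from the thread, while the file size, optimization time, and bandwidth below are hypothetical placeholders, not measurements.

```python
def break_even_fetches(optimize_seconds: float, bytes_saved: float,
                       bandwidth_bytes_per_s: float) -> float:
    """Fetches needed before one-off optimization time is repaid by transfer savings."""
    seconds_saved_per_fetch = bytes_saved / bandwidth_bytes_per_s
    return optimize_seconds / seconds_saved_per_fetch


if __name__ == "__main__":
    image_bytes = 400_000        # hypothetical unoptimized size
    saving = 0.5321              # reduction reported in the thread
    optimize_seconds = 2.0       # hypothetical optimization time
    bandwidth = 5_000_000        # hypothetical client bandwidth, bytes per second
    print(break_even_fetches(optimize_seconds, image_bytes * saving, bandwidth))
```

With placeholder inputs like these the break-even lands at dozens of fetches, which is the shape of the argument: for an image requested once, the optimization time is never recovered.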

spiderfarmer

9 days ago

The bots using these images apply their own compression anyway.

kristopolous

10 days ago

also don't ignore webp and avif ... those can really do wonders.

rognjen

10 days ago

2 replies

It's nice looking for sure but much more complex than using `wkhtmltox` with `pngquant`, `optipng` and/or ImageMagick `convert` locally - esp. since the learning curve seems to be about equivalent.

krick

10 days ago

Yeah, I thought that as well. So I was wondering if that's some kind of a joke, or maybe modern html is so fucked up that all usual solutions became obsolete since the last time I did that.

mewpmewp2

9 days ago

Won't you need to install extra libraries for these?

randoments

10 days ago

1 reply

What is the use case for requiring this?

mattrighetti

10 days ago

Dynamic og:image generator could be a use case.

Think of the GitHub thumbnails where the PR number changes constantly and has to be reflected on the image preview
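A dynamic preview card like that is just a small HTML template filled in per request and then rendered to an image. A minimal sketch, assuming a 1200x630 card (the size used in the API example elsewhere in this thread); the function name, markup, and styling are invented for illustration.

```python
from html import escape


def og_card_html(repo: str, title: str, pr_number: int) -> str:
    """Return a standalone HTML page sized for a 1200x630 preview card."""
    # escape() keeps user-supplied titles from injecting markup into the card.
    return f"""<!doctype html>
<html lang="en">
  <body style="margin:0;width:1200px;height:630px;font-family:sans-serif">
    <h1>{escape(title)} <span>#{pr_number}</span></h1>
    <p>{escape(repo)}</p>
  </body>
</html>"""
```

The resulting string can be fed to any HTML-to-image renderer, local or hosted, whenever the PR number or title changes.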

onion2k

9 days ago

1 reply

Alternatively, open devtools, press ctrl+shift+p (or cmd+shift+p on a Mac) to open the command palette, search for 'screenshot' and choose 'Capture full size screenshot' to do the same thing in your browser. There's 'area screenshot' for selecting an area, 'screenshot' for the viewport, and even 'node screenshot' for capturing the selected DOM node.

spiderfarmer

9 days ago

1 reply

Yeah and you can also take a picture with your phone. Or draw with pencil and paper.

alvinunreal

9 days ago

I just take pictures with my instant polaroid

thatgerhard

9 days ago

1 reply

This must be the hardest way to create an image online ever invented.

stronglikedan

9 days ago

but it's "high-performance"!

xnx

9 days ago

4 replies

Sharing in case anyone isn't familiar with this built-in capability:

    google-chrome --headless --screenshot=my_screenshot.png https://www.example.com

stronglikedan

9 days ago

2 replies

This is for the dozen or so people that don't have Chrome installed.

nabeards

9 days ago

1 reply

I personally haven't had Chrome installed anywhere in years. I think there are more than a dozen of us!

DemocracyFTW2

9 days ago

(checks list) --Mh, yah so, you've got a point there (scribbles, smiles, extends hand) --Welcome dear Sir or Madam, or, as we will call you, Number Thirteen!

thekevan

9 days ago

I've had this PC for over 2 years and did not realize I didn't have Chrome installed until Google's Antigravity prompted me to install it for its agent.

So it's installed now but still un-personalized like it was installed 5 minutes ago. I don't use it except with Antigravity.

Lord_Zero

9 days ago

1 reply

It looks like this app has helpful functions for size, format, and transparency that you can't do all at once with the built-in Chrome command without probably piping it through ImageMagick or something. And even then maybe this site renders the HTML responsively before rasterizing.

fuzzy2

9 days ago

Actually Chrome does everything. Not via command-line switches however, you need to use the DevTools protocol. For example using Playwright. You get PDFs and PNGs of any size and resolution (PPI).

And I guess this is exactly what this service does under the hood.
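For a sense of what "use the DevTools protocol" looks like on the wire, here is a minimal sketch that only builds the JSON command frames for size, scale, transparency, and capture. `Emulation.setDeviceMetricsOverride`, `Emulation.setDefaultBackgroundColorOverride`, and `Page.captureScreenshot` are real protocol methods; the helper names are made up, and actually sending the frames needs a WebSocket connection to a page target, which tools like Playwright or Puppeteer handle for you.

```python
import json
from itertools import count

_ids = count(1)  # each command needs a unique id so replies can be matched


def cdp(method: str, **params) -> str:
    """Serialize one DevTools Protocol command as a JSON text frame."""
    return json.dumps({"id": next(_ids), "method": method, "params": params})


def screenshot_commands(width, height, scale=2, fmt="png", transparent=False):
    """Return the command frames to send, in order, before reading the reply."""
    msgs = [cdp("Emulation.setDeviceMetricsOverride",
                width=width, height=height, deviceScaleFactor=scale, mobile=False)]
    if transparent:
        # An alpha of 0 keeps the page background clear in the capture.
        msgs.append(cdp("Emulation.setDefaultBackgroundColorOverride",
                        color={"r": 0, "g": 0, "b": 0, "a": 0}))
    msgs.append(cdp("Page.captureScreenshot", format=fmt))
    return msgs
```

The reply to `Page.captureScreenshot` carries the image as a base64 string in its `data` field. Whether the service discussed here works this way is only a guess, as the comment above says.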

remify

9 days ago

At work we use this feature. A lot of the time we need to do some kind of PDF reporting. We build the reports as HTML pages and print them as PDF.

Works fine.

threecheese

9 days ago

But is this “production-ready”?

donohoe

9 days ago

1 reply

“Free”?

What’s the catch, or how can I be sure it will still be around in 3 months?

No snark, genuinely curious as I would use this if I could count on it.

leptons

9 days ago

The only tech you can trust to be around is the tech you control. And even then it's still a bit iffy if you didn't write all the code yourself and you host it on someone else's servers.

dom96

9 days ago

Why HTML? Why not SVG?

I created an svg to png API to generate open graph images a while back. It works pretty well and can be hosted on Cloudflare Workers for free.

https://github.com/dom96/svg-renderer

navhc

9 days ago

Crazy to steal the name of a node package that does the same thing which has been around over a decade, but anything for SEO right?

jumploops

10 days ago

Love the simplicity and “Not MCP” callout (:

Not that it matters, but curious what percentage of this service was “vibe-coded”?

agentifysh

10 days ago

that "Not MCP" is so refreshing it makes me laugh out loud

it's literally what i've been saying all along when I came across mcp "why can't i just give agent a prompt and it will run the rest api calls for me"

there are still some MCPs which make sense but we have it for literally everything when just a prompt will do the job!

now on the topic of html2png i do wonder is this like the self-hostable version on github https://github.com/maranemil/HTML2Png where they use canvas? or is this something else ?

RyanShook

10 days ago

Nice! It definitely makes you wonder when is MCP actually needed vs just giving the LLM API calls to work with.

novoreorx

8 days ago

microlink's cards [1], which I discovered years ago, has similar functionality, and microlink itself [2] is much more sophisticated in leveraging headless Chrome.

[1]: https://github.com/microlinkhq/cards

[2]: https://microlink.io/

scosman

9 days ago

Looks great for opengraph images.

Yash16

9 days ago

What’s the purpose of creating this?

eastoeast

10 days ago

This is a great idea. I can’t believe I didn’t think of this, given I generate and screenshot so many “poster images” in html just like this. Haven’t played around a ton but seems intuitive. Nice work!

chevman

10 days ago

Any similar AI based services/agents that can take images/creative assets (eg Figma, Sketch, Adobe PS, etc files) and create production-ready emails and landing pages in HTML?

_august

8 days ago

this is very handy, thanks!

WilcoKruijer

10 days ago

I’ve been doing this manually by having a static development-only route on my website and taking a “node screenshot” using the Chrome developer tools. This is definitely a better way, well done!

thih9

9 days ago

Does it support inline JS?

albert_e

9 days ago

This looks interesting though niche -- am yet to think of a compelling use case.

I am sure @simonw has some ideas :) -- he recently blogged about HTML tools, which is also one of my favorite use cases for LLMs.

Maybe similar to SVG generation, this could be a more powerful / flexible way to generate complex images / screen mockups and the like on-the-fly.
