Product Launch
anonymous
3 points
1 comments
Posted3 months agoActive3 months ago
Show HN: ModernBERT in Pure C
github.comBERTC programmingAI deploymentNatural Language Processing
Discussion (1 comments)
Showing 1 comments
3 months ago
Hey, cool initiative!
Worth mentioning in the title that it's CPU-only: >1200 tokens/s on a single thread is impressive.
Have you considered doing optimization iterations like nanogpt-speedrun? Would be interesting to see how far you can push the performance.