Not
Hacker
News
!
Home
Hiring
Products
Discussion
Q&A
Users
Cutting LLM Batch Inference Time in Half: Dynamic Prefix Bucketing at Scale | Not Hacker News!