Not

Hacker

News!

Not

Hacker

News!

AI-observed conversations & context

Daily AI-observed summaries, trends, and audience signals pulled from Hacker News so you can see the conversation before it hits your feed.

LiveBeta

Explore

Home
Hiring
Products
Companies
Discussion
Q&A
Privacy Policy

Resources

Visit Hacker News
HN API
Modal cronjobs
Meta Llama

Briefings

Inbox recaps on the loudest debates & under-the-radar launches.

Connect

© 2026 Not Hacker News! — independent Hacker News companion.

Not affiliated with Hacker News or Y Combinator. We simply enrich the public API with analytics.

LLM Optimization | Trending Topic on Hacker News | Not Hacker News!

Not

Hacker

News!

Home
Discussion
LLM Optimization

LLM Optimization

19 stories

•

24h: 0%

•

7d: 0

•

298 comments

Top contributors:tdchaitanya blndrt anuarsh JnBrymn che_shr_cat

Stories

Related Stories

19 stories tagged with llm optimization

Adaptive LLM Routing Under Budget Constraints

20678 commentsby tdchaitanya

Posted4 months agoActiveabout 2 months ago

Tau² Benchmark: How a Prompt Rewrite Boosted GPT-5-Mini by 22%

19765 commentsby blndrt

Posted4 months agoActiveabout 2 months ago

Run Qwen3-Next-80b on 8gb GPU at 1tok/2s Throughput

12317 commentsby anuarsh

Posted4 months agoActiveabout 2 months ago

Don't Build Multi-Agents

12389 commentsby JnBrymn

Posted4 months agoActiveabout 2 months ago

Deepconf: Scaling LLM Reasoning with Confidence, Not Just Compute

9835 commentsby che_shr_cat

Posted5 months ago

Chunkllm: a Lightweight Pluggable Framework for Accelerating Llms Inference

968 commentsby PaulHoule

Posted3 months agoActiveabout 2 months ago

Efficient Llm:bandwidth, Compute, Synchronization, and Capacity Are All You Need

60 commentsby matt_d

Posted3 months agoActiveabout 1 month ago

The Alien Artifact: Dspy and the Cargo Cult of LLM Optimization

30 commentsby valgaze

Posted3 months agoActiveabout 2 months ago

LLM-Use – an LLM Router That Chooses the Right Model for Each Prompt

32 commentsby justvugg

Posted3 months agoActiveabout 2 months ago

Cutting LLM Batch Inference Time by Half with Dynamic Prefix Bucketing

20 commentsby DISCURSIVE

Postedabout 2 months agoActiveabout 2 months ago

Kv Marketplace – Share LLM Attention Caches Across Gpus Like Memcached

21 commentsby nsomani

Postedabout 2 months agoActiveabout 2 months ago

Un-Locc Reduce LLM API Costs by Compressing Text Into Images

20 commentsby MaxDevv

Posted2 months agoActiveabout 2 months ago

Prompt Refiner – Lightweight Python Lib to Clean and Compress LLM Input

11 commentsby xinghaohuang

Postedabout 1 month agoActiveabout 1 month ago

Fixing LLM Memory Degradation in Long Coding Sessions

10 commentsby robertomisuraca

Postedabout 2 months agoActiveabout 1 month ago

Dspy on a Pi: Cheap Prompt Optimization with Gepa and Qwen3

10 commentsby lsb

Postedabout 2 months agoActiveabout 2 months ago

A Self-Tuning Open-Source Agent for LLM KPI Optimization

10 commentsby dan-carp-builds

Posted3 months agoActiveabout 2 months ago

LLM Optimization Notes: Memory, Compute and Inference Techniques

10 commentsby gmays

Posted3 months agoActiveabout 2 months ago

Modular, LLM-Optimized Openapi Docs – Deterministic Urls

10 commentsby saransh_01

Posted4 months agoActiveabout 2 months ago

My Friend Says He Has an AI Optimized Language

12 commentsby samweb3

Posted4 months agoActiveabout 2 months ago

Not

Hacker

News!

AI-observed conversations & context

Daily AI-observed summaries, trends, and audience signals pulled from Hacker News so you can see the conversation before it hits your feed.

LiveBeta

Explore

Home
Hiring
Products
Companies
Discussion
Q&A
Privacy Policy

Resources

Visit Hacker News
HN API
Modal cronjobs
Meta Llama

Briefings

Inbox recaps on the loudest debates & under-the-radar launches.

Connect

© 2026 Not Hacker News! — independent Hacker News companion.

Not affiliated with Hacker News or Y Combinator. We simply enrich the public API with analytics.