Not
Hacker
News
!
Home
Hiring
Products
Companies
Discussion
Q&A
Users
LLM Optimization | Trending Topic on Hacker News | Not Hacker News!
Not
Hacker
News
!
Home
Hiring
Products
Companies
Discussion
Q&A
Users
Home
/
Discussion
/
LLM Optimization
Back to Discussion
LLM Optimization
Loading...
17 stories
•
24h:
0%
•
7d: 0
•
297 comments
Top contributors:
tdchaitanya
blndrt
JnBrymn
anuarsh
che_shr_cat
Stories
Related Stories
17 stories tagged with llm optimization
Adaptive LLM Routing Under Budget Constraints
206
78 comments
by tdchaitanya
•
2mo ago
LLM optimization
cost reduction
AI research
Tau² Benchmark: How a Prompt Rewrite Boosted GPT-5-Mini by 22%
197
65 comments
by blndrt
•
2mo ago
LLM optimization
prompt engineering
AI benchmarking
Don't Build Multi-Agents
123
89 comments
by JnBrymn
•
2mo ago
AI development
multi-agent systems
LLM optimization
Run Qwen3-Next-80b on 8gb GPU at 1tok/2s Throughput
123
17 comments
by anuarsh
•
2mo ago
LLM optimization
GPU limitations
AI model deployment
Deepconf: Scaling LLM Reasoning with Confidence, Not Just Compute
98
35 comments
by che_shr_cat
•
3mo ago
LLM optimization
AI research
compute efficiency
Chunkllm: a Lightweight Pluggable Framework for Accelerating Llms Inference
96
8 comments
by PaulHoule
•
1mo ago
LLM Optimization
AI Inference
Machine Learning Frameworks
Efficient Llm:bandwidth, Compute, Synchronization, and Capacity Are All You Need
6
0 comments
by matt_d
•
1mo ago
LLM optimization
AI research
Machine Learning
The Alien Artifact: Dspy and the Cargo Cult of LLM Optimization
3
0 comments
by valgaze
•
1mo ago
LLM Optimization
DSPy
AI Development
LLM-Use – an LLM Router That Chooses the Right Model for Each Prompt
3
2 comments
by justvugg
•
1mo ago
LLM optimization
cost reduction
AI routing
Cutting LLM Batch Inference Time by Half with Dynamic Prefix Bucketing
2
0 comments
by DISCURSIVE
•
6d ago
LLM optimization
batch inference
dynamic prefix bucketing
Un-Locc Reduce LLM API Costs by Compressing Text Into Images
2
0 comments
by MaxDevv
•
21d ago
LLM optimization
text compression
AI cost reduction
Kv Marketplace – Share LLM Attention Caches Across Gpus Like Memcached
2
1 comments
by nsomani
•
14d ago
LLM optimization
GPU computing
distributed systems
Modular, LLM-Optimized Openapi Docs – Deterministic Urls
1
0 comments
by saransh_01
•
2mo ago
API documentation
LLM optimization
software development
Dspy on a Pi: Cheap Prompt Optimization with Gepa and Qwen3
1
0 comments
by lsb
•
8d ago
DSPy
Raspberry Pi
LLM optimization
My Friend Says He Has an AI Optimized Language
1
2 comments
by samweb3
•
2mo ago
artificial intelligence
query languages
LLM optimization
LLM Optimization Notes: Memory, Compute and Inference Techniques
1
0 comments
by gmays
•
1mo ago
LLM Optimization
Distributed Machine Learning
AI Inference Techniques
A Self-Tuning Open-Source Agent for LLM Kpi Optimization
1
0 comments
by dan-carp-builds
•
1mo ago
LLM optimization
open-source
AI agents