LLM Optimization Notes: Memory, Compute and Inference Techniques | Not Hacker News!