Show HN: Optimizing DeepSeek's NSA for TPUs – A Kernel Worklog | Not Hacker News!